DOI QR코드

DOI QR Code

Novel Intent based Dimension Reduction and Visual Features Semi-Supervised Learning for Automatic Visual Media Retrieval

  • Received : 2022.06.05
  • Published : 2022.06.30

Abstract

Sharing of online videos via internet is an emerging and important concept in different types of applications like surveillance and video mobile search in different web related applications. So there is need to manage personalized web video retrieval system necessary to explore relevant videos and it helps to peoples who are searching for efficient video relates to specific big data content. To evaluate this process, attributes/features with reduction of dimensionality are computed from videos to explore discriminative aspects of scene in video based on shape, histogram, and texture, annotation of object, co-ordination, color and contour data. Dimensionality reduction is mainly depends on extraction of feature and selection of feature in multi labeled data retrieval from multimedia related data. Many of the researchers are implemented different techniques/approaches to reduce dimensionality based on visual features of video data. But all the techniques have disadvantages and advantages in reduction of dimensionality with advanced features in video retrieval. In this research, we present a Novel Intent based Dimension Reduction Semi-Supervised Learning Approach (NIDRSLA) that examine the reduction of dimensionality with explore exact and fast video retrieval based on different visual features. For dimensionality reduction, NIDRSLA learns the matrix of projection by increasing the dependence between enlarged data and projected space features. Proposed approach also addressed the aforementioned issue (i.e. Segmentation of video with frame selection using low level features and high level features) with efficient object annotation for video representation. Experiments performed on synthetic data set, it demonstrate the efficiency of proposed approach with traditional state-of-the-art video retrieval methodologies.

Keywords

References

  1. Ambareesh Ravi, Amith Nandakumar,"A multimodal deep learning framework for scalable content based visual media retrieval", arXiv:2105.08665v1 [cs.LG] 18 May 2021.
  2. Wei Weng, Yan-Nan Chen, Chin-Ling Chen, Shun-Xiang Wu, Jing-Hua Liu, Non-sparse Label Specific Features Selection for Multi-label Classification, Neurocomputing (2019),doi: https://doi.org/10.1016/j.neucom.2019.10.016.
  3. Jianghong Ma, Tommy W.S. Chow, "Label-specific feature selection and two-level label recovery for multi-label classification with missing labels", https://doi.org/10.1016/j.neunet.2019.04.0110893-6080/© 2019 Elsevier Ltd. All rights reserved.
  4. Gong, C., Tao, D., Liu, W., Liu, L., & Yang, J. (2017). Label propagation via teachingto-learn and learning-to-teach. IEEE Transactions on Neural Networks and Learning Systems, 28(6), 1452-1465. http://dx.doi.org/10.1109/TNNLS.2016.2514360.
  5. Gong, C., Tao, D., Maybank, S. J., Liu, W., Kang, G., & Yang, J. (2016). Multimodal curriculum learning for semi-supervised image classification. IEEE Transactions on Image Processing, 25(7), 3249-3260. http://dx.doi.org/10.1109/TIP.2016.2563981.
  6. Huang, J., Qin, F., Zheng, X., Cheng, Z., Yuan, Z., & Zhang, W. (2018). Learning label-specific features for multi-label classification with missing labels. In 2018 IEEE fourth international conference on multimedia big data (BigMM) (pp. 1-5). http://dx.doi.org/10.1109/BigMM.2018.8499080.
  7. Zhang, Z., Li, F., Jia, L., Qin, J., Zhang, L., & Yan, S. (2018). Robust adaptive embedded label propagation with weight learning for inductive classification. IEEE Transactions on Neural Networks and Learning Systems, 29(8), 3388-3403. http://dx.doi.org/10.1109/TNNLS.2017.2727526.
  8. Zhang, Z., Zhang, Y., Li, F., Zhao, M., Zhang, L., & Yan, S. (2017). Discriminative sparse flexible manifold embedding with novel graph for robust visual representation and label propagation. Pattern Recognition, 61, 492-510. http://dx.doi.org/10.1016/j.patcog.2016.07.042
  9. Zhang, R., Nie, F., & Li, X. (2018). Self-weighted supervised discriminative feature selection. IEEE Transactions on Neural Networks and Learning Systems, 29(8), 3913-3918. http://dx.doi.org/10.1109/TNNLS.2017.2740341.
  10. Zhang, Z., Jia, L., Zhao, M., Liu, G., Wang, M., & Yan, S. (2018). Kernelinduced label propagation by mapping for semi-supervised classification. IEEE Transactions on Big Data.
  11. Pang, T., Nie, F., Han, J., & Li, X. (2019). Efficient feature selection via l2,0- norm constrained sparse regression. IEEE Transactions on Knowledge and Data Engineering, 1. http://dx.doi.org/10.1109/TKDE.2018.2847685
  12. Liu, H., Li, X., & Zhang, S. (2017). Learning instance correlation functions for multilabel classification. IEEE Transactions on Cybernetics, 47(2), 499-510. http://dx.doi.org/10.1109/TCYB.2016.2519683.
  13. Ma, J., & Chow, T. W. (2018a). Robust non-negative sparse graph for semisupervised multi-label learning with missing labels. Information Sciences, 422(Supplement C), 336-351. http://dx.doi.org/10.1016/j.ins.2017.08.061.
  14. Ma, J., & Chow, T. W. S. (2018b). Topic-based algorithm for multilabel learning with missing labels. IEEE Transactions on Neural Networks and Learning Systems, 1-15. http://dx.doi.org/10.1109/TNNLS.2018.2874434.
  15. Ma, J., Tian, Z., Zhang, H., & Chow, T. W. (2017). Multi-label low-dimensional embedding with missing labels. Knowledge-Based Systems, 137(Supplement C), 65-82. http://dx.doi.org/10.1016/j.knosys.2017.09.005.
  16. Bandeira, A.S.;Mixon,D.G.;Recht, B.:Compressive classification and the rare eclipse problem. arXiv:1404.3203, 2014.
  17. Oymak, S.; Recht, B.:Near-optimal bounds for binary embeddings of arbitrary sets. arXiv:1512.04433, 2015.
  18. Li, M.; Rane, S.; Boufounos, P.: Quantized embeddings of scaleinvariant image features for mobile augmented reality, in IEEE Int. Workshop on Multimedia Signal Processing, Banff, Canada, 17-19 September 2012, 1-6.
  19. Jacques, L.: Smallwidth, lowdistortions: quasi-isometric embeddings with quantized sub-Gaussian randomprojections. arXiv:1504.06170, 2015.
  20. J. Liu, Y. Lin, , Y. Li, W. Weng, S. Wu, Online multi-label streaming feature selection based on neighborhood rough set, Pattern Recognition. 84 (2018) 273o287. https://doi.org/10.1016/j.patcog.2018.07.021
  21. J. Lee, W. Seo, J. H. Park and D. W. Kim, Compact feature subset-based multi-label music categorization for mobile devices, Multimedia Tools & Applications. (2018) 1-15.
  22. L. Liu, L. Tang, L. He, S. Yao and W. Zhou, Predicting protein function via multi-label supervised topic model on gene ontology, Biotechnology & Biotechnological Equipment. 31(1) (2017) 1-9. https://doi.org/10.1080/13102818.2016.1258330
  23. Y. Lin, Q. Hu, J. Liu and D. Jie, Multi-label feature selection based on max-dependency and min-redundancy, Neurocomputing. 168 (2015) 92-103. https://doi.org/10.1016/j.neucom.2015.06.010
  24. W. Weng, Y. Lin, S. Wu, Y. Li and Y. Kang, Multi-label learning based on label specific features and local pairwise label correlation, Neurocomputing. 273 (2018) 385o394. https://doi.org/10.1016/j.neucom.2017.07.044
  25. Shiv Ram Dubey. A decade survey of content based image retrieval using deep learning. arXiv preprint arXiv:2012.00641, 2020.
  26. Afshan Latif, Aqsa Rasheed, Umer Sajid, Jameel Ahmed, Nouman Ali, Naeem Iqbal Ratyal, Bushra Zafar, Saadat Hanif Dar, Muhammad Sajid, and Tehmina Khalil. Content-based image retrieval and feature extraction: a comprehensive review. Mathematical Problems in Engineering, 2019, 2019.
  27. Yumeng Liu and Aina Sui. Research on feature dimensionality reduction in content based public cultural video retrieval. In 2018 IEEE/ACIS 17th International Conference on Computer and Information Science (ICIS),pages 718-722. IEEE, 2018.
  28. Subhadip Maji and Smarajit Bose. Cbir using features derived by deep learning. arXiv preprint arXiv:2002.07877, 2020.
  29. Antoine Miech, Jean-Baptiste Alayrac, Lucas Smaira, Ivan Laptev, Josef Sivic, and Andrew Zisserman. End-to-end learning of visual representations from uncurated instructional videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9879-9889, 2020.
  30. Alexey Potapov, Innokentii Zhdanov, Oleg Scherbakov, Nikolai Skorobogatko, Hugo Latapie, and Enzo Fenoglio. Semantic image retrieval by uniting deep neural networks and cognitive architectures. In International Conference on Artificial General Intelligence, pages 196-206.Springer, 2018.
  31. Mohsen Ramezani and Farzin Yaghmaee. Retrieving human action by fusing the motion information of interest points. International Journal on Artificial Intelligence Tools, 27(03):1850008, 2018. https://doi.org/10.1142/S0218213018500082
  32. Hiroki Tanioka. A fast content-based image retrieval method using deep visual features. In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), volume 5, pages 20-23. IEEE,2019.
  33. and Fang Huang. Cnn-vwii: An efficient approach for large-scale video retrieval by image queries. Pattern Recognition Letters, 123:82-88, 2019. https://doi.org/10.1016/j.patrec.2019.03.015
  34. Wengang Zhou, Houqiang Li, Jian Sun, and Qi Tian. Collaborative index embedding for image retrieval. IEEE transactions on pattern analysis and machine intelligence, 40(5):1154-1166, 2017. https://doi.org/10.1109/TPAMI.2017.2676779
  35. Lei Zhu, Jialie Shen, Liang Xie, and Zhiyong Cheng. Unsupervised visual hashing with semantic assistant for content-based image retrieval. IEEE Transactions on Knowledge and Data Engineering, 29(2):472-486, 2016. https://doi.org/10.1109/TKDE.2016.2562624
  36. Mohammadreza Zolfaghari, Kamaljeet Singh, and Thomas Brox. Eco: Efficient convolutional network for online video understanding. In Proceedings of the European conference on computer vision (ECCV),pages 695-712, 2018.