[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3837/tiis.2016.02.018

A Salient Based Bag of Visual Word Model (SBBoVW): Improvements toward Difficult Object Recognition and Object Location in Image Retrieval

Mansourian, Leila (University Putra Malaysia, Faculty of Computer Science and Information Technology, Department of Multimedia, UPM Serdang)
Abdullah, Muhamad Taufik (University Putra Malaysia, Faculty of Computer Science and Information Technology, Department of Multimedia, UPM Serdang)
Abdullah, Lilli Nurliyana (University Putra Malaysia, Faculty of Computer Science and Information Technology, Department of Multimedia, UPM Serdang)
Azman, Azreen (University Putra Malaysia, Faculty of Computer Science and Information Technology, Department of Multimedia, UPM Serdang)
Mustaffa, Mas Rina (University Putra Malaysia, Faculty of Computer Science and Information Technology, Department of Multimedia, UPM Serdang)

Publication Information

KSII Transactions on Internet and Information Systems (TIIS) / v.10, no.2, 2016 , pp. 769-786 More about this Journal

Abstract

Object recognition and object location have always drawn much interest. Also, recently various computational models have been designed. One of the big issues in this domain is the lack of an appropriate model for extracting important part of the picture and estimating the object place in the same environments that caused low accuracy. To solve this problem, a new Salient Based Bag of Visual Word (SBBoVW) model for object recognition and object location estimation is presented. Contributions lied in the present study are two-fold. One is to introduce a new approach, which is a Salient Based Bag of Visual Word model (SBBoVW) to recognize difficult objects that have had low accuracy in previous methods. This method integrates SIFT features of the original and salient parts of pictures and fuses them together to generate better codebooks using bag of visual word method. The second contribution is to introduce a new algorithm for finding object place based on the salient map automatically. The performance evaluation on several data sets proves that the new approach outperforms other state-of-the-arts.

Keywords

saliency map; SIFT feature; Bag of Visual Words model (BoVW); image retrieval; object recognition; and object location;

Citations & Related Records

Reference

1	G. Csurka, C. R. Dance, L. Fan, J. Willamowski, C. Bray, and D. Maupertuis, “Visual Categorization with Bags of Keypoints,” Work. Stat. Learn. Comput. vision, ECCV, vol. 1, pp. 1–2, 2004. Article (Ref Link)
2	C.-C. Chang and C.-J. Lin, “Libsvm,” ACM Trans. Intell. Syst. Technol., vol. 2, no. 3, pp. 1–27, Apr. 2011. Article (CrossRef Link) DOI
3	D. G. Lowe, "Object recognition from local scale-invariant features," in Proc. of Seventh IEEE Int. Conf. Comput. Vis., vol. 2, pp. 1150-1157, 1999. Article (CrossRef Link)
4	A. Vedaldi and B. Fulkerson, “VLFeat - An open and portable library of computer vision algorithms,” Design, vol. 3, no. 1, pp. 1–4, 2010. Article (CrossRef Link)
5	J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong, "Locality-constrained linear coding for image classification," in Proc. of IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 3360-3367, 2010. Article (CrossRef Link)
6	M. Oquab, I. P. France, B. Leon ; L. Ivan ,S. Josef, "Is object localization for free ? - Weakly-supervised learning with convolutional neural networks," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 685-694. 2015. Article (CrossRef Link)
7	L. Fei-Fei, R. Fergus, and P. Perona, “Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories,” Comput. Vis. Image Underst., vol. 106, no. 1, pp. 59–70, Apr. 2007. Article (CrossRef Link) DOI
8	P. Griffin, G. Holub, AD. Perona, “Caltech-256 object category dataset,” California Institute of Technology, 2007. Article (Ref Link)
9	H. Jiang, J. Wang, Z. Yuan, Y. Wu, N. Zheng, and S. Li, "Salient object detection: A discriminative regional feature integration approach," in Proc. of IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 2083-2090, Jun. 2013. Article (CrossRef Link)
10	X. Long, H. Lu, and W. Li, “Image classification based on nearest neighbor basis vectors,” Multimed. Tools Appl., vol. 71, no. 3, pp. 1559–1576, Nov. 2012. Article (CrossRef Link) DOI
11	H. Bannour and C. Hudelot, “Building and using fuzzy multimedia ontologies for semantic image annotation,” Multimed. Tools Appl., pp. 2107–2141, May 2013. Article (CrossRef Link)
12	M.-U. Kim and K. Yoon, “Performance evaluation of large-scale object recognition system using bag-of-visual words model,” Multimed. Tools Appl., Jun. 2014. Article (CrossRef Link)
13	S. Zhong, Y. Liu, Y. Liu, and F. Chung, “Region level annotation by fuzzy based contextual cueing label propagation,” Multimed. Tools Appl., vol. 70, no. 2, pp. 625–645, Jan. 2012. Article (CrossRef Link) DOI
14	G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg, "Baby talk: Understanding and generating simple image descriptions," Cvpr 2011, pp. 1601-1608, Jun. 2011. Article (CrossRef Link)
15	K. Murphy, A. Torralba, D. Eaton, and W. Freeman, “Object Detection and Localization Using Local and Global Features,” pp. 382–400, 2006. Article (CrossRef Link)
16	C. H. Lampert, M. B. Blaschko, and T. Hofmann, "Beyond sliding windows: Object localization by efficient subwindow search," in Proc. of 2008 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 1-8, Jun. 2008.Article (CrossRef Link)
17	H. Jiang, J. Wang, Z. Yuan, T. Liu, N. Zheng, and S. Li, “Automatic Salient Object Segmentation Based on Context and Shape Prior,” BMVC, Vol. 6. No. 7, pp. 110.1--110.12,. 2011. Article (CrossRef Link)
18	V. Dey, Y. Zhang, M. Zhong, and G. Engineering, “A REVIEW ON IMAGE SEGMENTATION TECHNIQUES WITH REMOTE SENSING PERSPECTIVE,” na, vol. XXXVIII, pp. 31–42, 2010. Article (Ref Link)
19	A. Borji, M.-M. Cheng, H. Jiang, and J. Li, “Salient Object Detection: A Survey,” arXiv preprint arXiv:1411.5878, pp. 1–26, Nov. 2014. Article (Ref Link)
20	J. Kim and K. Grauman, "Boundary preserving dense local regions," Cvpr 2011, pp. 1553-1560, Jun. 2011. Article (CrossRef Link)
21	A. Borji, D. N. Sihite, and L. Itti, "Salient Object Detection : A Benchmark," in Proc. of the 12th European Conference on Computer Vision - Volume Part II, Springer-Verlag, pp. 414-429, 2012. Article (CrossRef Link)
22	P. Wang, J. Wang, G. Zeng, J. Feng, H. Zha, and S. Li, "Salient object detection for searched web images via global saliency," in Proc. of 2012 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 3194-3201, Jun. 2012. Article (CrossRef Link)
23	G. H. Le Wang, Jianru Xue, Nanning Zheng, "Automatic salient object extraction with contextual cue," in Proc. of 2011 Int. Conf. Comput. Vis., pp. 105-112, Nov. 2011. Article (CrossRef Link)
24	C. Liu, J. Yuen, A. Torralba, J. Sivic, and W. T. Freeman, "SIFT Flow : Dense Correspondence across Different Scenes," in Computer Vision-ECCV 2008, Springer Berlin Heidelberg, Vol. 1, no. 1, pp. 28-42, 2008. Article (CrossRef Link)
25	Q. Yan, L. Xu, J. Shi, and J. Jia, "Hierarchical Saliency Detection," in Proc. of 2013 IEEE Conf. Comput. Vis. Pattern Recognit., pp. 1155-1162, Jun. 2013. Article (CrossRef Link)
26	A. Borji, “What is a salient object? A dataset and a baseline model for salient object detection,” Image Processing, IEEE Transactions on, Vol. 24(2), pp.742-756, 2015. Article (CrossRef Link) DOI
27	H. Bay, T. Tuytelaars, and L. Van Gool, "SURF : Speeded Up Robust Features," in Computer vision-ECCV 2006, Springer Berlin Heidelberg, pp. 404-417, 2006. Article (CrossRef Link)
28	Z. Li, J. Liu, J. Tang, and H. Lu, "Robust Structured Subspace Learning for Data Representation," IEEE Trans. Pattern Anal. Mach. Intell., vol. X, no. X, pp. 1-1, 2015. Article (CrossRef Link) DOI
29	N. Dalal, B. Triggs, and D. Europe, "Histograms of Oriented Gradients for Human Detection," in Proc. of Computer Vision and Pattern Recognition, 2005. CVPR 2005, IEEE Computer Society Conference on, vol. 1, pp. 886-893, 2005. Article (CrossRef Link)
30	N. M. Elfiky, F. Shahbaz Khan, J. van de Weijer, and J. Gonzàlez, “Discriminative compact pyramids for object and scene recognition,” Pattern Recognit., vol. 45, no. 4, pp. 1627–1636, Apr. 2012. Article (CrossRef Link) DOI
31	Z. Li, J. Liu, Y. Yang, X. Zhou, and S. Member, “Clustering-Guided Sparse Structural Learning for Unsupervised Feature Selection,” Knowledge and Data Engineering, IEEE Transactions on Vol. 26, no. 9, pp. 2138–2150, 2014. Article (CrossRef Link) DOI
32	C.-C. Chiang, “Interactive tool for image annotation using a semi-supervised and hierarchical approach,” Comput. Stand. Interfaces, vol. 35, no. 1, pp. 50–58, Jan. 2013. Article (CrossRef Link) DOI
33	B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman, “LabelMe: A Database and Web-Based Tool for Image Annotation,” Int. J. Comput. Vis., Vol. 77, no. 1–3, pp. 157–173, Oct. 2007. Article (CrossRef Link) DOI
34	A. C. Berg, "SVM-KNN : Discriminative Nearest Neighbor Classification for Visual Category," in Proc. of Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on. Vol. 2, pp. 2126-2136, 2006. Article (CrossRef Link)
35	A. M. Tousch, S. Herbin, and J. Y. Audibert, “Semantic hierarchies for image annotation: A survey,” Pattern Recognit., vol. 45, no. 1, pp. 333–345, Jan. 2012. Article (CrossRef Link) DOI
36	O. Boiman, E. Shechtman, and M. Irani, "In Defense of Nearest-Neighbor Based Image Classi fi cation," in Proc. of Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pp. 1-8. IEEE, 2008 Article (CrossRef Link)
37	A. Fakhari and A. M. E. Moghadam, "Combination of classification and regression in decision tree for multi-labeling image annotation and retrieval," Appl. Soft Comput., vol. 13, no. 2, pp. 1292-1302, Feb. 2013. Article (CrossRef Link) DOI
38	C.-H. Lee, H.-C. Yang, and S.-H. Wang, “An image annotation approach using location references to enhance geographic knowledge discovery,” Expert Syst. Appl., vol. 38, no. 11, pp. 13792–13802, May 2011. Article (CrossRef Link)
39	S. Lazebnik, C. Schmid, and J. Ponce, "Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories," in Proc. of 2006 IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit. - Vol. 2, vol. 2, pp. 2169-2178, 2006. Article (CrossRef Link)
40	P. Jain, B. Kulis, and K. Grauman, "Fast Image Search for Learned Metrics," in Proc. of Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pp. 1-8. IEEE, 2008. Article (CrossRef Link)
41	and A. W. S. Jan C. Gemert, Jan-Mark Geusebroek, Cor J. Veenman, "Kernel Codebooks for Scene Categorization," in Proc. of the 10th European Conference on Computer Vision: Part III (ECCV '08), 2008, p. 7_52. Article (CrossRef Link)
42	and T. H. J. Yang, K. Yu, Y. Gong, "Linear spatial pyramid matching using sparse coding for image classification," in CVPR'09, 2009. Article (CrossRef Link)
43	H. Bilen, V. P. Namboodiri, and L. J. Van Gool, “Object and Action Classification with Latent Window Parameters,” Int. J. Comput. Vis., vol. 106, no. 3, pp. 237–251, Aug. 2013. Article (CrossRef Link) DOI
44	M. S. Biagio, L. Bazzani, M. Cristani, and V. Murino, "Weighted bag of visual words for object recognition," in Proc. of Image Processing (ICIP), 2014 IEEE International Conference on, pp. 2734-2738, 2014.Article (CrossRef Link)
45	C. Engineering, "MULTI-STAGE OBJECT CLASSIFICATION FEATURING CONFIDENCE ANALYSIS OF CLASSIFIER AND INCLINED LOCAL NAIVE BAYES NEAREST NEIGHBOR," in Proc. of Image Processing (ICIP), 2014 IEEE International Conference on, pp. 5177-5181, 2014. Article (CrossRef Link)

3	(2017) KSII Transactions on internet and information systems : TIIS Ear Recognition by Major Axis and Complex Vector Manipulation / 11 (3) , 1650
9	(2016) KSII Transactions on internet and information systems : TIIS RLDB: Robust Local Difference Binary Descriptor with Integrated Learning-based Optimization / 12 (9) , 4429