[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5392/IJoC.2014.10.4.001

Automatic Segmentation of Product Bottle Label Based on GrabCut Algorithm

Na, In Seop (school of Electronics and Computer Engineering Chonnam National University)
Chen, Yan Juan (School of Electronics and Computer Engineering Chonnam National University)
Kim, Soo Hyung (School of Electronics and Computer Engineering Chonnam National University)

Publication Information

International Journal of Contents / v.10, no.4, 2014 , pp. 1-10 More about this Journal

Abstract

In this paper, we propose a method to build an accurate initial trimap for the GrabCut algorithm without the need for human interaction. First, we identify a rough candidate for the label region of a bottle by applying a saliency map to find a salient area from the image. Then, the Hough Transformation method is used to detect the left and right borders of the label region, and the k-means algorithm is used to localize the upper and lower borders of the label of the bottle. These four borders are used to build an initial trimap for the GrabCut method. Finally, GrabCut segments accurate regions for the label. The experimental results for 130 wine bottle images demonstrated that the saliency map extracted a rough label region with an accuracy of 97.69% while also removing the complex background. The Hough transform and projection method accurately drew the outline of the label from the saliency area, and then the outline was used to build an initial trimap for GrabCut. Finally, the GrabCut algorithm successfully segmented the bottle label with an average accuracy of 92.31%. Therefore, we believe that our method is suitable for product label recognition systems that automatically segment product labels. Although our method achieved encouraging results, it has some limitations in that unreliable results are produced under conditions with varying illumination and reflections. Therefore, we are in the process of developing preprocessing algorithms to improve the proposed method to take into account variations in illumination and reflections.

Keywords

GrabCut; Segmentation; Bottle label; Initial trimap; Human interaction;

Citations & Related Records

Reference

1	C. Li, C. Y. Kao, J. C. Gore, and Z. Ding, "Implicit Active Contours Driven by Local Binary Fitting Energy," IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1-7.
2	V. Caselles, F. Catte, T. Coll, and F. Dibis, "A geometric model for active contours in image processing," Nummerische Mathematik, no. 66, 1993, pp. 1-31.
3	M. Rousson and R. Seriche, "A Variational Framework for Active and Adapt active Segmentation of Vector Valued Images," Proceeding of IEEE Workshop on Motion and Video Computing, 2002.
4	J. B. Shi and J. D. Malik, "Normalized Cuts and Image Segmentation," IEEE Transactions on pattern analysis and machine intelligence, vol. 22, no. 8, 2000, pp. 888-905. DOI ScienceOn
5	Y. Boykov and Vladimir Kolmogorov, "An Experimental Comparison of Min-Cut/Flow Algorithms for Energy Minimization in Vision," IEEE Transactions on pattern analysis and machine intelligence, vol. 26, no. 9, 2004, pp. 1124-1137. DOI ScienceOn
6	Y. Boycov and M. Jolly, "Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images," In Proc. IEEE Int. Conf. on Computer Vision, In ICCV, 2001, pp. 105-112.
7	C. Rother, V. Kolmogorov, and A. Blake, "GrabCut: interactive for foreground using iterate graph cuts," SCM Transaction on Graphics(TOG), 2004.
8	Justin F. Talbot and Xiaoqian Xu, "Implementing GrabCut," Brigham Yong University Revised, Apr. 7, 2006.
9	Peng Wang, "GrabCut-Interactive Foreground Extraction," https://mywebspace.wisc.edu/pwang6/CS766.
10	B. C. Ko and J. Y. Nam, "Object-of-interest image segmentation based on human attention and semantic region," Optical Society of Society of America, vol. 23, Oct. 2006, pp. 2462-2470. DOI ScienceOn
11	L. Itti, C. Koch, and E. Niebur, "A Model of Saliency-Based Visual Attention for Rapid Scene Analysis," Institute of Electrical and Electronics Engineers Transactions on Pattern Analysis and Machine Intelligence, vol. 20, 1998, pp. 1254-1259.
12	L. Itti, "Automatic Foveation for Video Compression Using a Neurobiological Model of Visual Attention," Institute of Electrical and Electronics Engineers Transactions on Image Processing, vol. 13, no. 10, 2004, pp. 1304-1318.
13	L. Elazary and L. Itti, "Interesting objects are visually salient," Journal of Vision, vol. 8, no. 33, 2008, pp. 1-15.
14	L. Itti and C. Koch, "Computational modeling of visual attention," Nature Reviews Neuroscience, vol. 2, no. 3, 2001, pp. 194-203. DOI ScienceOn
15	S. W. Hong and L. Choi, "Automatic Flowers Recognition Using Segmentation," Korea Computer Congress, vol. 38, no. 1(A), 2011, pp. 463-465.
16	J. S. Lee, S. H. Kim, and J. H Park, G. S Lee, H. J Yang, C. W. Lee, "Recognition of Text in Wine Label Images," IEEE Pattern Recognition on Chinese Conference, 2009, pp. 1-5.
17	N. Otsu, "A Threshold Selection Method from Gray-Level Histograms," IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, 1979, pp. 62-66. DOI ScienceOn
18	D. Comaniciu and P. Meer, "Mean Shift: A Robust Approach Toward Feature Space Analysis," IEEE Transactions on PAMI, vol. 24, no. 5, 2002, pp. 1-17. DOI ScienceOn
19	Jong Hyun Park, Guee Sang Lee, and Soon Yong Park, "Color image segmentation using adaptive mean shift and statistical model-based method," Computers and mathematics with Applications, vol. 57, issue 6, Mar. 2009, pp. 970-980. DOI ScienceOn
20	T. F. Chan and L. A. Vese, "Active Contours Without Edge," IEEE Transactions on Image Processing, vol. 10, no. 2, 2001, pp. 266-277. DOI ScienceOn
21	A. Tasi, A. Yezzi, and A. Willsky, "Curve Evolution Implementation of the Mumford-Shah Functional for Image Segmentation, Depositing, Interpolation and Margination," Institute of Electrical and Electronics Engineers Transactions on Image Processing, vol. 10, no. 8, 2001, pp. 1169-1186.
22	D. Cremers, "A multiphase level set framework for variational motion segmentation," In Scale Space Methods in Computer Vision, vol. 2695, 2003, pp. 599-614.
23	C. Li, C. Xu, and M. D. Fox, "Level set evolution without reinitialization: A new variational formulation," Institute of Electrical and Electronics Engineers Conference on Computer Vision and Pattern Recognition, vol. 1, 2005, pp. 430-436.
24	J. Harel, C. Koch, and P. Perona, Graph-based visual saliency, in Advances in Neural Information Processing System 19, Cambridge, MA: MIT Press, 2007, pp. 545-552.
25	D. Ballard, "Generalizing the Hough Transform to Detection Arbitrary Shape," Pattern Recognition, vol. 13, no. 2, 1981, pp. 111-122. DOI ScienceOn
26	R. Duda and P. Hart, "Use of the Hough Transformation to Detect Line and Curves in picturers," Communication of the ACM, vol. 15, no. 1, Jan. 1972, pp. 11-15. DOI
27	R. C. Gonzalez, E. W. and S. L. Eddins, Digital Image Processing using MATLAB, Publishing House of Electronics Industry, 2002.
28	J. B. MacQueen, Some Methods for Classification and Analysis of Multiplicate Observations, Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, University of California Press, 1967, pp. 281-297.