[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5392/IJoC.2012.8.4.095

Title Extraction from Book Cover Images Using Histogram of Oriented Gradients and Color Information

Do, Yen (School of Electronics & Computer Engineering, Chonnam National University)
Kim, Soo Hyung (School of Electronics & Computer Engineering, Chonnam National University)
Na, In Seop (School of Electronics & Computer Engineering, Chonnam National University)

Publication Information

International Journal of Contents / v.8, no.4, 2012 , pp. 95-102 More about this Journal

Abstract

In this paper, we present a technique to extract the title areas from book cover images. A typical book cover image may contain text, pictures, diagrams as well as complex and irregular background. In addition, the high variability of character features such as thickness, font, position, background and tilt of the text also makes the text extraction task more complicated. Therefore, we propose a two steps efficient method that uses Histogram of Oriented Gradients and color information to find the title areas. Firstly, text localization is carried out to find the title candidates. Finally, refinement process is performed to find the sufficient components of title areas. To obtain the best result, we also use other constraints about the size, ratio between the length and width of the title. We achieve encouraging results of extracted title regions from book cover images which prove the advantages and efficiency of the proposed method.

Keywords

Library Automation; Text Extraction; Histogram of Orientated Gradient; Localization; Connected Component; Color Clustering;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	Yassin M. Y. Hasan and Lina J. Karam. (2000),"Morphological Text Extraction from Images", IEEE Transaction on Image Processing, Vol. 9, No. 11, pp.1978-1983. DOI ScienceOn
2	M. Chen and X. Ding(2000),"Analysis, understanding and representation of Chinese newspaper with complex layout," in Proc. 7th IEEE Int. Conf. Image Processing, Vancouver, 2000, BC, Canada, vol. 2, pp. 590-593.
3	A. Antonacopoulos, B. Gatos, and D. Bridson (2005),"ICDAR 2005 page segmentation competition," in Proc. ICDAR,2005, Seoul, Korea, pp.75-80.
4	A. K. Jain and B. Yu (1998), "Document representation and its application to page decomposition", IEEE Trans. Pattern Anal. Mach. Intell., Vol. 20, No. 3, pp. 294-308. DOI ScienceOn
5	Navneet Dalal, Bill Triggs, "Histograms of Oriented Gradients for Human Detection". Computer Vision and Pattern Recognition, 886-893, 2005.
6	B. Epshtein, E. Ofek, and Y. Wexler, "Detecting Text in Natural Scenes with Stroke Width Transform", in Proc. CVPR, 2010, pp.2963-2970.
7	Nguyen Noi Bai, Kim Nam and Youngjun Song "Extracting curved text lines using the chain composition and the expanded grouping method", in Korea information processing society, vol. 14-B, No.6, pp.453- 460, Oct. 2007. 과학기술학회마을 DOI ScienceOn
8	Kee chul Jung, K. I. Kim, Anil K. Jain.(2004),"Text Information Extraction in Images and Video: A Survey", Pattern Recognition, Vol. 37 No.5, pp. 977-997. DOI ScienceOn
9	Do Yen, S. H. Kim, S. C. Park, Ha Le, Y. J. Chen, S. H. Jeong, I. S. Na (2012), "Generation of Training Database Using a Noise Model for OCR Systems", in The 6th International Conference on Ubiquitous Information Management and Communication, Kuala Lumpur, 2012.(CD-Pub.)
10	Ha Le, S. H. Kim, S. C. Park, Do Yen, Y.J. Chen, S. H. Jeong, I. S. Na (2012), " Automatic Generation of Database to Train New Fonts for OCR Systems" in The 2012 International Workshop on Advanced Image Technology IWAIT2012, Ho Chi Minh City, 2012. pp. 178-182.
11	C. J. Park, K. A. Moon, O. W. Geun, and H. M. Choi(2000), "An efficient extraction of character string positions using morphological operator", in Proc. IEEE Int. Conf. Systems, Man, Cybernetics, 2000, vol. 3, pp. 1616-1620.
12	Z. Yu, K. Karu, and A. K. Jain (1995),"Locating text in complex color images," in Proc. 3rd Int. Conf. Document Analysis and Recognition,1995, pp. 146-149.
13	D. Chen, H. Bourlard, and J. P. Thiran (2001), "Text identification in complex background using SVM," in Proc. IEEE Computer Soc.Conf. Computer Vision and Pattern Recognition, 2001, pp. 621-626.