Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2005.12B.7.737

Three-Level Color Clustering Algorithm for Binarizing Scene Text Images  

Kim Ji-Soo (전남대학교 전산학과)
Kim Soo-Hyung (전남대학교 전산컴퓨터정보통신공학부)
Abstract
In this paper, we propose a three-level color clustering algerian for the binarization of text regions extracted from natural scene images. The proposed algorithm consists of three phases of color segmentation. First, the ordinary images in which the texts are well separated from the background, are binarized. Then, in the second phase, the input image is passed through a high pass filter to deal with those affected by natural or artificial light. Finally, the image Is passed through a low pass filter to deal with the texture in texts and/or background. We have shown that the proposed algorithm is more effective used gray-information binarization algorithm. To evaluate the effectiveness of the proposed algorithm we use a commercial OCR software ARMI 6.0 to observe the recognition accuracies on the binarized images. The experimental results on word and character recognition show that the proposed approach is more accurate than conventional methods by over $35\%$.
Keywords
Scene Text Recognition; Adaptive Binarization; Color Clustering;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 P. Clark and M Mirmehdi, 'Recognizing Text in Real Scene,' International Journal of Document Analysis and Recognition, Vol.4, pp.243-257, 2002   DOI
2 H.R. Byun, MC. Roh, K.C. Kim, Y.W. Choi and S.W. Lee, 'Scene Text Extraction in Complex Images,' Proc. 5th International Workshop on Document Analysis Systems, pp.307-318, 2002
3 D. Chen, H. Bourlard and H.P. Thiran, 'Text Identification in Complex Background Using SVM,' Proc. IEEE Computer Society Conference on CVPR, Vol.2, pp.621-626, 2001   DOI
4 B.T. Chun, Y. Bae and T.Y. Kim, 'Automatic Text Extraction in Digital Videos using FFT and Neural Network,' Proc. IEEE Fuzzy Systems Conference, pp.1112-1115, Seoul, Korea, 1999   DOI
5 D.H. Ballard and C.M. Brown, Computer Vision, Prentice-Hall, 1982
6 강나영, '시공간 데이터를 위한 클러스터링 기법의 성능 비교', 학위논문(석사), 이화여자대학교 과학기술대학원 : 컴퓨터학과, 2003. 8
7 김지수, 김수형, 최영우, '명도 정보와 Split/Merge 분할을 이용한 자연 영상에서의 텍스트 영역 추출', 한국정보과학회 논문지 : 소프트웨어 및 응용, Vol.32, No.6, pp.502-511, 2005   과학기술학회마을
8 A. E. Savakis, 'Adaptive Document Image Thresholding sing Foreground and Background Clustering,' Int. Conf. Image Proc. ICIP'98, Chicago, October, 1998
9 N. Otsu, 'A Threshold Selection Method from Gray-level Histograms,' IEEE Trans. on System Man and Cybernetics, 9(1), pp.62-66, 1979   DOI   ScienceOn
10 김길천, 최영우, 변혜란, '장면(Scene) 텍스트 추출 및 기울기/원근 추정', 제14회 영상처리 및 이해에 관한 워크샵 발표 논문집, pp.277-282, 제주도, 2002
11 김의정, 정원일, '칼라 문서에서 문자 영역 추출을 위한 클러스터링 기법', 대전산업대학교 논문집, 제14권, pp.104-116, 1997
12 김형균, 최원호, '자연 영상에서의 문자 패턴 추출', 울산대학교 공학연구논문집, 제26권 제2호, pp.35-54, 1995
13 노명철, 최영우, 이성환, '색상 및 명도 정보를 이용한 장면 텍스트 추출', 제14회 영상처리 및 이해에 관한 워크샵 발표 논문집, pp.515-520, 제주도, 2002
14 C. Wolf and J.M. Jolion, 'Extraction and Recognition of Artificial Text in Multimedia Documents,' Pattern Analysis and Applications, Vol.6, No.4, pp.306-326, 2003   DOI
15 김지수, 김수형, '명도 정보를 이용한 자연 영상에서의 텍스트 영역 추출', 한국정보처리학회 호남.제주지부 학술발표논문집, 제3권 제1호, pp. 127-132, 2003
16 J. Zhang, X. Chen, A. Hanneman, J. Yang and A. Waibel, 'A Robust Approach for Recognition of Text Embedded in Natural Scenes,' Proc. 16th International Conference on Pattern Recognition, Vol.3, pp.204-207, 2002   DOI
17 X. Wang, X. Ding and C. Liu, 'Character Extraction and Recognition in Natural Scene Images,' Proc. Sixth International Conference on Document Analysis and Recognition, pp.1084-1088, 2001
18 V. Wu, R. Manmatha and EM. Riseman, 'An Automatic System to Detect and Recognize Text in Images,' IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.21, No.11, pp.1224-1229, 1999   DOI   ScienceOn
19 J. Gao and J. Yang, 'An Adaptive Algorithm for Text Detection from Natural Scenes,' Proc. IEEE Computer Society Conference on CVPR, Vol.2, pp.84-89, 2001
20 O. Hori, 'A Video Text Extraction Method for Character Recognition,' Proc. Fifth International Conference on Document Analysis and Recognition, pp.25-28, 1999   DOI
21 J. Hoya, A. Shio and S. Akamatsu, 'Recognizing Characters in Scene Images,' IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 16, No. 2, pp. 67-82, 1995   DOI   ScienceOn
22 P. Clark and M. Mirmehdi, 'Combining Statistical Measures to Find Image Text Regions,' Proc. 15th International Conference on Pattern Recognition, Vol.1, pp.450-453, 2000   DOI