Browse > Article
http://dx.doi.org/10.9717/kmms.2014.17.3.277

Language Identification by Fusion of Gabor, MDLC, and Co-Occurrence Features  

Jang, Ick-Hoon (경운대학교 항공전자공학과)
Kim, Ji-Hong (동의대학교 영상정보공학과)
Publication Information
Abstract
In this paper, we propose a texture feature-based language identification by fusion of Gabor, MDLC (multi-lag directional local correlation), and co-occurrence features. In the proposed method, for a test image, Gabor magnitude images are first obtained by Gabor transform followed by magnitude operator. Moments for the Gabor magniude images are then computed and vectorized. MDLC images are then obtained by MDLC operator and their moments are computed and vectorized. GLCM (gray-level co-occurrence matrix) is next calculated from the test image and co-occurrence features are computed using the GLCM, and the features are also vectorized. The three vectors of the Gabor, MDLC, and co-occurrence features are fused into a feature vector. In classification, the WPCA (whitened principal component analysis) classifier, which is usually adopted in the face identification, searches the training feature vector most similar to the test feature vector. We evaluate the performance of our method by examining averaged identification rates for a test document image DB obtained by scanning of documents with 15 languages. Experimental results show that the proposed method yields excellent language identification with rather low feature dimension for the test DB.
Keywords
Language Identification; Texture Feature; Gabor Transform; MDLC;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 G.S. Pearke and T.N. Tan, "Script and Language Identification from Document Images," Proc. the IEEE Workshop Document Image Anal., pp. 10-17, 1997.
2 T.N. Tan, "Rotation Invariant Texture Features and Their use in Automatic Script Identification," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 20, No. 7, pp. 743-756, 1998.
3 W. Chan and G. Coghill, "Text Analysis using Local Energy," Pattern Recognit., Vol. 34, No. 12, pp. 2523-2532, 2001.   DOI   ScienceOn
4 A. Busch, W.W. Boles, and S. Sridharan, "Texture for Script Identification," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 27, No. 11, pp. 1720-1732, 2005.   DOI   ScienceOn
5 P.S. Hiremath and S. Shivashankar, "Wavelet Based Co-occurrence Histogram Features for Texture with an Application to Script Identification in a Document Image," Pattern Recognit. Lett., Vol. 29, No. 9, pp. 1182-1189, 2008.   DOI   ScienceOn
6 장익훈 외, "Gabor 특징과 웨이브렛 영역의 BDIP와 BVLC 특징을 이용한 질감 특징 기반 언어 인식," 전자공학회논문지, 제48권, SP편, 제4호, pp. 72-82, 2011.   과학기술학회마을
7 I.H. Jang, N.C. Kim, and M.H. Park, "Texture- feature Based Language Identification using Gabor and MDLC Features," Proc. the IEEE Int. Conf. Multimedia Expo, 2011.
8 B.S. Manjunath and W.Y. Ma, "Texture Features for Browsing and Retrieval of Image Data," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 18, No. 8, pp. 837-842, 1996.   DOI   ScienceOn
9 C. Liu, "The Bayes Decision Rule Induced Similarity Measures," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 29, No. 6, pp. 1086- 1090, 2007.   DOI   ScienceOn
10 김원희 외, "Gabor 웨이블릿을 이용한 회전 변환에 무관한 질감 분류 기법," 한국멀티미디어학회논문지, 제10권, 제9호, pp. 1125-1134, 2007.
11 R.M. Haralick, K. Shanmugam, and I. Dinstein, "Textural Features for Image Classification," IEEE Trans. Syst., Man, Cybern., Vol. SMC- 3, No. 6, pp. 610-621, 1973.   DOI
12 G.V. Wouver, P. Scheunders, and D.V. Dyck, "Statistical Texture Characterization from Discrete Wavelet Representation," IEEE Trans. Image Process., Vol. 8, No. 4, pp. 592- 598, 1999.   DOI   ScienceOn
13 A.L. Spitz, "Determination of the Script and Language Content of Document Images," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 19, No. 3, pp. 235-245, 1997.   DOI   ScienceOn
14 D. Ghosh, T. Dube, and A.P. Shivaprasad, "Script Recognition-a Review," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 32, No. 12, pp. 2142-2161, 2010.   DOI   ScienceOn
15 J. Hochberg, L. Kerns, P. Kelly, and T. Thomas, "Automatic Script Identification from Document Images using Cluster-based Templates," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 19, No. 2, pp. 176-181, 1997.   DOI   ScienceOn
16 Q.A. Holmes, D.R. Neusch, and R.A. Shuchman, "Textural Analysis and Real-time Classification of Sea-ice Types using Digital SAR Data," IEEE Trans. Geosci. Remote Sensing, Vol. GE-22, No. 2, pp. 113-120, 1984.   DOI   ScienceOn
17 L. Shijian and C.L. Tan, "Script and Language Identification in Noisy and Degraded Document Images," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 30, No. 1, pp. 14-24, 2008.   DOI   ScienceOn