Browse > Article

Document Image Segmentation and Classification using Texture Features and Structural Information  

Park, Kun-Hye (영남대학교 컴퓨터공학과)
Kim, Bo-Ram (영남대학교 컴퓨터공학과)
Kim, Wook-Hyun (영남대학교 컴퓨터공학과)
Publication Information
Journal of the Institute of Convergence Signal Processing / v.11, no.3, 2010 , pp. 215-220 More about this Journal
Abstract
In this paper, we propose a new texture-based page segmentation and classification method in which table region, background region, image region and text region in a given document image are automatically identified. The proposed method for document images consists of two stages, document segmentation and contents classification. In the first stage, we segment the document image, and then, we classify contents of document in the second stage. The proposed classification method is based on a texture analysis. Each contents in the document are considered as regions with different textures. Thus the problem of classification contents of document can be posed as a texture segmentation and analysis problem. Two-dimensional Gabor filters are used to extract texture features for each of these regions. Our method does not assume any a priori knowledge about content or language of the document. As we can see experiment results, our method gives good performance in document segmentation and contents classification. The proposed system is expected to apply such as multimedia data searching, real-time image processing.
Keywords
Page Segmentation; Contents Classification; Gabor filter; Document Image Processing; Textrure Analysis;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 N. Otsu, "A threshold selection method from gray level histograms", IEEE Trans. on Syst. Man Cybern. VoI.9, No.1, pp.62-66, 1979.
2 R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addison Wesley, New York, 1992.
3 Anil K. .Jain , Farshid Farrokhnia, "Unsupervised texture segmentation using Gabor filters", Pattern Recognition, Vol.24 No.12, pp.1167-1186, Dec. 1991.   DOI   ScienceOn
4 김보람, 오준택, 김욱현, "지역적 엔트로피와 텍스처의 주성분 분석을 이용한 문서양상의 분할 및 구성요소 분류", 정보처리학회, 제16-B권, 제5호, pp. 377-384, 2009.   과학기술학회마을   DOI   ScienceOn
5 M-W Lin, J-R Tapamo, B Ndovie, "A texture-based method for document segmentation and classification," ARIMA/SACJ, Vol.36, pp.49-56, 2006.
6 R. M. Haralick, "Statistical and structural approaches to texture", Proceeding IEEE, 67(5), pp.786-804, 1990
7 F. M. Wahi K. Y. Wong, and R. G. Casey, "Block segmentation and text extraction in mixed text/image documents," Computer Graphics and Image Processing, vol. 22, pp.375-390, Feb. 1982.
8 J. L. Fisher, S. C. Hinds and D. P. D'Amato, "A rule-based system for document image segmentation," Proc. 10th Int. conf. Pattern Recognition, pp.567-572, 1990.
9 서정, 김보람, 오준택, 김욱현, "텍스쳐 기반 BP 신경망을 이용한 위성영상의 도로영역 추출 ", 한국신호처리시스템학회논문지, v.10, no.3, pp.164-169, 2009년 7월.   과학기술학회마을
10 K. Y. Wong, R. G. Casey and F. M. Wahl, "Document analysis system", IBM J.Res. Development, Vol. 6, pp.642-656. Nov. 1982.