Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2002.9B.3.311

Document Image Layout Analysis Using Image Filters and Constrained Conditions  

Jang, Dae-Geun (Telecommunication Network Research Lab, Dept.of Electronics Electrical Computer,Graduate School of Kyungpook National University)
Hwang, Chan-Sik (Telecommunication Network Research Lab, Dept.of Electronics Electrical Computer, Kyungpook National University)
Abstract
Document image layout analysis contains the process to segment document image into detailed regions and the process to classify the segmented regions into text, picture, table or etc. In the region classification process, the size of a region, the density of black pixels, and the complexity of pixel distribution are the bases of region classification. But in case of picture, the ranges of these bases are so wide that it's difficult to decide the classification threshold between picture and others. As a result, the picture has a higher region classification error than others. In this paper, we propose document image layout analysis method which has a better performance for the picture and text region classification than that of previous methods including commercial softwares. In the picture and text region classification, median filter is used in order to reduce the influence of the size of a region, the density of black pixels, and the complexity of pixel distribution. Futhermore the classification error is corrected by the use of region expanding filter and constrained conditions.
Keywords
document image; region analysis; page segmentation; region classification;
Citations & Related Records
연도 인용수 순위
  • Reference
1 X. Li, W. Gao, S.Y. Chi, K.A. Moon and H.J. Kim, 'An Efficient Method for Page Segmentation,' Proc. ICICS, Vol.2, pp.957-961, 1997   DOI
2 K. Kise, M. Iwata and K. Matsumoto, 'A Computational Geometric Approach to Text-line Extraction from Binary Document Images,' Proc 3th Int. Work Doument Analysis System, pp.346-355, 1998
3 H. Fujisawa and Y. Nakano, 'A Top-Down Approach for the Analysis of Document Images,' Proc. Work. Syntatic and Structural Pattern Recognition, Murray Hill, USA, pp.113-122, 1990
4 Y.Y. Tang, C.Y. Suen, C.D. Yan and M. Cheriet, 'Document Analysis and understang : A Brief Survey,' Proc. 1st Int. Conf. Document Analysis and Recognition, Saint-Malo, France, pp.17-31, 1991
5 J. Kong and Z. Chi, 'Image Classification Using Kolmogorov Complexity Measure with Extracted Blocks,' IEICE Trans. Inf. & Syst., Vol.1, E81-D, pp.1239-1246, 1998
6 Mario I. Chacon Murguia, 'Document Segmentation Using Texture Variance and Low Resolution Images,' IEEE Southwest. Symp. Image Analysis and Interpretation, pp.164-167, 1998   DOI
7 D. Drivas and A. Amin, 'Page Segmentation and Classification Utilizing Bottom-up Approach,' Proc. ICDAR, pp.610-614, 1995
8 S.K. Yip and Z. Chi, 'Page Segmentation and Content Classification for Automatic Document Image Processing,' Proc. Int. Symp. Intelligent Multimedia, Video and Speech Processing, pp.279-282, 2001   DOI