Browse > Article
http://dx.doi.org/10.5392/IJoC.2015.11.2.001

Document Layout Analysis Based on Fuzzy Energy Matrix  

Oh, KangHan (School of Electronics and Computer Engineering Chonnam National University)
Kim, SooHyung (School of Electronics and Computer Engineering Chonnam National University)
Publication Information
Abstract
In this paper, we describe a novel method for document layout analysis that is based on a Fuzzy Energy Matrix (FEM). A FEM is a two-dimensional matrix that contains the likelihood of text and non-text and is generated through the use of Fuzzy theory. The key idea is to define an Energy map for the document to categorize text and non-text. The proposed mechanism is designed for execution with a low-resolution document image, and hence our method has a fast processing speed. The proposed method has been tested on public ICDAR 2009 datasets to conduct a comparison against other state-of-the-art methods, and it was also tested with Korean documents. The results of the experiment indicate that this scheme achieves superior segmentation accuracy, in terms of both precision and recall, and also requires less time for computation than other state-of-the-art document image analysis methods.
Keywords
Fuzzy Energy Matrix; Document Layout Segmentation; Fuzzy Set;
Citations & Related Records
연도 인용수 순위
  • Reference
1 K. J. Anil and Y. Bin, “Document representation and its application to pagedecomposition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, 1998, pp. 294-308.   DOI
2 A. Antonacopoulos, S. Pletschacher, D. Bridson, and C. Papadopoulos, “ICDAR2009 Page Segmentation Competition”, Proc. ICDAR, 2009, pp. 1370-1374.
3 Ray Smith, “Hybrid page Layout Analysis via Tab-Stop Detection”, Proc. ICDAR, Barcelona, Spain, 2009, pp. 241-245.
4 J. Sauvola and M. Pietikainen, “Adaptive document image binarization,” J. The Journal of the Pattern Recognition, 2000, pp. 225-236   DOI
5 A. Fletcher and R. Kasturi, “A robust algorithm for text string separation from mixed text/graphics images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 10, 1998, pp. 294-308.
6 K. J. Anil and Y. Bin, “Document representation and its application to page decomposition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, 1998, pp. 294-308.   DOI
7 D. L. O’Gorman, “The document spectrum for page layout analysis,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 15, 1993, pp. 1162-1173.   DOI
8 C. Laura, C. Ciro, and G. Przemyslaw, “Document page segmentation using neuro-fuzzy approach,” Applied Soft Computing, vol. 8, 2008, pp. 118-126.   DOI
9 K. Kise, A. Sato, and M. Iwata, “Segmentation of page images using the area Voronoi diagram,” In Computer Vision Image Understanding, vol. 70, no. 3, 1998, pp. 370-382.   DOI
10 P. Bhupendra Kumar and S. Sanjay, “Integrated Fuzzy-HMM for project uncertainties in time-cost tradeoff problem,” J. Applied Soft Computing, vol. 21, 2014, pp. 320-329.   DOI
11 S. M. Chen, “Forecasting enrollments based on fuzzy time series,” Fuzzy Sets and Systems, vol. 81, 1996, pp. 311-319.   DOI
12 N. Otsu, “A Threshold Selection method from Gray Level Histogram,” IEEE Transactions on Systems, Man and Cybernetics, vol. 9, no. 1, 1975, pp. 62-66.   DOI
13 D. H. S. Baird, “Backgroud structure in document images,” Int. J. Pattern Recognition Artif. Intell., vol. 8, 1994, pp. 1013-1030.   DOI