Browse > Article
http://dx.doi.org/10.5392/IJoC.2014.10.3.035

Text Line Segmentation using AHTC and Watershed Algorithm for Handwritten Document Images  

Oh, KangHan (School of Electronics and Computer Engineering Chonnam National University)
Kim, SooHyung (School of Electronics and Computer Engineering Chonnam National University)
Na, InSeop (School of Electronics and Computer Engineering Chonnam National University)
Kim, GwangBok (School of Electronics and Computer Engineering Chonnam National University)
Publication Information
Abstract
Text line segmentation is a critical task in handwritten document recognition. In this paper, we propose a novel text-line-segmentation method using baseline estimation and watershed. The baseline-detection algorithm estimates the baseline using Adaptive Head-Tail Connection (AHTC) on the document. Then, the watershed method segments the line region using the baseline-detection result. Finally, the text lines are separated by watershed result and a post-processing algorithm defines the lines more correctly. The scheme successfully segments text lines with 97% accuracy from the handwritten document images in the ICDAR database.
Keywords
Watershed; Text Line Segmentation; Handwritten Document;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Manisha Bhagwat, R. K. Krishna, and Vivek Pise, "Watershed Trans-formation," International Journal of Computer Science and Communication, vol. 1, no. 1, 2010, pp. 175-177.
2 P. Soille, Morphological Image Analysis, 2nd ed., New York: Springer, 2002.
3 L. O'Gorman, "The document spectrum for page layout analysis," IEEE Trans. Pattern Anal. Mach. Intell.,vol. 15, no. 11,1993, pp. 1162-1173.   DOI   ScienceOn
4 B. Gatos, "ICFHR 2010 Handwriting Segmentation Contest," in ICFHR, 2010, pp. 737-742.
5 Z. Shi, S. Setlur, and V. Govindaraju, "A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines," Proc. 10th International Conference on Document Analysis and Recognition (ICDAR'09), 2009, pp. 176-180.
6 A. Nicolaou and B. Gatos, "Handwritten Text Line Segmentation by Shredding Text into its Lines," Proc. 10th International Conference on Document Analysis and Recognition (ICDAR'09), Barcelona, 2009, pp. 626-630.
7 A. Lemaitre, J. Camillerapp, and B. Couasnon, "Interest of perceptive vision for document structure analysis," Proc. Human Vision and Electronic Imaging XV, 2010.
8 MATLAB Notes, http://www.mathworks.de/company/news_notes/win02/watershed.html
9 L. Likforman-Sulem, A. Zahour, and B. Taconet, "Text line segmentation of historical documents: a survey," Int. J. Doc. Anal. Recognit, vol. 9, no. 2, 2007, pp. 123-138.   DOI   ScienceOn
10 M. Bulacu, R. Koert, L. Schomaker, and T. Zant, "Layout an alysis of handwritten historical documents for searching the archive of the Cabinet of the Dutch Queen," in: Proceeding ICDAR, 2007, pp. 357-361.
11 L. Gorman, "The document spectrum for page lay-out analysis," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 15, no. 11, 1993, pp. 1162-1173.   DOI   ScienceOn
12 Y. G. Ciardiello, G. Scafuro, M. T. Degrandi, M. R. Spada, and M. P. Roccotelli, "An experimental system for office document handling and text recognition," Proc 9th Int. Conf. on Pattern Recognition, 1988, pp. 739-743.
13 A. Zahour, B. Taconet, P. Mercy, and S. Ramdane, "Arabic hand-written text-line extraction," Proc. ICDAR, 2001, pp. 281-285.
14 L. O'Gorman, "The document spectrum for page layout analysis," IEEE Trans. Pattern Anal. Mach. Intel., vol. 15, no. 11, 1993, pp. 1162-1173.   DOI   ScienceOn
15 M. Feldbach and K. D. Tonnies, "Line detection and segmentation in historical church registers," Proc. ICDAR, 2001, pp. 743-747.
16 L. Likforman-Sulem and C. Faure, "Extracting text lines in handwritten documents by perceptual grouping," Proc. Advances in handwriting and drawing: a multidisciplinary approach, 1994, pp. 117-135.
17 L. O'Gorman, "The document spectrum for page layout analysis," IEEE Trans. Pattern Anal. Mach. Intell., vol. 15, no. 11,1993, pp. 1162-1173.   DOI   ScienceOn