Browse > Article
http://dx.doi.org/10.3745/JIPS.2014.10.1.055

Stroke Width-Based Contrast Feature for Document Image Binarization  

Van, Le Thi Khue (Dept. of Electronics and Computer Engineering, Chonnam National University)
Lee, Gueesang (Dept. of Electronics and Computer Engineering, Chonnam National University)
Publication Information
Journal of Information Processing Systems / v.10, no.1, 2014 , pp. 55-68 More about this Journal
Abstract
Automatic segmentation of foreground text from the background in degraded document images is very much essential for the smooth reading of the document content and recognition tasks by machine. In this paper, we present a novel approach to the binarization of degraded document images. The proposed method uses a new local contrast feature extracted based on the stroke width of text. First, a pre-processing method is carried out for noise removal. Text boundary detection is then performed on the image constructed from the contrast feature. Then local estimation follows to extract text from the background. Finally, a refinement procedure is applied to the binarized image as a post-processing step to improve the quality of the final results. Experiments and comparisons of extracting text from degraded handwriting and machine-printed document image against some well-known binarization algorithms demonstrate the effectiveness of the proposed method.
Keywords
Degraded Document Image; Binarization; Stroke Width; Contrast Feature; Text Boundary;
Citations & Related Records
연도 인용수 순위
  • Reference
1 H. G. Adelmann, "An edge-sensitive noise reduction algorithm for image processing," Computers in Biology and Medicine, vol. 29, 1999, pp.137-145.   DOI   ScienceOn
2 Y. Yang, and H. Yan, "An adaptive logical method for the binarization of degraded document images," Pattern Recognition, vol. 33, 2000, pp.787-807.   DOI   ScienceOn
3 M. Valizadeh, M. Komeili, N. Armanfard, and E. Kabir, "Degraded document image binarization based on combination of two complementary algorithms," International Conference in Advances in Computer Tools for Engineering Applications, 2009, pp.595-599.
4 R.J. Schilling, Fundamentals of Robotics Analysis and Control, Prentice-Hall, Englewood Cliffs, NJ, 1990.
5 N. Otsu, "A threshold selection method from grey level histogram," IEEE Transactions on System, Man, and Cybernetics, vol. 9, 1979, pp.62-66.   DOI   ScienceOn
6 J. Kitter and J. Illingworth, "On threshold selection using clustering criteria," IEEE Transactions on System, Man, and Cybernetics, vol. SMC-15, 1985, pp.652-655.   DOI   ScienceOn
7 J. Bernsen, "Dynamic thresholding of gray level images," Proceedings of the 8th International Conference on Pattern Recognition, vol. 2, 1986, pp. 1251-1255.
8 W. Niblack, An Introduction to Digital Image Processing, NJ, USA: Prentice Hall, Englewood Cliffs, 1986.
9 J. Sauvola, and M. Pietikainen, "Adaptive document image binarization," Pattern Recognition, vol. 33, 2000, pp. 225-236.   DOI   ScienceOn
10 I. K. Kim, D. W. Jung, and R. H. Park, "Document image binarization based on topographic analysis using a water flow model," Pattern Recognition, vol. 35, 2002, pp.265-277.   DOI   ScienceOn
11 B. Gatos, I. Pratikakis, and S. Perantonis, "Adaptive degraded document image binarization," Pattern Recognition, vol. 39, 2006, pp.317-327.   DOI   ScienceOn
12 O. Trier and T. Taxt, "Evaluation of binarization methods for document images," IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, pp.312-315.
13 Y. Liu and S. Srihari, "Document image binarization based on texture features," IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997, pp.540-544.
14 Y. Chen and G. Leedham, "Decompose algorithm for thresholding degraded historical document images," IEEE Proceedings on Vision, Image, and Signal Processing, 2005, pp.702-714.