http://dx.doi.org/10.4218/etrij.10.1510.0086

A Novel Character Segmentation Method for Text Images Captured by Cameras  

Lue, Hsin-Te (Institute of Computer Science and Information Engineering, National Central University)
Wen, Ming-Gang (Department of Information Management, National United University)
Cheng, Hsu-Yung (Institute of Computer Science and Information Engineering, National Central University)
Fan, Kuo-Chin (Institute of Computer Science and Information Engineering, National Central University)
Lin, Chih-Wei (Institute of Computer Science and Information Engineering, National Central University)
Yu, Chih-Chang (Department of Computer Science and Information Engineering, Vanung University)
Publication Information
ETRI Journal / v.32, no.5, 2010, pp. 729-739
Abstract
Due to the rapid development of mobile devices equipped with cameras, instant translation of any text seen in any context is possible. Mobile devices can serve as a translation tool by recognizing the text present in captured scenes. Images captured by cameras contain more external or unwanted effects that need not be considered in traditional optical character recognition (OCR). In this paper, we segment a text image captured by mobile devices into individual characters to facilitate OCR kernel processing. Text detection and text line construction must be performed before character segmentation. A novel character segmentation method that integrates touched character filters is applied to text images captured by cameras. In addition, periphery features are extracted from the segmented images of touched characters and fed as inputs to support vector machines to calculate confidence values. In our experiment, the accuracy rate of the proposed character segmentation system is 94.90%, which demonstrates the effectiveness of the proposed method.
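The abstract does not define how the periphery features are computed, so the following is only a minimal sketch under one common formulation: for each row and column of a binary character image, measure the normalized distance from each of the four borders to the first ink pixel. The function name `periphery_features`, the `#`-as-ink encoding, and the sample glyph are all hypothetical and are not taken from the paper.

```python
def periphery_features(glyph):
    """Return normalized left/right/top/bottom periphery distances.

    glyph: list of equal-length strings; '#' marks ink, anything else
    is background. This encoding is an assumption for illustration.
    """
    height = len(glyph)
    width = len(glyph[0])

    def first_ink(cells):
        # Index of the first ink pixel, or len(cells) if the line has none.
        for i, cell in enumerate(cells):
            if cell == "#":
                return i
        return len(cells)

    rows = [list(row) for row in glyph]
    cols = [[glyph[y][x] for y in range(height)] for x in range(width)]

    # Scan each row from the left and right borders, and each column
    # from the top and bottom borders, normalizing by the image size.
    left = [first_ink(r) / width for r in rows]
    right = [first_ink(r[::-1]) / width for r in rows]
    top = [first_ink(c) / height for c in cols]
    bottom = [first_ink(c[::-1]) / height for c in cols]
    return left + right + top + bottom


# Hypothetical 5x4 glyph shaped like the letter "L".
GLYPH_L = [
    "#...",
    "#...",
    "#...",
    "#...",
    "####",
]
features = periphery_features(GLYPH_L)
```

A vector of this kind could then be passed to a support vector machine to score a candidate segmentation. The classifier setup and training data used by the authors are not described in the abstract and are not reproduced here.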
Keywords
Webcam-based OCR; character segmentation; typographical structure; periphery features; dynamic programming