Browse > Article

Classification of Handwritten and Machine-printed Korean Address Image based on Connected Component Analysis  

장승익 (한국전자통신연구원 우정기술연구센터)
정선화 (한국전자통신연구원 우정기술연구센터)
임길택 (한국전자통신연구원 우정기술연구센터)
남윤석 (한국전자통신연구원 우정기술연구센터)
Abstract
In this paper, we propose an effective method for the distinction between machine-printed and handwritten Korean address images. It is important to know whether an input image is handwritten or machine-printed, because methods for handwritten image are quite different from those of machine-printed image in such applications as address reading, form processing, FAX routing, and so on. Our method consists of three blocks: valid connected components grouping, feature extraction, and classification. Features related to width and position of groups of valid connected components are used for the classification based on a neural network. The experiment done with live Korean address images has demonstrated the superiority of the proposed method. The correct classification rate for 3,147 testing images was about 98.85%.
Keywords
Optical character recognition (OCR); Address type identification; Address reading system; Multi-layer perceptrons;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Performance Comparison of Statistical and Neural Network Classifiers in Handwritten Digits Recognition /
[ S.Jeong;S.Kim;W.Cho ] / Proceedings of 6th International Workshop on Frontiers in Handwriting Recognition
2 A threshold selection method from gray-level histogram /
[ N.Otsu ] / IEEE Transactions on Systems, Man, and Cybernetics   ScienceOn
3 G. Dzuba, A. Filatov and A. Volgunin, 'Handwritten ZIP Code Recognition,' Proceedings of 4th International Conference on Document Analysis and Recognition, pp. 766-770, Ulm, Germany, 1997   DOI
4 N. Kata, K. Todumoto and Y. Nemoto, 'A Large Scale Japanese Handwritten Address Recognition System Using Rough and Fine Classification,' Proceedings of 5th International Conference on Signal Processing, Vol. 3, pp. 1423-1426, 2000   DOI
5 F. Kimura and M. Shridhar, 'Handwritten Address Interpretation Using Extended Lexicon Word Matching,' Proceedings of 5th International Workshop on Frontiers in Handwriting Recognition, pp. 369-372, Essex, England, 1996
6 K. Fan, L. Wang and Y. Tu, 'Classification of Machine-Printed and Hand-Written Texts Using Character Block Layout Variance,' Pattern Recognition, Vol. 31, No. 9, pp. 1275-1284, 1998   DOI   ScienceOn
7 U. Pal and B. Chaudhuri, 'Automatic Separation of Machine-Printed and Hand-Written Text Lines,' Proceedings of 5th International Conference on Document Analysis and Recognition, pp. 645-648, Bangalore, India, 1999   DOI
8 K. Kuhnke, L. Simonicini and Z. Kovacs-V, 'A System for Machine-Written and Hand-Written Character Distinction,' Proceedings of 3rd International Conference on Document Analysis and Recognition, pp. 811-814, 1995   DOI
9 S. Image, S. Tatsuta and T. Wada, 'Segmentation and Classification for Mixed Text/Image Documents Using Neural Network,' Proceedings of 2nd International Conference on Document Analysis and Recognition, pp. 930-934, 1993   DOI
10 J. Franke and M. Oberlandwr, 'Writing Style Detection by Statistical Combination of Classifiers in Form Reader Applications,' Proceedings of 2nd International Conference on Document Analysis and Recognition, pp. 581-584, 1993   DOI
11 S. Tsujimoto and H. Asada, 'Major Components of a Complete Text Reading System,' Proceeding IEEE, Vol. 80, No. 7, pp. 1133-1149, 1992   DOI   ScienceOn
12 N. Otsu, 'A threshold selection method from gray-level histogram,' IEEE Transactions on Systems, Man, and Cybernetics, Vol. 9, pp. 62-66, 1979   DOI   ScienceOn
13 S. Jeong, S. Kim and W. Cho, 'Performance Comparison of Statistical and Neural Network Classifiers in Handwritten Digits Recognition,' Proceedings of 6th International Workshop on Frontiers in Handwriting Recognition, pp. 419-428, Taejon, Korea, 1998
14 U. Mahadevan and S.N. Srihari, 'Parsing and Recognition of City, State, and ZIP Codes in Handwritten Addresses,' Proceedings of 5th International Conference on Document Analysis and Recognition, pp. 325-328, Bangalore, India, 1999   DOI
15 '순로구분 자동처리 시스템 개발' - 최종 연구개발보고서, 정보통신부, 2001
16 A. Brakensiek, J. Rottland, G. Rigall, 'Handwritten Address Recognition with Open Vocabulary Using Character N-grams,' Proceedings of 8th International Workshop on Frontiers in Handwriting Recognition, pp. 357-362, Niagara-on-the-Lake, Canada, 2002   DOI