Browse > Article

Font Classification of English Printed Character using Non-negative Matrix Factorization  

Lee, Chang-Woo (Dept. of Computer Eng. At Kyungpook National Univ)
Kang, Hyun (Dept. of Computer Eng. At Kyungpook National Univ)
Jung, Kee-Chul (College of Information Science at Soongsil Univ)
Kim, Hang-Joon (Dept. of Computer Eng. At Kyungpook National Univ)
Publication Information
Abstract
Today, most documents are electronically produced and their paleography is digitalized by imaging, resulting in a tremendous number of electronic documents in the shape of images. Therefore, to process these document images, many methods of document structure analysis and recognition have already been proposed, including font classification. Accordingly, the current paper proposes a font classification method for document images that uses non-negative matrix factorization (NMF), which is able to learn part-based representations of objects. In the proposed method, spatially total features of font images are automatically extracted using NMF, then the appropriateness of the features specifying each font is investigated. The proposed method is expected to improve the performance of optical character recognition (OCR), document indexing, and retrieval systems, when such systems adopt a font classifier as a preprocessor.
Keywords
Font classification; NMF; Part-based unsupervised learning; OCR;
Citations & Related Records
연도 인용수 순위
  • Reference
1 G. Nagy, 'Twenty Years of Document Image Analysis in PAMI,' IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 38-62, 2000   DOI   ScienceOn
2 Y. Zhu, T. Tan, Y. Wang, 'Font Recognition Based on Global Texture Analysis,' IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. 23, no. 10, pp. 1192-1200, 2001   DOI   ScienceOn
3 D. D. Lee, H. S. Seung, 'Learning the Parts of Objects by Non-Negative Matrix Factorization,' Nature 401, pp. 788-791, 1999   DOI   ScienceOn
4 J. H. Bae, K. Jung, J. W. Kim, H. J. Kim, 'Segmentation of Touching Characters Us ing an MLP,' Pattern Recognition Letters, vol. 19, no. 8, pp, 701-709, 1998   DOI   ScienceOn
5 K. lung, 'Neural network-based Text Location in Color Images,' Pattern Recognition Letters, vol. 22, no. 14, pp, 1503-1515, 2001   DOI   ScienceOn
6 S. Khoubyari, J. J. Hull, 'Font and function word identification in document recognition,' Computer Vision and Image Understanding, vol. 63, no. 1, pp. 66-74, 1996   DOI   ScienceOn
7 H. Shi, T. Pavlidis, 'Font Recognition and Contextual Processing for More Accurate Text Recognition,' Proceedings of Document Analysis and Recognition '97, pp. 39-44, 1997   DOI
8 A. Zramdini, R. Ingold, 'Optical Font Re cognition Using Typographical Features,' IEEE Transaction on Pattern Analysis and Machine Intelligence, vol. 20, no. 8, pp. 877-882, 1998   DOI   ScienceOn
9 I. Biederman, 'Recognition-by-components: A theory of human understanding,' Pyschological Review, vol. 94, no. 2, pp, 115-147, 1987   DOI   ScienceOn
10 L. Shams, Development of Visual Shape Primitives, PhD thesis, University of Southern California, 1999
11 M. Weber, M. Welling, and P. Perona, 'Un supervised learning of models for recognition,' In Proc. of 6th European Conference of Computer Vision, 2000
12 H. S. Seung, 'Derivation of the objective function (Eq.2),' http://jounalclub.mit.edu
13 D. D. Lee, H. S. Seung, 'Algorithms for non-negative matrix factorization,' In Advances in Neural Information Processing Systems, 13, pp. 556-562, 2001
14 Y. Lu, 'Machine printed character segmentation - an overview,' Pattern Recognition, vol. 28, no. 1, pp. 67-80, 1995   DOI   ScienceOn
15 A. K. Jain and D. Zongker, 'Representation and Recognition of Handwritten Digits Using Defromable Templates,' IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 12, pp. 1386-1391, 1997   DOI   ScienceOn
16 B. Yu, A. K. lain, 'A Robust And Fast Skew Detection Algorithm for Generic Documents,' Pattern Recognition, vol 29, no. 10, pp. 1599-1629, 1996   DOI   ScienceOn
17 C. W. Lee, H. Kang, K. Jung, H. J. Kim, 'Font Classification Using NMF,' Lecture Notes in Computer Science 2756, pp. 470 -477, 2003