DOI QR코드

DOI QR Code

Correction for Misrecognition of Korean Texts in Signboard Images using Improved Levenshtein Metric

  • Lee, Myung-Hun (Media Service Group, Konan Technology Co. LTD.) ;
  • Kim, Soo-Hyung (Department of Computer Science, Chonnam National University) ;
  • Lee, Guee-Sang (Department of Computer Science, Chonnam National University) ;
  • Kim, Sun-Hee (Department of Computer Science, Carnegie Mellon University) ;
  • Yang, Hyung-Jeong (Department of Computer Science, Chonnam National University)
  • Received : 2011.09.05
  • Accepted : 2011.12.15
  • Published : 2012.02.28

Abstract

Recently various studies on various applications using images taken by mobile phone cameras have been actively conducted. This study proposes a correction method for misrecognition of Korean Texts in signboard images using improved Levenshtein metric. The proposed method calculates distances of five recognized candidates and detects the best match texts from signboard text database. For verifying the efficiency of the proposed method, a database dictionary is built using 1.3 million words of nationwide signboard through removing duplicated words. We compared the proposed method to Levenshtein Metric which is one of representative text string comparison algorithms. As a result, the proposed method based on improved Levenshtein metric represents an improvement in recognition rates 31.5% on average compared to that of conventional methods.

Keywords

References

  1. A. Wojciechowski and K. Siek, "Barcode scanning from mobile phone camera photos delivered via MMS: Case Study," in Proc. of 27th Int. Conf. on Conceptual Modeling, pp.218-227, Oct.2008.
  2. D.M. Chen, S.S. Tsai, R.Vedantham, R. Grzeszczuk and B. Girod, "Streaming mobile augmented reality on mobile phones," in Proc. of 8th IEEE International Symposium on Mixed and Augmented Reality, pp.181-182, Oct.2009.
  3. C. Mancas-Thillou and B. Gosselin, "Natural scene text understanding," Vision Systems: Segmentation and Pattern Recognition, pp.307-333, Jun.2007.
  4. A. Canedo-Rodriguez, S.H. Kim, J.H. Kim and Y. Blanco-Fernandez, "English to Spanish translation of signboard images from mobile phone camera," in Proc. of IEEE SoutheastCon, pp.356-361, Mar.2009.
  5. I. Haritaoglu, "Scene text extraction and translation for handheld devices," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, pp.408-413, Dec. 2001.
  6. J. Yang, X. Chen, J. Zhang, Y. Zhang and A. Waibel, "Automatic detection and translation of text from natural scenes," in Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp.2101-2104, May.2002.
  7. M.L.Wick, M.G. Ross and E.G. Learned-Miller, "Context-Sensitive error correction: Using Topic Models to Improve OCR," in Proc. Of 9th Int. Conf. on Document Analysis and Recognition, pp.1168-1172, September, 2007.
  8. W.S. Rosenbaum and J.J. Hilliard, "Multifont OCR Postprocessing System," IBM Journal of Research and Development, vol.19, pp.398-421, Jul.1975.
  9. S. Dobrisek, J. Zibert, N. Pavesic and F. Mihelic, "An edit distance model for the approximate matching of timed strings," Pattern Analysis and Machine Intelligence, IEEE Transactions, vol.31, pp.736-741, Apr. 2009.
  10. R.S. Boyer and J.S. Moore, "A fast string searching algorithm," Communications of the ACM, vol.20, pp.762-772, Oct.1977. https://doi.org/10.1145/359842.359859
  11. V. Bansal and R.M.K. Sinha, "Integrating knowledge sources in Devanagari text recognition system," IEEE Transactions on Systems, Man, and Cybernetics, vol.30, pp.500-505, Jul.2000. https://doi.org/10.1109/3468.852443
  12. H. Takashi, N.I. Amano and A. Yamashita, "A spelling correction method and its application to and OCR system," Pattern Recognition, vol.23, pp.363-377, May.1990. https://doi.org/10.1016/0031-3203(90)90023-E
  13. T. Okuda, E. Tanaka and T. Kasai, "A method for the correction of Garbled Words based on the Levenshtein Metric," IEEE Transactions on Computers, vol.C-25, pp.172-178, Feb.1976.