Correction for Misrecognition of Korean Texts in Signboard Images using Improved Levenshtein Metric

Lee, Myung-Hun;Kim, Soo-Hyung;Lee, Guee-Sang;Kim, Sun-Hee;Yang, Hyung-Jeong;

doi:10.3837/tiis.2012.02.016

KSII Transactions on Internet and Information Systems (TIIS)

Volume 6 Issue 2
/
Pages.722-733
/
2012
/
1976-7277(pISSN)
/
1976-7277(eISSN)

Korean Society for Internet Information (한국인터넷정보학회)

DOI QR Code

Correction for Misrecognition of Korean Texts in Signboard Images using Improved Levenshtein Metric

Lee, Myung-Hun (Media Service Group, Konan Technology Co. LTD.) ;
Kim, Soo-Hyung (Department of Computer Science, Chonnam National University) ;
Lee, Guee-Sang (Department of Computer Science, Chonnam National University) ;
Kim, Sun-Hee (Department of Computer Science, Carnegie Mellon University) ;
Yang, Hyung-Jeong (Department of Computer Science, Chonnam National University)

Received : 2011.09.05
Accepted : 2011.12.15
Published : 2012.02.28

https://doi.org/10.3837/tiis.2012.02.016 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Recently various studies on various applications using images taken by mobile phone cameras have been actively conducted. This study proposes a correction method for misrecognition of Korean Texts in signboard images using improved Levenshtein metric. The proposed method calculates distances of five recognized candidates and detects the best match texts from signboard text database. For verifying the efficiency of the proposed method, a database dictionary is built using 1.3 million words of nationwide signboard through removing duplicated words. We compared the proposed method to Levenshtein Metric which is one of representative text string comparison algorithms. As a result, the proposed method based on improved Levenshtein metric represents an improvement in recognition rates 31.5% on average compared to that of conventional methods.

Keywords

References

A. Wojciechowski and K. Siek, "Barcode scanning from mobile phone camera photos delivered via MMS: Case Study," in Proc. of 27th Int. Conf. on Conceptual Modeling, pp.218-227, Oct.2008.
D.M. Chen, S.S. Tsai, R.Vedantham, R. Grzeszczuk and B. Girod, "Streaming mobile augmented reality on mobile phones," in Proc. of 8th IEEE International Symposium on Mixed and Augmented Reality, pp.181-182, Oct.2009.
C. Mancas-Thillou and B. Gosselin, "Natural scene text understanding," Vision Systems: Segmentation and Pattern Recognition, pp.307-333, Jun.2007.
A. Canedo-Rodriguez, S.H. Kim, J.H. Kim and Y. Blanco-Fernandez, "English to Spanish translation of signboard images from mobile phone camera," in Proc. of IEEE SoutheastCon, pp.356-361, Mar.2009.
I. Haritaoglu, "Scene text extraction and translation for handheld devices," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, pp.408-413, Dec. 2001.
J. Yang, X. Chen, J. Zhang, Y. Zhang and A. Waibel, "Automatic detection and translation of text from natural scenes," in Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing, pp.2101-2104, May.2002.
M.L.Wick, M.G. Ross and E.G. Learned-Miller, "Context-Sensitive error correction: Using Topic Models to Improve OCR," in Proc. Of 9th Int. Conf. on Document Analysis and Recognition, pp.1168-1172, September, 2007.
W.S. Rosenbaum and J.J. Hilliard, "Multifont OCR Postprocessing System," IBM Journal of Research and Development, vol.19, pp.398-421, Jul.1975.
S. Dobrisek, J. Zibert, N. Pavesic and F. Mihelic, "An edit distance model for the approximate matching of timed strings," Pattern Analysis and Machine Intelligence, IEEE Transactions, vol.31, pp.736-741, Apr. 2009.
R.S. Boyer and J.S. Moore, "A fast string searching algorithm," Communications of the ACM, vol.20, pp.762-772, Oct.1977. https://doi.org/10.1145/359842.359859
V. Bansal and R.M.K. Sinha, "Integrating knowledge sources in Devanagari text recognition system," IEEE Transactions on Systems, Man, and Cybernetics, vol.30, pp.500-505, Jul.2000. https://doi.org/10.1109/3468.852443
H. Takashi, N.I. Amano and A. Yamashita, "A spelling correction method and its application to and OCR system," Pattern Recognition, vol.23, pp.363-377, May.1990. https://doi.org/10.1016/0031-3203(90)90023-E
T. Okuda, E. Tanaka and T. Kasai, "A method for the correction of Garbled Words based on the Levenshtein Metric," IEEE Transactions on Computers, vol.C-25, pp.172-178, Feb.1976.

KSII Transactions on Internet and Information Systems (TIIS)

Correction for Misrecognition of Korean Texts in Signboard Images using Improved Levenshtein Metric

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)