A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors

  • Milevskiy, Igor (Department of Computer Science and Engineering, Kangwon National University)
  • Ha, Jin-Young (Department of Computer Science and Engineering, Kangwon National University)
  • Received: 2011.03.25
  • Reviewed: 2011.06.01
  • Published: 2011.09.30

Abstract

We present a fast algorithm for Korean text extraction and segmentation from subway signboard images that uses smartphone sensors to minimize computation time and memory usage. The algorithm covers the preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured while the smartphone is held at an arbitrary angle is rotated by the detected angle, so that it appears as if it had been taken with the smartphone held horizontally. Binarization is performed only once, on a subset of connected components rather than on the whole image area, which greatly reduces computation time. Text location is guided by a marker line that the user places over the region of interest in the binarized image via the smartphone touch screen. Text segmentation then uses the connected-component data obtained in the binarization step to cut the string into individual images of the designated characters. The resulting data can be used as OCR input, which addresses the most difficult part of OCR for text areas in natural scene images. Experimental results showed that the binarization algorithm of our method is 3.5 and 3.7 times faster than the Niblack and Sauvola adaptive-thresholding algorithms, respectively. In addition, our method achieved better quality than other methods.
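
The abstract does not give formulas, so the following is only a minimal sketch of two ideas it mentions, under stated assumptions. `roll_angle_degrees` is a hypothetical helper showing one common way a tilt angle could be derived from accelerometer readings (the axis and sign conventions are assumed and vary by platform); it is not taken from the paper. `niblack_threshold` and `sauvola_threshold` implement the standard textbook definitions of the two baseline adaptive-thresholding methods for a single local window, with commonly used default parameters (`k`, `r`) that are assumptions here, not values reported by the authors.

```python
import math


def roll_angle_degrees(ax, ay):
    """Tilt of the device in the screen plane, from accelerometer x/y readings.

    Assumption: (ax, ay) is the gravity component along the screen's x and y
    axes, and an upright device reads ax = 0, ay > 0.
    """
    return math.degrees(math.atan2(ax, ay))


def window_stats(pixels):
    """Mean and (population) standard deviation of the gray levels in a window."""
    n = len(pixels)
    mean = sum(pixels) / n
    variance = sum((p - mean) ** 2 for p in pixels) / n
    return mean, math.sqrt(variance)


def niblack_threshold(pixels, k=-0.2):
    """Niblack local threshold: T = m + k * s."""
    mean, std = window_stats(pixels)
    return mean + k * std


def sauvola_threshold(pixels, k=0.5, r=128.0):
    """Sauvola local threshold: T = m * (1 + k * (s / R - 1))."""
    mean, std = window_stats(pixels)
    return mean * (1.0 + k * (std / r - 1.0))


if __name__ == "__main__":
    window = [0, 0, 255, 255]  # hypothetical 2x2 gray-level window
    print(roll_angle_degrees(0.0, 9.81))
    print(niblack_threshold(window))
    print(sauvola_threshold(window))
```

Both baselines evaluate a mean and standard deviation in a window around every pixel of the image, which suggests why restricting binarization to a subset of connected components, as the abstract describes, can save time.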
