A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors

Milevskiy, Igor;Ha, Jin-Young;

doi:10.5626/JCSE.2011.5.3.161

Journal of Computing Science and Engineering

Volume 5, Issue 3 / Pages 161-166 / 2011 / 1976-4677 (pISSN) / 2093-8020 (eISSN)

Korean Institute of Information Scientists and Engineers


Milevskiy, Igor (Department of Computer Science and Engineering, Kangwon National University) ;
Ha, Jin-Young (Department of Computer Science and Engineering, Kangwon National University)

Submitted: 2011.03.25
Reviewed: 2011.06.01
Published: 2011.09.30

https://doi.org/10.5626/JCSE.2011.5.3.161

Abstract

We present a fast algorithm for extracting and segmenting Korean text from subway signboard images that uses smartphone sensors to minimize computation time and memory usage. The algorithm covers the preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured with the smartphone held at an arbitrary angle is rotated by the detected angle, as if it had been taken with the smartphone held horizontally. Binarization is performed only once, on a subset of connected components rather than on the whole image area, which greatly reduces computation time. Text location is guided by a marker line that the user places over the region of interest in the binarized image via the smartphone touch screen. Text segmentation then uses the connected-component data obtained in the binarization step to cut the string into individual character images. The resulting data can be used as OCR input, thereby solving the most difficult part of OCR for text areas in natural scene images. The experimental results showed that the binarization algorithm of our method is 3.5 and 3.7 times faster than the Niblack and Sauvola adaptive-thresholding algorithms, respectively. In addition, our method achieved better quality than the other methods.
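For context on the two baselines named in the abstract, the following is a minimal sketch of Niblack and Sauvola adaptive thresholding applied to every pixel of a grayscale image. It is not the authors' method. The parameter defaults (`k=-0.2` for Niblack, `k=0.5` and `r=128` for Sauvola), the window radius, the function names, and the choice to treat bright pixels as foreground are all assumptions made for illustration. The per-pixel window statistics show why these baselines are costly on a full image, which is the cost the abstract says is avoided by binarizing only a subset of connected components.

```python
import math


def local_stats(img, y, x, radius):
    """Mean and standard deviation of the window centred on (y, x).

    The window is (2*radius+1) pixels wide and is clipped at the image border.
    """
    h, w = len(img), len(img[0])
    vals = [img[j][i]
            for j in range(max(0, y - radius), min(h, y + radius + 1))
            for i in range(max(0, x - radius), min(w, x + radius + 1))]
    mean = sum(vals) / len(vals)
    var = sum((v - mean) ** 2 for v in vals) / len(vals)
    return mean, math.sqrt(var)


def niblack_threshold(mean, std, k=-0.2):
    """Niblack: T = m + k * s (k is an assumed, commonly used default)."""
    return mean + k * std


def sauvola_threshold(mean, std, k=0.5, r=128.0):
    """Sauvola: T = m * (1 + k * (s / R - 1)) (k and R are assumed defaults)."""
    return mean * (1.0 + k * (std / r - 1.0))


def binarize(img, threshold_fn, radius=1):
    """Label a pixel 1 if it is brighter than its local threshold, else 0.

    Assumption: bright text on a darker background. Every pixel needs its own
    window statistics, so the cost grows with image area times window area.
    """
    out = []
    for y in range(len(img)):
        row = []
        for x in range(len(img[0])):
            mean, std = local_stats(img, y, x, radius)
            row.append(1 if img[y][x] > threshold_fn(mean, std) else 0)
        out.append(row)
    return out


if __name__ == "__main__":
    # Toy 3x3 image: one bright pixel on a dark background.
    toy = [[10, 10, 10],
           [10, 200, 10],
           [10, 10, 10]]
    print(binarize(toy, niblack_threshold))
    print(binarize(toy, sauvola_threshold))
```

Production implementations usually replace the naive window loop with integral images so that each window's mean and variance cost constant time, but the thresholds themselves are computed as above.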

References

J. Park, G. Lee, A. N. Lai, E. Kim, J. Lim, S. Kim, H. Yang, and S. Oh, "Automatic detection and recognition of shop name in outdoor signboard images," IEEE International Symposium on Signal Processing and Information Technology 2008, Sarajevo, Bosnia and Herzegovina, 2008, pp. 111-116. https://doi.org/10.1109/ISSPIT.2008.4775652
T. N. Dinh, J. H. Park, and G. S. Lee, "Low-complexity text extraction in Korean signboards for mobile applications," 8th IEEE International Conference on Computer and Information Technology 2008, Sydney, 2008, pp. 333-337. https://doi.org/10.1109/CIT.2008.4594697
N. Otsu, "A threshold selection method from gray-level histograms," IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62-66, Jan. 1979. https://doi.org/10.1109/TSMC.1979.4310076
J. He, Q. D. M. Do, A. C. Downton, and J. J. Kim, "A comparison of binarization methods for historical archive documents," Eighth International Conference on Document Analysis and Recognition, Seoul, Korea, 2005, pp. 538-542.
J. Canny, "A computational approach to edge detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-8, no. 6, pp. 679-698, 1986. https://doi.org/10.1109/TPAMI.1986.4767851
S. H. Lee, J. H. Seok, K. M. Min, and J. H. Kim, "Scene text extraction using image intensity and color information," Chinese Conference on Pattern Recognition, Nanjing, China, 2009, pp. 1-5.
S. H. Lee, M. S. Cho, K. Jung, and J. H. Kim, "Scene text extraction with edge constraint and text collinearity," 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, pp. 3983-3986.
J. Jung, E. Kim, S. H. Lee, and J. H. Kim, "Scene text separation using touch screen interface," Chinese Conference on Pattern Recognition, Nanjing, China, 2009, pp. 1-5.
A. N. Lai, K. N. Park, M. Kumar, and G. S. Lee, "Korean text extraction by local color quantization and k-means clustering in natural scene," First Asian Conference on Intelligent Information and Database Systems 2009, Dong Hoi, Vietnam, 2009, pp. 138-143.
K. S. Bae, K. K. Kim, Y. G. Chung, and W. P. Yu, "Character recognition system for cellular phone with camera," 29th Annual International Computer Software and Applications Conference 2005, Edinburgh, UK, 2005, pp. 539-544.
V. Fragoso, S. Gauglitz, S. Zamora, J. Kleban, and M. Turk, "TranslatAR: a mobile augmented reality translator," 2011 IEEE Workshop on Applications of Computer Vision, Kona, HI, 2011, pp. 497-502. https://doi.org/10.1109/WACV.2011.5711545
O. Shiku, K. Kawasue, and A. Nakamura, "A method for character string extraction using local and global segment crowdedness," Fourteenth International Conference on Pattern Recognition, Brisbane, Australia, 1998, pp. 1077-1080.

Cited by

New POI Construction with Street-Level Imagery vol.E96.D, pp.1, 2013, https://doi.org/10.1587/transinf.E96.D.129
Extracting Multiword Sentiment Expressions by Using a Domain-Specific Corpus and a Seed Lexicon vol.35, pp.5, 2013, https://doi.org/10.4218/etrij.13.0113.0093
Automatic Korean word spacing using Pegasos algorithm vol.49, pp.1, 2013, https://doi.org/10.1016/j.ipm.2012.05.004
Automatic gang graffiti recognition and interpretation vol.26, pp.05, 2017, https://doi.org/10.1117/1.JEI.26.5.051409
