텐서보팅을 이용한 텍스트 배열정보의 획득과 이를 이용한 텍스트 검출

Extraction of Text Alignment by Tensor Voting and its Application to Text Detection

  • 이귀상 (전남대학교 전자컴퓨터공학부) ;
  • 또안 (전남대학교 전자컴퓨터공학부) ;
  • 박종현 (전남대학교 전자컴퓨터공학부)
  • 발행 : 2009.11.15

초록

본 논문에서는 이차원 텐서보팅과 에지 기반 방법을 이용하여 자연영상에서 문자를 검출하는 새로운 방법을 제시한다. 텍스트의 문자들은 보통 연속적인 완만한 곡선 상에 배열되어 있고 서로 가깝게 위치하며, 이러한 특성은 텐서보팅에 의하여 효과적으로 검출될 수 있다. 이차원 텐서보팅은 토큰의 연속성을 curve saliency 로 산출하며 이러한 특성은 다양한 영상해석에 사용된다. 먼저 에지 검출을 이용하여 영상 내의 텍스트 영역이 위치할 가능성이 있는 텍스트 후보영역을 찾고 이러한 후보영역의 연속성을 텐서보팅에 의해 검증하여 잡음영역을 제거하고 텍스트 영역만을 구분한다. 실험 결과, 제안된 방법은 복잡한 자연영상에서 효과적으로 텍스트 영역을 검출함을 확인하였다.

A novel algorithm using 2D tensor voting and edge-based approach is proposed for text detection in natural scene images. The tensor voting is used based on the fact that characters in a text line are usually close together on a smooth curve and therefore the tokens corresponding to centers of these characters have high curve saliency values. First, a suitable edge-based method is used to find all possible text regions. Since the false positive rate of text detection result generated from the edge-based method is high, 2D tensor voting is applied to remove false positives and find only text regions. The experimental results show that our method successfully detects text regions in many complex natural scene images.

키워드

참고문헌

  1. P. Clark and M. Mirmehdi, 'Recognizing text in real scenes,' International Journal on Document Analysis and Recognition 4, no.4, pp.243-257, 2008 https://doi.org/10.1007/s10032-001-0072-2
  2. G. Julinda, E. Ralph and F. Bernd, 'A Robust algorithm for Text detection in images,' Proceedings of the 3rd international symposium on Image and Signal Processing and Analysis, 2003
  3. K. Jung, K.I. Kim, A.K. Jain, 'Text information extraction in images and video: a survey,' Pattern Recognition, pp.977-997, 2004
  4. J. Liang, D. Doermann, and H. Li, 'Camera-based analysis of text and documents: a survey,' International Journal on Document Analysis and Recognition 7, pp.84-104, 2005 https://doi.org/10.1007/s10032-004-0138-z
  5. B. K. Sin, S. K. Kim, and B. J. Cho, 'Locating characters in scene images using frequency features,' International conference on pattern recognition, pp.489-492, 2002 https://doi.org/10.1109/ICPR.2002.1047983
  6. D. Crandall, S. Antani, and R. Kasturi, 'Extraction of special effects caption text events from digital video,' International Journal of Document Analysis and Recognition 5, no.2-3, pp.138-157, 2005
  7. Q. Ye, Q. Huang, W. Gao, and D. Zhao, 'Fast and Robust Text Detection in Images and Video Frames,' Image and Vision Computing 23, no.6, pp.565-576, 2005 https://doi.org/10.1016/j.imavis.2005.01.004
  8. J. Samarabandu and X. P. Liu, 'An edge-based text region extraction algorithm for indoor mobile robot navigation,' International Journal of Signal Processing, pp.273-280, 2006
  9. L. Xiaoqing, S. Jagath, 'Multiscale Edge-Based Text Extraction from Complex Images,' ICME, pp.1721-1724. 2006
  10. J. Hoya, A. Shio and S. Akamatsu, 'Recognizing Characters in Scene Images,' IEEE Trans. Pattern Analysis and Machine Intelligence, vol.16. no.2, pp.67-82, 1995
  11. R. Lienhart, F. Stuber, 'Automatic Text Recognition in Digital Videos,' Image and Video Proc. IV, SPIE, 1996
  12. G. Medioni, M.S. Lee, C.K. Tang, 'A Computational Framework for Segmentation and Grouping,' Elsevier, Amsterdam, 2000
  13. W.S. Tong, C.K. Tang, and G. Medioni, 'First Order Tensor Voting, and Application to 3-D Scale Analysis,' Proc. CVPR, pp.175-182, 2001 https://doi.org/10.1109/CVPR.2001.990473
  14. Jaeguyn Lim, Jonghyun Park, Gerard G. Medioni, 'Text segmentation in color images using tensor voting,' Image and Vision Computing, vol.25, Issue 5, pp.671-685, 2007 https://doi.org/10.1016/j.imavis.2006.05.011
  15. J.H. Park, J.M. Yoo and G.S. Lee, 'A Tensor Voting for Corrupted Region Inference and Text Image Segmentation,' International Multimedia Modelling Conference, LNCS, vol.4351, pp. 751-761, 2007
  16. J. Jia, C.K. Tang, 'Inference of Segmented Color and Texture Description by Tensor Voting,' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, no.6, pp.771-786, 2004 https://doi.org/10.1109/TPAMI.2004.10
  17. J. Canny, 'A Computational Approach To Edge Detection,' IEEE Trans. Pattern Analysis and Machine Intelligence, pp.679-714, 1986 https://doi.org/10.1109/TPAMI.1986.4767851
  18. Sneha Sharma, 'Extraction of Text Regions in Natural Images,' Master Thesis from Rochester Institute of Technology, 2007