A Stroke-Based Text Extraction Algorithm for Digital Videos

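  • Jong-Myeon Jeong (정종면; Division of Marine Electronics and Communication Engineering, Mokpo National Maritime University);
  • Ji-Hun Cha (차지훈; Radio and Broadcasting Research Division, Electronics and Telecommunications Research Institute);
  • Kyu-Heon Kim (김규헌; College of Electronics and Information, Kyung Hee University)
  • Published: 2007.06.30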


This paper proposes a stroke-based text extraction algorithm for digital video. The algorithm consists of four stages: text detection, text localization, text segmentation, and geometric verification of the segmented text. The text detection stage finds the frames that contain text among the continuously incoming frames of a video sequence: it extracts seed points, the pixels most likely to belong to stroke-based text, from a given frame and then applies morphological operations to them. The text localization stage finds where the text lies within such a frame by applying morphological operations to the edges that contain seed points, followed by horizontal and vertical projections. The text segmentation stage classifies the projected areas into text and background regions according to their intensity distribution, taking the color distributions of text and background and the complexity of the background into account so that segmentation is robust. Finally, the geometric verification stage checks the segmented areas against prior knowledge of video text characteristics to produce the final result.
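The projection-based localization and the geometric verification described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a binary mask of candidate text pixels is already available (seed-point extraction and the morphological operations are not shown), and the function names `runs`, `localize`, and `verify`, as well as the thresholds `min_count`, `min_height`, and `min_aspect`, are hypothetical choices made only for illustration.

```python
from typing import List, Tuple

Mask = List[List[int]]            # binary image: 1 = candidate text pixel
Box = Tuple[int, int, int, int]   # (top, bottom, left, right), end-exclusive


def runs(profile: List[int], min_count: int) -> List[Tuple[int, int]]:
    """Return [start, end) index ranges where the profile is >= min_count."""
    out, start = [], None
    for i, value in enumerate(profile):
        if value >= min_count and start is None:
            start = i                      # a run begins
        elif value < min_count and start is not None:
            out.append((start, i))         # the run ends before index i
            start = None
    if start is not None:
        out.append((start, len(profile)))  # run reaches the end of the profile
    return out


def localize(mask: Mask, min_count: int = 1) -> List[Box]:
    """Horizontal projection finds row bands; vertical projection splits each band."""
    boxes: List[Box] = []
    row_profile = [sum(row) for row in mask]             # horizontal projection
    for top, bottom in runs(row_profile, min_count):
        band = mask[top:bottom]
        col_profile = [sum(col) for col in zip(*band)]   # vertical projection
        for left, right in runs(col_profile, min_count):
            boxes.append((top, bottom, left, right))
    return boxes


def verify(boxes: List[Box], min_height: int = 2, min_aspect: float = 1.5) -> List[Box]:
    """Keep boxes that are tall enough and wider than tall (assumed prior knowledge)."""
    kept = []
    for top, bottom, left, right in boxes:
        height, width = bottom - top, right - left
        if height >= min_height and width / height >= min_aspect:
            kept.append((top, bottom, left, right))
    return kept


# Toy mask: one wide text-like blob and one isolated noise pixel.
demo_mask = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 1, 0, 0],
    [0, 1, 0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

candidates = localize(demo_mask)   # both blobs become candidate boxes
accepted = verify(candidates)      # the single-pixel blob fails the height check
```

On the toy mask, the projections yield two candidate boxes, and the assumed geometric constraints discard the isolated pixel while keeping the wide blob. Real caption constraints (size, aspect ratio, position) would come from the prior knowledge of video text that the abstract mentions but does not specify.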



Cited by

  1. Size-Independent Caption Extraction for Korean Captions with Edge Connected Components, vol.12, pp.4, 2012