Browse > Article

Automatic Text Extraction from News Video using Morphology and Text Shape  

Jang, In-Young (Dept.of Computer Science, Yonsei University)
Ko, Byoung-Chul (Dept.of Computer Science, Yonsei University)
Kim, Kil-Cheon (Dept.of Computer Science, Yonsei University)
Byun, Hye-Ran (Dept.of Computer Science, Yonsei University)
Abstract
In recent years the amount of digital video used has risen dramatically to keep pace with the increasing use of the Internet and consequently an automated method is needed for indexing digital video databases. Textual information, both superimposed and embedded scene texts, appearing in a digital video can be a crucial clue for helping the video indexing. In this paper, a new method is presented to extract both superimposed and embedded scene texts in a freeze-frame of news video. The algorithm is summarized in the following three steps. For the first step, a color image is converted into a gray-level image and applies contrast stretching to enhance the contrast of the input image. Then, a modified local adaptive thresholding is applied to the contrast-stretched image. The second step is divided into three processes: eliminating text-like components by applying erosion, dilation, and (OpenClose+CloseOpen)/2 morphological operations, maintaining text components using (OpenClose+CloseOpen)/2 operation with a new Geo-correction method, and subtracting two result images for eliminating false-positive components further. In the third filtering step, the characteristics of each component such as the ratio of the number of pixels in each candidate component to the number of its boundary pixels and the ratio of the minor to the major axis of each bounding box are used. Acceptable results have been obtained using the proposed method on 300 news images with a recognition rate of 93.6%. Also, my method indicates a good performance on all the various kinds of images by adjusting the size of the structuring element.
Keywords
Text extraction; Video indexing; Morphology;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Jae-Chang Shim, Chitra Dorai, Ruud Bolle, 'Automatic Text Extraction from Video for Content-Based Annotation and Retrieval,' Pattern Recognition, 1998 Proceedings. Fourteenth International Conference on, On page(s): 618-620, vol.1 16-20 Aug. 1998   DOI
2 Anil K. Jain, Bin Yu, 'Automatic text location in images and video frames,' Pattern Recognition, Vol. 31, No. 12, pp. 2055-2076, 1998   DOI   ScienceOn
3 H.Kuwano, Y.Taniguchi, H.Arai, M.Mori, S.Kuraka-ke, H.Kojima, 'Telop-on-demand: video structuring and retrieval base on text recognition,' Multimedia and Expo, 2000 ICME 2000, 2000 IEEE International Conference on, On page(s): 759-762, vol.2, 30 July-2 Aug. 2000   DOI
4 Sameer Antani, Ullas Gargi, David Crandall, Tarak Gandhi and Rangachar Kasturi, 'Extraction of Text in Video,' Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., Technical Report, CSE-99-016, August 30, 1999
5 S. Antani, D. Crandall, R. Kasturi, 'Robust extraction of text in video,' Pattern Recognition, 2000 Proceedings. 15th International Conference on, Volume: 1, 2000, Page(s): 831-834 vol.1   DOI
6 S. Messelodi and C.M. Modena, 'Automatic identification and skew estimation of text lines in real scene images,' Pattern Recognition, Vol. 32 (5) (1999) pp. 791-810   DOI   ScienceOn
7 U. Gargi, S. Antani, R. Kasturi, 'Indexing text events in digital video databases,' Pattern Recognition, 1998 Proceedings. Fourteenth International Conference on, On page(s): 916-918 vol.1 16-20 Aug. 1998   DOI
8 J. Ohya, A. Shio, S. Akamatsu, 'Recognizing characters in scene image,' IEEE Trans. Pattern Anal. Mach. Intell. PAMI-16(2) (1994) 214-220   DOI   ScienceOn
9 C. M. Lee, A. Kankanhalli, 'Autometic extraction of characters in complex images,' Int. J. Pattern Recognition Artificial Intell. ( (1) (1995) 67-82   DOI   ScienceOn
10 H. K. Kim, 'Efficient automatic text location method and content-based indexing and structuring of video database,' J. Visual Commun. Image Representation 7 (4) (1996) 336-344   DOI   ScienceOn
11 Y. Lu, 'Machine printed character segmentation- An overview,' Pattern Recognition, 28, 1995, 67-80   DOI   ScienceOn
12 J. Serra, Image Analysis and Mathematical Morphology. New York: Academic, 1982
13 R. Lienhart, F. Stuber, 'Autometic text recognition in digital videos,' Imege and Video Proceeding IV 1996, SPIE 2666-20, 1996   DOI
14 M. A. Smith, T. Kanade, 'Video skimming for quick browsing base on audio and image characterization,' Technical Report CMU-CS-95-186, Carnegie Mellon University, July 1995
15 Y. Zhong, K. Karu, A. K. Jain, 'Locating text in complex color images,' Pattern Recognition, 28 (10) (1995) 1523-1535   DOI   ScienceOn
16 F. Lebourgeois, 'Robust Multifont OCR System from Gray Level Image,' in International Conference on Document Analysis and Recognition, vol. 1, pp.1-5, 1997   DOI
17 Pyeoung-Kee Kim, 'Automatic Text Location in Complex Color Images using Local Color Quantization,' TENCON 99. Proceedings of the IEEE Region 10 Conference, Volume: 1, pp. 629-632, 1999   DOI
18 M. Bertini, C. Colombo, A. Del Isimbo, 'Automatic Caption Localization in Video using Salient Points,' IEEE Int. Conf. On Multimedia and Expo. pp69-72, 2001