DOI QR코드

DOI QR Code

Size-Independent Caption Extraction for Korean Captions with Edge Connected Components

  • Jung, Je-Hee (Department of Electrical and Computer Engineering, Sungkyunkwan University) ;
  • Kim, Jaekwang (Department of Electrical and Computer Engineering, Sungkyunkwan University) ;
  • Lee, Jee-Hyong (Department of Electrical and Computer Engineering, Sungkyunkwan University)
  • Received : 2012.11.29
  • Accepted : 2012.12.24
  • Published : 2012.12.25

Abstract

Captions include information which relates to the images. In order to obtain the information in the captions, text extraction methods from images have been developed. However, most existing methods can be applied to captions with a fixed height or stroke width using fixed pixel-size or block-size operators which are derived from morphological supposition. We propose an edge connected components based method that can extract Korean captions that are composed of various sizes and fonts. We analyze the properties of edge connected components embedding captions and build a decision tree which discriminates edge connected components which include captions from ones which do not. The images for the experiment are collected from broadcast programs such as documentaries and news programs which include captions with various heights and fonts. We evaluate our proposed method by comparing the performance of the latent caption area extraction. The experiment shows that the proposed method can efficiently extract various sizes of Korean captions.

Keywords

References

  1. R. Lyu, J. Song, and M. Cai, "A Comprehensive Method for Multilingual Video Text Detection, Localization and Extraction,"IEEE Transaction on Circuits and Systems for Video Technology, vol. 15, no. 2, pp. 243-255,2005. https://doi.org/10.1109/TCSVT.2004.841653
  2. J.-M. Jeong, J. Cha, and K. Kim, "A Stroke-Based Text Extraction Algorithm for Digital Videos," International Journal of Fuzzy Logic and Intelligent Systems, vol. 17, no. 3, pp. 297-303, 2007. https://doi.org/10.5391/JKIIS.2007.17.3.297
  3. K.C. Jung, K.I. Kim, and A.K. Jain, "Text Information Extraction in Images and Video: A Survey,"Journal on Pattern Recognition, vol. 37, no. 5, pp. 977-997, 2004. https://doi.org/10.1016/j.patcog.2003.10.012
  4. E.K. Wong and M. Chen, "A Robust Algorithm for Text Extraction in Color Video," the Proc. of the IEEE Multimedia and Expo 2000 (ICME 2000), vol. 2, pp. 797-800, 2000.
  5. K.C. Jung and E.Y, Kim, "Automatic Text Extraction for Content-Based Image Indexing,"Lecture Notes in Computer Science, vol. 3056, pp. 497-507, 2004.
  6. J.-H. Jung, T.-B.Yoon, D.-M.Kim, and J.-H.Lee, "Connected Component-Based and Size-Independent Caption Extraction with Neural Networks,"Journal of Korean Institute of Intelligent Systems, vol.17, no.7, pp.924-929, 2007. https://doi.org/10.5391/JKIIS.2007.17.7.924
  7. J.-M.Jeong, J.-H.Cha, and K.-H. Kim, "A Stroke-Based TextExtraction Algorithm for Digital Videos,"Journal of Korean Institute of Intelligent Systems, vol.17, no.3, pp.297-303, 2007. https://doi.org/10.5391/JKIIS.2007.17.3.297
  8. H. Byun, I. Jang, and Y. Choi, "Text Extraction in Digital News Video Using Morphology," Lecture Notes in Computer Science, vol. 2423, pp. 341-352, 2002.
  9. Y.M.Y. Hasan and L.J. Karam, "Morphological Text Extraction from Images," IEEE Transactions on Image Processing, vol. 9, no. 11, pp. 1978-1983, 2000. https://doi.org/10.1109/83.877220
  10. H.E. Jiaying, L.I. Shaofa, "Hybrid Chinese/English Text Identification in Web Images," Proc. of the 3rd International Conference Image and Graphics (ICIG '04), pp. 361-364, 2004.
  11. www.Stephen Wright.org/Korean
  12. J. Song, M. Cai, and M.R. Lyu, "A Robust Statistic Method for Classifying Color Polarity of Video Text," Proc. of the IEEE International Conference Acoustics Speech and Signal Processing(ICASSP '03), vol. 3, pp. 581-584, 2003.