DOI QR코드

DOI QR Code

Touch TT: Scene Text Extractor Using Touchscreen Interface

  • Received : 2010.02.22
  • Accepted : 2010.09.20
  • Published : 2011.02.28

Abstract

In this paper, we present the Touch Text exTractor (Touch TT), an interactive text segmentation tool for the extraction of scene text from camera-based images. Touch TT provides a natural interface for a user to simply indicate the location of text regions with a simple touchline. Touch TT then automatically estimates the text color and roughly locates the text regions. By inferring text characteristics from the estimated text color and text region, Touch TT can extract text components. Touch TT can also handle partially drawn lines which cover only a small section of text area. The proposed system achieves reasonable accuracy for text extraction from moderately difficult examples from the ICDAR 2003 database and our own database.

Keywords

References

  1. N. Ezaki, M. Bulacu, and L. Schomaker, "Text Detection from Natural Scene Images: Towards a System for Visually Impaired Persons," Proc. ICPR, 2004, pp. 683-686.
  2. L. Xu et al., "Automatic Text Discovering through Stroke-Based Segmentation and Text String Combination," Proc. 16th ACM Int. Conf. Multimedia, 2008, pp. 805-808.
  3. D. Doermann, J. Liang, and H. Li, "Progress in Camera-Based Document Image Analysis," Proc. Seventh Int. Conf. Document Anal. Recog., 2003, pp. 606-616.
  4. C. Mancas-Thilloul, Natural Scene Text Understanding, doctoral dissertation, Presses Universitaires de Louvain.
  5. B. Gatos et al., "Text Detection in Indoor/Outdoor Scene Images," Proc. 1st Workshop Camera-Based Document Anal. Recog., 2005, pp. 127-132.
  6. K. Kim et al., "Scene Text Extraction in Natural Scene Images Using Hierarchical Feature Combining and Verification," Proc. 17th Int. Conf. Patt. Recog., vol. 2, 2004, pp. 679-682.
  7. J. Park, H. Yoon, and G. Lee, "Automatic Segmentation of Natural Scene Images Based on Chromatic and Achromatic Components," Lecture Notes Computer Sci., vol. 4418, 2007, pp. 482-493.
  8. Y. Li et al., "Lazy Snapping," Int. Conf. Graphics Interactive Techniques, 2004, pp. 303-308.
  9. C. Rother, V. Kolmogorov, and A. Blake, "GrabCut: Interactive Foreground Extraction Using Iterated Graph Cuts," ACM Trans. Graphics, vol. 23, no. 3, 2004, pp. 309-314. https://doi.org/10.1145/1015706.1015720
  10. E. Kim, S. Lee, and J.H. Kim, "Scene Text Extraction using Focus of Mobile Camera," Proc. 10th Int. Conf. Document Anal. Recog., 2009, p. 166-170.
  11. N. Nikolaou and N. Papamarkos, "Color Segmentation of Complex Document Images," VISAPP, 2006, pp. 251-263.
  12. W.K. Pratt, Digital Image Processing, New York, NY: Wiley, 1978.
  13. J. MacQueen, "Some Methods for Classification and Analysis of Multivariate Observations," Proc. Fifth Berkeley Symp. Mathematical Statistics Probability, 1967, pp. 281-297.
  14. D. Comaniciu and P. Meer, "Mean Shift: A Robust Approach Toward Feature Space Analysis." IEEE Trans. Patt. Anal. Mach. Intell., vol. 24, no. 5, 2002, pp. 603-619. https://doi.org/10.1109/34.1000236
  15. M. Sarifuddin and R. Missaoui, "A New Perceptually Uniform Color Space with Associated Color Similarity Measure for Content-Based Image and Video Retrieval," Proc. ACM SIGIR Workshop on Multimedia Inf. Retrieval, 2005, pp. 1-8.
  16. N. Otsu, "A Threshold Selection Method from Gray-level Histograms," IEEE Trans. Syst., Man, Cybern., vol. 9, no. 1, 1979, pp. 62-66. https://doi.org/10.1109/TSMC.1979.4310076
  17. J. Sauvola and M. Peitikainen, "Adaptive Document Image Binarization," Patt. Recog., vol. 33, no. 2, 2000, pp. 225-236. https://doi.org/10.1016/S0031-3203(99)00055-2
  18. S. Lucas et al., "ICDAR 2003 Robust Reading Competitions," Proc. ICDAR, 2003, pp. 682-687.

Cited by

  1. 영역분할에 의한 SLI와 벡터 지도 간의 건물영역 일치도 향상 vol.30, pp.4, 2011, https://doi.org/10.7848/ksgpc.2012.30.4.405
  2. New POI Construction with Street-Level Imagery vol.ed96, pp.1, 2011, https://doi.org/10.1587/transinf.e96.d.129
  3. Toward Integrated Scene Text Reading vol.36, pp.2, 2011, https://doi.org/10.1109/tpami.2013.126
  4. Text segmentation using superpixel clustering vol.11, pp.7, 2011, https://doi.org/10.1049/iet-ipr.2016.0914
  5. Comparative Analysis of Multi-scale Wavelet Decomposition and k-Means Clustering Based Text Extraction vol.109, pp.1, 2019, https://doi.org/10.1007/s11277-019-06574-w
  6. Generating Text Sequence Images for Recognition vol.51, pp.2, 2011, https://doi.org/10.1007/s11063-019-10166-x
  7. Unattached irregular scene text rectification with refined objective vol.463, pp.None, 2011, https://doi.org/10.1016/j.neucom.2021.08.047