DOI QR코드

DOI QR Code

An End-to-End Sequence Learning Approach for Text Extraction and Recognition from Scene Image

  • Lalitha, G. (Department of Computer Science, University of Madras) ;
  • Lavanya, B. (Department of Computer Science, University of Madras)
  • Received : 2022.07.05
  • Published : 2022.07.30

Abstract

Image always carry useful information, detecting a text from scene images is imperative. The proposed work's purpose is to recognize scene text image, example boarding image kept on highways. Scene text detection on highways boarding's plays a vital role in road safety measures. At initial stage applying preprocessing techniques to the image is to sharpen and improve the features exist in the image. Likely, morphological operator were applied on images to remove the close gaps exists between objects. Here we proposed a two phase algorithm for extracting and recognizing text from scene images. In phase I text from scenery image is extracted by applying various image preprocessing techniques like blurring, erosion, tophat followed by applying thresholding, morphological gradient and by fixing kernel sizes, then canny edge detector is applied to detect the text contained in the scene images. In phase II text from scenery image recognized using MSER (Maximally Stable Extremal Region) and OCR; Proposed work aimed to detect the text contained in the scenery images from popular dataset repositories SVT, ICDAR 2003, MSRA-TD 500; these images were captured at various illumination and angles. Proposed algorithm produces higher accuracy in minimal execution time compared with state-of-the-art methodologies.

Keywords

Acknowledgement

This research did not receive any specific grant from funding agencies in the public, commercial, or not for profit sectors.

References

  1. Wei, Yuanwang, et al. "Text detection in scene images based on exhaustive segmentation." Signal Processing: Image Communication 50 (2017): 1-8. https://doi.org/10.1016/j.image.2016.10.003
  2. Zheng, Yang, et al. "A cascaded method for text detection in natural scene images." Neurocomputing 238 (2017): 307-315. https://doi.org/10.1016/j.neucom.2017.01.066
  3. Ye, Qixiang, et al. "Fast and robust text detection in images and video frames." Image and Vision Computing 23.6 (2005): 565-576. https://doi.org/10.1016/j.imavis.2005.01.004
  4. GonzaLez, ALvaro, and Luis Miguel Bergasa. "A text reading algorithm for natural images." Image and Vision Computing31.3 (2013): 255-274. https://doi.org/10.1016/j.imavis.2013.01.003
  5. Yi, Chucai, and YingLi Tian. "Text string detection from natural scenes by structure-based partition and grouping." IEEE Transactions on Image Processing 20.9 (2011): 2594-2605. https://doi.org/10.1109/TIP.2011.2126586
  6. Kumuda, T., and L. Basavaraj. "Hybrid Approach to Extract Text in Natural Scene Images." International Journal of Computer Applications 142.10 (2016).
  7. Sahare, Parul, and Sanjay B. Dhok. "Review of text extraction algorithms for scene-text and document images." IETE Technical Review 34.2 (2017): 144-164. https://doi.org/10.1080/02564602.2016.1160805
  8. Bhattacharya, Ujjwal, Swapan Kumar Parui, and Srikanta Mondal. "Devanagari and bangla text extraction from natural scene images." Document Analysis and Recognition, 2009. ICDAR'09. 10th International Conference on. IEEE, 2009.(4)
  9. Ezaki, Nobuo, Marius Bulacu, and Lambert Schomaker. "Text detection from natural scene images: towards a system for visually impaired persons." Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. Vol. 2. IEEE, 2004.(5)
  10. Pan, Yi-Feng, Cheng-Lin Liu, and Xinwen Hou. "Fast scene text localization by learning-based filtering and verification." Image Processing (ICIP), 2010 17th IEEE International Conference on. IEEE, 2010.(7)
  11. Gllavata, Julinda, Ralph Ewerth, and Bernd Freisleben. "Text detection in images based on unsupervised classification of high-frequency wavelet coefficients." Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on. Vol. 1. IEEE, 2004.(8)
  12. Yi, Chucai, and Yingli Tian. "Text detection in natural scene images by stroke gabor words." Document Analysis and Recognition (ICDAR), 2011 International Conference on. IEEE, 2011.(9)
  13. Liu, Chunmei, Chunheng Wang, and Ruwei Dai. "Text detection in images based on unsupervised classification of edge-based features." Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on. IEEE, 2005.
  14. Epshtein, Boris, Eyal Ofek, and Yonatan Wexler. "Detecting text in natural scenes with stroke width transform." 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, 2010.
  15. Li, Yao, and Huchuan Lu. "Scene text detection via stroke width." Pattern Recognition (ICPR), 2012 21st International Conference on. IEEE, 2012.
  16. Borgefors, Gunilla. "Distance transformations in digital images." Computer vision, graphics, and image processing 34.3 (1986): 344-371. https://doi.org/10.1016/s0734-189x(86)80047-0
  17. Abhishek, L. K. "Thinning approach in digital image processing." Special Issue-SACAIM (2017): 326-330.
  18. Wolf, Christian, and Jean-Michel Jolion. "Object count/area graphs for the evaluation of object detection and segmentation algorithms." International Journal of Document Analysis and Recognition (IJDAR) 8.4 (2006): 280-296. https://doi.org/10.1007/s10032-006-0014-0
  19. Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, Shiliang Pu, and Shuigeng Zhou. Aon: Towards arbitrarily-oriented text recognition. In CVPR, 2018.
  20. Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, and Xiang Bai. Robust scene text recognition with automatic rectification. In CVPR, pages 4168-4176, 2016.
  21. Mishra, Anand, Karteek Alahari, and C. V. Jawahar. "Top-down and bottom-up cues for scene text recognition." 2012 IEEE conference on computer vision and pattern recognition. IEEE, 2012.
  22. Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. Reading text in the wild with convolutional neural networks. IJCV, 2016.