Text Location and Extraction for Business Cards Using Stroke Width Estimation

  • Zhang, Cheng Dong (Department of Electronics and Computer Engineering, Chonnam National University)
  • Lee, Guee-Sang (Department of Electronics and Computer Engineering, Chonnam National University)
  • Received : 2011.12.26
  • Reviewed : 2012.02.24
  • Published : 2012.03.28

Abstract

Text extraction and binarization are important pre-processing steps for text recognition, and the quality of text binarization strongly affects the accuracy of the recognition stage. The proposed method has two stages. In the first stage, line detection and shape feature analysis are applied to locate the business card and detect its shape in a complex environment. In the second stage, local regions that may contain text components are separated using the projection histogram. Within each local region, pixels are grouped into connected components using connected component labeling and the projection histogram. Each connected component is then classified as text or non-text, and non-text components are rejected, based on feature analysis such as the size of the connected component and its estimated stroke width.
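The second stage described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how stroke width is estimated, so the median horizontal run length is used here as an assumed stand-in, and the function names, the toy image `IMG`, and the thresholds in `is_text` are all hypothetical choices made for illustration.

```python
from collections import deque

def split_rows(img):
    """Split a binary image into horizontal bands using the row projection histogram."""
    profile = [sum(row) for row in img]  # foreground pixel count per row
    bands, start = [], None
    for y, count in enumerate(profile):
        if count > 0 and start is None:
            start = y                      # a band of foreground rows begins
        elif count == 0 and start is not None:
            bands.append((start, y))       # band ends at the first empty row
            start = None
    if start is not None:
        bands.append((start, len(img)))
    return bands

def label_components(img):
    """4-connected component labeling by breadth-first search; returns pixel lists."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, pixels = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    pixels.append((cy, cx))
                    for ny, nx in ((cy + 1, cx), (cy - 1, cx), (cy, cx + 1), (cy, cx - 1)):
                        if 0 <= ny < h and 0 <= nx < w and img[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(pixels)
    return comps

def stroke_width(pixels):
    """Assumed estimator: upper median of horizontal run lengths in the component."""
    rows = {}
    for y, x in pixels:
        rows.setdefault(y, []).append(x)
    runs = []
    for xs in rows.values():
        xs.sort()
        length = 1
        for a, b in zip(xs, xs[1:]):
            if b == a + 1:
                length += 1
            else:
                runs.append(length)
                length = 1
        runs.append(length)
    runs.sort()
    return runs[len(runs) // 2]

def is_text(pixels, min_area=4, max_area=500, max_stroke=3):
    """Keep components of plausible size and thin strokes (hypothetical thresholds)."""
    return min_area <= len(pixels) <= max_area and stroke_width(pixels) <= max_stroke

# Toy binary image: a thin "L" stroke (text-like) and a solid block (non-text).
IMG = [[int(c) for c in row] for row in [
    "0000000000",
    "0100000000",
    "0100000000",
    "0100000000",
    "0111000000",
    "0000000000",
    "0000111111",
    "0000111111",
    "0000111111",
    "0000000000",
]]

bands = split_rows(IMG)
components = label_components(IMG)
text_flags = [is_text(c) for c in components]
```

On the toy image the thin stroke is kept and the solid block is rejected because its run lengths are too long. On real card images the thresholds would need tuning relative to character height.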
