Text Region Verification in Natural Scene Images using Multi-resolution Wavelet Transform and Support Vector Machine

Bae Kyungsook;Choi Youngwoo;

doi:10.3745/KIPSTB.2004.11B.6.667

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 11B Issue 6
/
Pages.667-674
/
2004
/
1598-284X(pISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Text Region Verification in Natural Scene Images using Multi-resolution Wavelet Transform and Support Vector Machine

다해상도 웨이블릿 변환과 써포트 벡터 머신을 이용한 자연영상에서의 문자 영역 검증

배경숙 (한국전자통신연구원 지능형로봇연구단) ;
최영우 (숙명여자대학교 정보과학부)

Published : 2004.10.01

https://doi.org/10.3745/KIPSTB.2004.11B.6.667 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Extraction of texts from images is a fundamental and important problem to understand the images. This paper suggests a text region verification method by statistical means of stroke features of the characters. The method extracts 36 dimensional features from $16\times16$sized text and non-text images using wavelet transform - these 36 dimensional features express stroke and direction of characters - and select 12 sub-features out of 36 dimensional features which yield adequate separation between classes. After selecting the features, SVM trains the selected features. For the verification of the text region, each $16\times16$image block is scanned and classified as text or non-text. Then, the text region is finally decided as text region or non-text region. The proposed method is able to verify text regions which can hardly be distin guished.

이미지에서 문자 추출은 영상을 이해하기 위한 가장 기초적이고 중요한 문제이다. 본 논문에서는 문자의 획 특징을 이용하는 통계적인 방법으로 문자 영역을 검증하는 방법을 제안한다. 제안하는 방법은 $16\times16$ 크기의 텍스트와 비텍스트 이미지를 웨이블릿(wavelet) 변환하여 문자의 획과 방향성을 표현하는 35차원의 특징을 추출한다. 추출된 특징 중 변별력이 높은 특징만을 선택하여 SVM(Support Vector Machine) 분류기를 구성한다. 분류기론 이용하여 $16\times16$크기의 윈도우로 검증 영역을 스캔하면서, 각각의 윈도우를 텍스트와 비텍스트로 분류하고 최종적으로 검증 영역의 텍스트 여부를 결정한다. 제안한 방법을 적용함으로써 텍스트와 유사하여 구별하기 어려운 비텍스트 영역을 검증할 수 있었다.

Keywords

References

Anil K. Jain, Bin Yu, 'Automatic Text Location in Images and Video Frames,' Pattern Recognition, Vol.31, No.12, pp. 2055-2076, 1998 https://doi.org/10.1016/S0031-3203(98)00067-3
Yu Zhong, Kalle Karu, Anil K. Jain, 'Locating Text in Complex Images,' Pattern Recognition, Vol.28, No.10, pp. 1523-1535, 1995 https://doi.org/10.1016/0031-3203(95)00030-4
H. K. Kim, 'Efficient Automatic Text Location Method and Content-based Indexing and Structuring of Video Database,' Journal of Visual Communications and Image Representation, Vol.7, pp.336-344, 1996 https://doi.org/10.1006/jvci.1996.0029
H. Li, D. Doerman and O. Kia, 'Automatic Text Detection and Tracking in Digital Video,' IEEE Transactions on Image Processing, Vol.9, No.1, pp.147-156, 2000 https://doi.org/10.1109/83.817607
K. I. Kim, 'Texture-Based Approach For Text Detection In Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm,' IEEE Trans. Pattern Analysis and Machine Intelligence(TPAMI), to be published https://doi.org/10.1109/TPAMI.2003.1251157
K. Y. Jeong, K. Jung, E. Y. Kim and H. J. Kim, 'Neural Network-based Text Location for News Video Indexing,'Proceedings of International Conference of Image Processing, 1999 https://doi.org/10.1109/ICIP.1999.817127
Scott E Umbaugh, Computer Vision and Image Processing, Prentice Hall PTR, New Jersey, 1999
V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, 1995
C. Cortes, V. Vapnik, 'Support Vector Networks,' In Proceedings of Machine Learning, Vol.20, pp. 273-297, 1995 https://doi.org/10.1023/A:1022627411411
E. Gose, R. Johnsonbaugh, S. Jost, Pattern Recognition and Image Analysis, Prentice Hall PTR, 1996
김길천, 최우영, 변혜란, '명도 정보를 이용한 장면 텍스트 추출', 한국정보과학회 컴퓨터비젼 및 패턴인식연구회 추계워크샵 발표논문집, 서울, pp. 159-160, 2001
Yu Zhong, Kalle Karu, Anil K. Jain, 'Locating Text in Complex Images,' Pattern Recognition, Vol.28, No.10, pp. 1523-1535, 1995 https://doi.org/10.1016/0031-3203(95)00030-4
S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong and R. Young, 'ICDAR 2003 Robust Reading Competition,' ICDAR 2003, Vol.2, pp.682-687, 2003
David Doermann, Jian Liang and Huiping Li, 'Progress in Camera-Based Document Image Analysis,' ICDAR 2003, Vol.1, pp.606-616, 2003 https://doi.org/10.1109/ICDAR.2003.1227735

The KIPS Transactions:PartB (정보처리학회논문지B)

Text Region Verification in Natural Scene Images using Multi-resolution Wavelet Transform and Support Vector Machine

다해상도 웨이블릿 변환과 써포트 벡터 머신을 이용한 자연영상에서의 문자 영역 검증

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)