Browse > Article
http://dx.doi.org/10.3837/tiis.2012.03.010

A Novel Video Image Text Detection Method  

Zhou, Lin (Zhengzhou Information Science and Technology Institute)
Ping, Xijian (Zhengzhou Information Science and Technology Institute)
Gao, Haolin (Zhengzhou Information Science and Technology Institute)
Xu, Sen (Yancheng Institute of Technology)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.6, no.3, 2012 , pp. 941-953 More about this Journal
Abstract
A novel and universal method of video image text detection is proposed. A coarse-to-fine text detection method is implemented. Firstly, the spectral clustering (SC) method is adopted to coarsely detect text regions based on the stationary wavelet transform (SWT). In order to make full use of the information, multi-parameters kernel function which combining the features similarity information and spatial adjacency information is employed in the SC method. Secondly, 28 dimension classifying features are proposed and support vector machine (SVM) is implemented to classify text regions with non-text regions. Experimental results on video images show the encouraging performance of the proposed algorithm and classifying features.
Keywords
Text detection; SC algorithm; multi-parameters kernel function; SVM;
Citations & Related Records
연도 인용수 순위
1 Q.X. Ye, Q. M. Huang, W. Gao and D. B. Zhao, "Fast and robust text detection in images and video frames," Image and Vision Computing, vol.23, no.6, pp.565-576, Jun.2005.   DOI   ScienceOn
2 K. I. Kim, K. Jung and J. H. Kim, "Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, no.12, pp.1631-1639, Dec.2003.   DOI   ScienceOn
3 Z. Ji, J. Wang, Y. T. Su, "Text detection in video frames using hybrid features," in Proc. of the 8th International Conference on Machine Learning and Cybernetics, pp.318-342, Jul.2009.
4 Y. Song, A. N. Liu, L Pang, S. X. Lin, Y. D. Zhang and S. Tang, "A novel image text extraction method based on k-means clustering," in Proc. of the 7th IEEE/ACIS International Conference on Computer and Information Science, pp.318-342, Jul.2009.
5 R. Guo, Y. H. Peng and H. L Wan, "Palmprint feature extraction and recognition based on stationary wavelet transform," Computer Engineering and Applications, vol.42, no.17, pp.62-65, Jul.2006.
6 K. Ersahin, I. G. Cumming and R. K. Ward, "Segmentation and classification of polarimetric SAR data using spectral graph partitioning," IEEE Transaction on Geoscience and Remote Sensing, vol.48, no.1, pp.164-174, Jan.2010.   DOI
7 C. Alzate and J. A. K. Suykens, "Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA," IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.32, no.2, pp.335-347, Feb.2010.   DOI
8 T. Sakai, A. Imiya, "Randomized algorithm of spectral clustering and image/video segmentation using a minority of pixels," in Proc. of the 12th IEEE International Conference of Computer Vision Workshops, pp.468-475, Sept.2009.
9 N. Iam-on, T. Boongoen and S. Garrett, "Refining pairwise similarity matrix for cluster ensemble problem with cluster relations." in Proc. of the 7th IEEE International Conference on Discovery Science, vol.5255, pp.222-233, 2008.
10 U. V. Luxburg, "A tutorial on spectral clustering," Journal of Statistics and Computing, vol.17, no.4, pp.395-416, Dec.2007.   DOI   ScienceOn
11 Q. X. Ye and Q. M. Huang, "A new text detection algorithm in images/video frames," Lecture Notes in Computer Science, vol.3332, pp.858-865, 2004.
12 L. C. Jiao, X. R. Zhang, B. Hou, S. Wang and F. Liu, "Intelligent SAR image processing and interpretation," Science Press, 2008.
13 M. Wang, X. S. Hua, R. Hong, J. H. Tang, G. J. Qi and Y. Song, "Unified video annotation via multigraph learning," IEEE Transaction on Circuits and Systems for Video Technology, vol.19, no.5, pp.733-746, May.2009.   DOI
14 M. Wang, X. S. Hua, J. H. Tang and R. Hong, "Beyond distance measurement: constructing neighborhood similarity for video annotation," IEEE Transaction on Multimedia, vol.11, no.3, pp.465-476, Apr.2009.   DOI
15 M. R. Naphade and J. R. Smith, "On the detection of semantic concepts at TRECVID," in Proc. of. ACM Multimedia, pp.660-667, Oct.2004.
16 H. S. Lee, S. J. Hong and E. Kim, "Probabilistic background subtraction in a video-based recognition system," KSII Transaction on Internet and Information Systems, vol.5, no.4, pp.782-804, Apr.2011.
17 N. M. Oliver, B. Rosario and A. Pentland, "A Bayesian computer vision system for modeling human interactions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.831-843, Aug.2000.   DOI   ScienceOn
18 J. C. Nascimento and J. S. Marques, "Performance evaluation of object detection algorithms for video surveillance," IEEE Transactions on Multimedia, vol.8, no.4, pp.761-774, Aug.2006.   DOI
19 S.C. Pei and Y.T. Chuang, "Automatic text detection using multi-layer color quantization in complex color images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.619-622, Jun.2004.
20 P. Shivakumara, T. Q. Phan and C. L. Tan, "Video text detection based on filters and edge features," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.514-517, Jul.2009.
21 C. M. Liu, C. H. Wang and R. W. Dai, "Text detection in images based on unsupervised classification of edge-based features," in Proc. of the 8th International Conference on Document Analysis and Recognition, pp.610-614, Sep.2005.
22 P. Shivakumara, T. Q. Phan and C. L. Tan, "A robust wavelet transform based technique for video text detection," in Proc. of the 10th International Conference on Document Analysis and Recognition, pp.1285-1289, Jul.26-29, 2009.
23 X. R. Zhang, L. C. Jiao, F. Liu, L. F. Bo and M. G. Gong, "Spectral clustering ensemble applied to SAR image segmentation," IEEE Transaction on Geoscience and Remote Sensing, vol.46, no.7, pp.2126-2136, Jul.2008.   DOI
24 J. B. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.888-905, 2000.   DOI   ScienceOn
25 C. Yang, X. R. Zhang and L. C Jiao, "Self-tuning semi-supervised spectral clustering," in Proc. of the IEEE International Conference on Computational Intelligence and Security, pp.1-5, Dec.2008.
26 C. Fowlkes, S. Belongie, F. Chung and L. Malik, "Spectral grouping using the Nyström method," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.26, no.2, pp.214-225, Feb. 2004.   DOI   ScienceOn
27 V. Vladimir, "The nature of statistical learning theory," Springer-Verlag, 1995.
28 X. Q. Liu and J. Samarabandu, "Multiscale edge-based text extraction from complex images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.1721-7124, Jul.2006.
29 E. K. Wong and M. Y. Chen, "A new robust algorithm for video text extraction", Pattern Recognition, vol.36, pp.1397-1406, 2003.   DOI   ScienceOn
30 T. X. Zhao, G. M. Sun, C. Zhang and D. M. Chen, "Study on video text processing," in Proc. of the IEEE International Symposium on Industrial Electronics, pp.1215-1218, Jun.2008.
31 Z. Wang and Z. Q. Wei, "A comparative study of feature selection for SVM in video text detection," in Proc. of the 2th International Symposium on Intelligence and Design, pp.552-556, Dec.2009.