Browse > Article
http://dx.doi.org/10.3837/tiis.2012.04.011

A Novel Video Image Text Detection Method  

Zhou, Lin (Zhengzhou Information Science and Technology Institute)
Ping, Xijian (Zhengzhou Information Science and Technology Institute)
Gao, Haolin (Zhengzhou Information Science and Technology Institute)
Xu, Sen (Yancheng Institute of Technology)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.6, no.4, 2012 , pp. 1140-1152 More about this Journal
Abstract
A novel and universal method of video image text detection is proposed. A coarse-to-fine text detection method is implemented. Firstly, the spectral clustering (SC) method is adopted to coarsely detect text regions based on the stationary wavelet transform (SWT). In order to make full use of the information, multi-parameters kernel function which combining the features similarity information and spatial adjacency information is employed in the SC method. Secondly, 28 dimension classifying features are proposed and support vector machine (SVM) is implemented to classify text regions with non-text regions. Experimental results on video images show the encouraging performance of the proposed algorithm and classifying features.
Keywords
Text detection; SC algorithm; multi-parameters kernel function; SVM;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
Times Cited By Web Of Science : 0  (Related Records In Web of Science)
연도 인용수 순위
1 N. Iam-on, T. Boongoen and S. Garrett, "Refining pairwise similarity matrix for cluster ensemble problem with cluster relations", in Proc. of the 7th IEEE International Conference on Discovery Science, vol.5255, pp.222-233, 2008.
2 U. V. Luxburg, "A tutorial on spectral clustering," Journal of Statistics and Computing, vol.17, no.4, pp.395-416, Dec.2007.   DOI   ScienceOn
3 Q. X. Ye and Q. M. Huang, "A new text detection algorithm in images/video frames," Lecture Notes in Computer Science, vol.3332, pp.858-865, 2004.
4 L. C. Jiao, X. R. Zhang, B. Hou, S. Wang and F. Liu, "Intelligent SAR image processing and interpretation," Science Press, 2008.
5 J. B. Shi and J. Malik, "Normalized cuts and image segmentation," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.888-905, 2000.   DOI   ScienceOn
6 C. Yang, X. R. Zhang and L. C Jiao, "Self-tuning semi-supervised spectral clustering," in Proc. of the IEEE International Conference on Computation-al Intelligence and Security, pp.1-5, Dec.2008.
7 C. Fowlkes, S. Belongie, F. Chung, L. Malik, "Spectral grouping using the Nystrom method," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 2, pp. 214-225, February, 2004.   DOI   ScienceOn
8 X. R. Zhang, L. C. Jiao, F. Liu, L. F. Bo and M. G. Gong, "Spectral clustering ensemble applied to SAR image segmentation," IEEE Transaction on Geoscience and Remote Sensing, vol.46, no.7, pp. pp.2126-2136, Jul.2008.
9 V. Vladimir, "The nature of statistical learning theory," Springer-Verlag, 1995.
10 X. Q. Liu and J. Samarabandu, "Multiscale edge-based text extraction from complex images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.1721-7124, Jul.2006.
11 E. K. Wong and M. Y. Chen, "A new robust algorithm for video text extraction", Pattern Recognition, vol.36, pp.1397-1406, 2003.   DOI   ScienceOn
12 T. X. Zhao, G. M. Sun, C. Zhang and D. M. Chen, "Study on video text processing," in Proc. of the IEEE International Symposium on Industrial Electronics, pp.1215-1218, Jun.2008.
13 Z. Wang and Z. Q. Wei, "A comparative study of feature selection for SVM in video text detection," in Proc. of the 2th International Symposium on Intelligence and Design, pp.552-556, Dec.2009.
14 C. M. Liu, C. H. Wang and R. W. Dai, "Text detection in images based on unsupervised classification of edge-based features," in Proc. of the 8th International Conference on Document Analysis and Recognition, pp.610-614, Aug.2005.
15 P. Shivakumara, T. Q. Phan and C. L. Tan, "A robust wavelet transform based technique for video text detection," in Proc. of the 10th International Conference on Document Analysis and Recognition, pp.1285-1289, Jul. 2009.
16 K. I. Kim, K. Jung and J. H. Kim, "Texture-based approach for text detection in images using support vector machines and continuous adaptive mean shift algorithm," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.25, no.12, pp.1631-1639, Dec.2003.   DOI   ScienceOn
17 Z. Ji, J. Wang and Y. T. Su, "Text detection in video frames using hybrid features," in Proc. of the 8th International Conference on Machine Learning and Cybernetics, pp.318-342, Jul.2009.
18 Y. Song, A. N. Liu, L Pang, S. X. Lin, Y. D. Zhang and S. Tang, "A novel image text extraction method based on k-means clustering," in Proc. of the 7th IEEE/ACIS International Conference on Computer and Information Science, pp.318-342, Jul.2009.
19 R. Guo, Y. H. Peng and H. L Wan, "Palmprint feature extraction and recognition based on stationary wavelet transform," Computer Engineering and Applications, vol.42, no.17, pp.62-65, Jul.2006.
20 K. Ersahin, I. G. Cumming and R. K. Ward, "Segmentation and classification of polarimetric SAR data using spectral graph partitioning," IEEE Transaction on Geoscience and Remote Sensing, vol.48, no.1, pp.164-174, Jan.2010.   DOI
21 C. Alzate and J. A. K. Suykens, "Multiway spectral clustering with out-of-sample extensions through weighted kernel PCA," IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.32, no.2, pp.335-347, Feb.2010.   DOI
22 T. Sakai and A. Imiya, "Randomized algorithm of spectral clustering and image/video segmentation using a minority of pixels", in Proc. of the 12th IEEE International Conference of Computer Vision Workshops, pp.468-475, Sep.2009.
23 Q.X. Ye, Q. M. Huang, W. Gao, D. B. Zhao, "Fast and robust text detection in images and video frames," Image and Vision Computing, vol.23, no.6, pp.565-576, Jun.2005.   DOI   ScienceOn
24 M. Wang, X. S. Hua, R. Hong, J. H. Tang, G. J. Qi and Y. Song, "Unified video annotation via multigraph learning," IEEE Transaction on Circuits and Systems for Video Technology, vol.19, no.5, pp.733-746, May.2009.   DOI
25 M. Wang, X. S. Hua, J. H. Tang and R. Hong, "Beyond distance measurement: constructing neighborhood similarity for video annotation," IEEE Transaction on Multimedia, vol.11, no.3, pp.465-476, Apr. 2009.   DOI
26 M. R. Naphade and J. R. Smith, "On the detection of semantic concepts at TRECVID," in Proc. of. the ACM Multimedia, pp.660-667, Oct.2004.
27 H. S. Lee, S. J. Hong and E. Kim, "Probabilistic background subtraction in a video-based recognition system," KSII Transaction on Internet and Information Systems, vol.5, no.4, pp.782-804, Apr. 2011.
28 N. M. Oliver, B. Rosario and A. Pentland, "A Bayesian computer vision system for modeling human interactions," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, no.8, pp.831-843, Aug. 2000.   DOI   ScienceOn
29 J. C. Nascimento and J. S. Marques, "Performance evaluation of object detection algorithms for video surveillance," IEEE Transactions on Multimedia, vol.8, no.4, pp.761-774, Aug.2006.   DOI
30 S.C. Pei and Y.T. Chuang, "Automatic text detection using multi-layer color quantization in complex color images," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.619-622, Jun.2004.
31 P. Shivakumara, T. Q. Phan and C. L. Tan, "Video text detection based on filters and edge features," in Proc. of the IEEE International Conference on Multimedia and Expo, pp.514-517, Jun.2009.