Browse > Article

Lip Detection using Color Distribution and Support Vector Machine for Visual Feature Extraction of Bimodal Speech Recognition System  

정지년 (한국과학기술원 전산학과)
양현승 (한국과학기술원 전산학과)
Abstract
Bimodal speech recognition systems have been proposed for enhancing recognition rate of ASR under noisy environments. Visual feature extraction is very important to develop these systems. To extract visual features, it is necessary to detect exact lip position. This paper proposed the method that detects a lip position using color similarity model and SVM. Face/Lip color distribution is teamed and the initial lip position is found by using that. The exact lip position is detected by scanning neighbor area with SVM. By experiments, it is shown that this method detects lip position exactly and fast.
Keywords
Support Vector Machine; bimodal speech recognition; lip detection; lip tracking; lip reading; Support Vector Machine; color distribution;
Citations & Related Records
연도 인용수 순위
  • Reference
1 G. Potamianos, H. P. Graf, and E. Cosatto, An Image Transform Approach for HMM Based Automatic Lipreading, Image Processing, 1998. ICIP 98. Proceedings. 1998 International Conference on, vol.3, Page(s): 173-177, 4-7 Oct 1998   DOI
2 Zhang Jian, Kaynak M.N., Cheok A.D., Ko Chi Chung, Real-time lip tracking for virtual lip implementation in virtual environments and computer games, The 10th IEEE International Conference on Fuzzy Systems, Volume: 3, Page(s): 1359-1362, 2001   DOI
3 Chan M.T., Zhang, Y. and Huang T.S., Real-time lip tracking and bimodal continuous speech recognition, IEEE Second Workshop on Multimedia Signal Processing, Page(s): 65-70, 7-9 Dec 1998   DOI
4 Kaucic R and Blake A., Accurate, real-time, unadorned lip tracking, Sixth International Conference on Computer Vision, Page(s): 370-375, 4-7 Jan 1998   DOI
5 Sadeghi M., Kittler J. and Messer K., 'Modelling and segmentation of lip area in face images,' IEEE Proceedings on Vision, Image and Signal Processing, Volume: 149 Issue: 3, Page(s): 179 -184 , Jun 2002   DOI   ScienceOn
6 Zhilin Wu, Aleksic P.S. and Katsaggelos A.K. Lip tracking for MPEG-4 facial animation, Proceedings of Fourth IEEE International Conference on Multimodal Interfaces, Page(s): 293-298, 2002   DOI
7 Lucey S., Sridharan. S. and Chandran. W., Chromatic lip tracking using a connectivity based fuzzy thresholding technique, ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and Its Applications, Volume: 2, Page(s): 669-672 vol.2, 1999   DOI
8 Delmas P., Eveno. N. and Lievin. M., Towards robust lip tracking, Proceedings of 16th International Conference on Pattern Recognition, Volume: 2, Page(s): 528-531 vol.2, 2002   DOI
9 Robert M. Haralick and Linda G. Shapiro, Computer and Robot Vision, Vol.1 pp. 73-74, Addison-Wesley publishing company., 1992
10 Christopher J. C. Burges, A Tutorial on Support Vector Machines for Pattern Recognition, Data Mining and Knowledge Discovery 2, pp. 121-167, 1998   DOI   ScienceOn