DOI QR코드

DOI QR Code

Lip Region Extraction by Gaussian Classifier

가우스 분류기를 이용한 입술영역 추출

  • Received : 2016.09.25
  • Accepted : 2017.01.09
  • Published : 2017.02.28

Abstract

Lip reading is a field of image processing to assist the process of sound recognition. In some environment, the capture of sound signal usually has significant noise and therefore, the recognition rate of sound signal decreases. Lip reading can be a good feature for the increase of recognition rates. Conventional lip extraction methods have been proposed widely. Maia et. al. proposed a method by the sum of Cr and Cb. However, there are two problems as follows: the point with maximum saturation is not always regarded as lips region and the inner part of lips such as oral cavity and teeth can be classified as lips. To solve these problems, this paper proposes a method which adopts the histogram-based classifier for the extraction of lips region. The proposed method consists of two stages, learning and test. The amount of computation is minimized because this method has no color conversion. The performance of proposed method gives 66.8% of detection rate compared to 28% of conventional ones.

Keywords

References

  1. G. Potamianos, C. Neti, J. Luettin, and I. Matthews, Audio-Visual Automatic Speech Recognition: An Overview, The Massachusetts Institute of Technology Press, Cambridge MA, 2004.
  2. A.L. Yuille, P.W. Hallinan, and D.S. Cohen, "Feature Extraction from Faces Using Deformable Templates," International Journal of Computer Vision, Vol. 8, No. 2, pp. 99-111, 1992. https://doi.org/10.1007/BF00127169
  3. T.F. Cootes, C.J. Taylor, D.H. Cooper, and J. Grahan, "Active Shape Models-Their Training and Application," Computer Vision and Image Understanding, Vol. 61, No. 2, pp. 38- 59, 1995. https://doi.org/10.1006/cviu.1995.1004
  4. C. Bregler and Y. Konig, "Eigenlips for Robust Speech Recognition," Proceeding of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 669-672, 1994.
  5. G.I. Chiou and J.N. Hwang, "Lip Reading from Color Video," IEEE Transactions on Image Processing, Vol. 6, No. 8, pp. 1192-1195, 1997. https://doi.org/10.1109/83.605417
  6. A.W. Liew, S.H. Leung, and W.H. Lau, "Segmentation of Color Lip Images by Spatial Fuzzy Clustering," IEEE Transactions on Fuzzy Systems, Vol. 11, No. 4, pp. 542-549, 2003. https://doi.org/10.1109/TFUZZ.2003.814843
  7. Y. Wu, R. Ma, W. Hu, T. Wang, Y. Zhang, J. Cheng, et al., "Robust Lip Localization on Multi-View Faces in Video," Proceeding of IEEE International Conference on Image Processing, pp. 481-484, 2007.
  8. S. Sridharan S. Lucey, and V. Chandran, "Adaptive Mouth Segmentation Using Chromatic Features," Pattern Recognition Letters, Vol. 23, No. 11, pp. 1293-1302, 2002. https://doi.org/10.1016/S0167-8655(02)00078-8
  9. O. Erol, Y. Berkay, K. Harun, E. Hakan, and U. Mustafa, "Lip Segmentation Using Adaptive Color Space Training," Proceeding of the International Conference on Auditory-Visual Speech Processing, pp. 219-222, 2008.
  10. R. Rohani, F. Sobhanmanesh, S. Alizadeh, and R. Boostani, "Lip Processing and Modeling based on Spatial Fuzzy Clustering in Color Images," International Journal of Fuzzy Systems, Vol. 13, No. 2, pp. 65-73, 2011.
  11. J.Kim, "Extraction of Lip Region using Chromaticity Transformation and Fuzzy Clustering," Journal of Korea Multimedia Society, Vol. 17, No. 7, pp. 806-817, July 2014. https://doi.org/10.9717/kmms.2014.17.7.806
  12. D. Maia and R. Trindade, "Face Detection and Recognition in Color Images under Matlab," International Journal of Signal Processing, Image Processing and Pattern Recognition, Vol. 9, No. 2, pp. 13-24, 2016.
  13. V. Le, J. Brandt, Z. Lin, L. Bourdev, T. Huang, "Interactive Facial Feature Localization," Proceeding of European Conference on Computer Vision, pp. 1-10, 2012.
  14. M. Jones and J. Rehg, "Statistical Color Models with Applications to Skin Detection," International Journal of Computer Vision, Vol. 46, pp. 81-96, 2002. https://doi.org/10.1023/A:1013200319198