3GPP2 SMV의 실시간 유/무성음 분류 성능 향상을 위한 Gaussian Mixture Model 기반 연구

Enhancement Voiced/Unvoiced Sounds Classification for 3GPP2 SMV Employing GMM

  • 송지현 (인하대학교 전자공학부) ;
  • 장준혁 (인하대학교 전자공학부)
  • Song, Ji-Hyun (Department of Electronics Engineering, Inha University) ;
  • Chang, Joon-Hyuk (Department of Electronics Engineering, Inha University)
  • 발행 : 2008.09.25

초록

본 논문에서는 패턴 인식에서 우수한 성능을 보이는 가우시안 혼합모델 (Gaussian mixture model, GMM)을 이용하여 비정상적인 잡음환경에서 3GPP2 selectable mode vocoder (SMV)의 유/무성음 분류 알고리즘 성능 향상을 위한 방법을 제안한다. 기존의 SMV에 대해서 분석하고, 이론 기반으로 유/무성음 분류 알고리즘에서 우수한 성능을 보여주는 특징 벡터를 선택하여 GMM의 입력벡터로 효과적으로 이용한다 다양한 잡음환경에서 시스템의 성능을 평가한 결과 GMM을 이용한 제안된 방법이 기존의 SMV의 방법보다 우수한 유/무성음 분류 성능을 보였다.

In this paper, we propose an approach to improve the performance of voiced/unvoiced (V/UV) decision under background noise environments for the selectable mode vocoder (SMV) of 3GPP2. We first present an effective analysis of the features and the classification method adopted in the SMV. And then feature vectors which are applied to the GMM are selected from relevant parameters of the SMV for the efficient voiced/unvoiced classification. For the purpose of evaluating the performance of the proposed algorithm, different experiments were carried out under various noise environments and yields better results compared with the conventional scheme of the SMV.

키워드

참고문헌

  1. 3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, vol. 1.0, Apr. 2005
  2. Y. Gao, E. Shlomot, A. Benyassine, J. hyssen, Huan-yu Su, and C. Murgia, "The SMV Algorithm Selected by TIA and 3GPP2 for CDMA Applications," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 709-712, May 2001
  3. J. -H. Chang, N. S. Kim, and S. K. Mitra, "A statistical model-based V/UV decision under background noise environments," IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp.2885-2887, Dec. 2004
  4. S. Ahmadi and A. S. Spanias, "Cepstrum-based pitch detection using a new statistical V/UV classification algorithm," IEEE Trans. Speech Audio Process., vol. 7, no. 3, May 1999
  5. B. Atal and L. R. Rabiner, "A pattern recognition approach to voiced-unvoiced-silence classification with application to speech recognition," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-24, pp. 201-212, Jun. 1976
  6. L. Siegel, "A procedure for using pattern classification techniques to obtain a voiced/unvoiced/ classifier," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP_27, pp. 83-88, Jun. 1976
  7. L. R. Rabiner and M. R. Sambur, "Application of an LPC Distance Measure to the Voiced- Unvoiced -Silence Detection Problem," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-25, no. 4, pp. 339-343, Aug. 1977
  8. S. C. Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 953-956, May 2001
  9. C. V. Goudar, P. Rabha, M. Deshpande and A. Rao, "SMVLite: Reduced Complexity Selectable Mode Vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 701-704, May 2006
  10. 3GPP2 Spec., "Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems," 3GPP2- C.S0030-0, v3.0, Jan. 2004
  11. P. Vary and R. Martin, Digital Speech Transmission : enhancement, coding and error concealment, pp.182-187, 2006
  12. G. Xuan, W. Zhang and P. Chai, "EM algorithm of gaussian mixture model and hidden Markov model," Proc. IEEE International Conference on Image Processing, vol. 1, pp. 145-148, Oct. 2001
  13. D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, Jan. 2000 https://doi.org/10.1006/dspr.1999.0361