Enhancement Voiced/Unvoiced Sounds Classification for 3GPP2 SMV Employing GMM

Song, Ji-Hyun;Chang, Joon-Hyuk;

대한전자공학회논문지SP (Journal of the Institute of Electronics Engineers of Korea SP)

Vol. 45, No. 5 / pp. 111-117 / 2008 / pISSN 1229-6384

대한전자공학회 (The Institute of Electronics and Information Engineers)

3GPP2 SMV의 실시간 유/무성음 분류 성능 향상을 위한 Gaussian Mixture Model 기반 연구

Song, Ji-Hyun (Department of Electronics Engineering, Inha University);
Chang, Joon-Hyuk (Department of Electronics Engineering, Inha University)

Published: 2008-09-25

Abstract

In this paper, we propose a method that uses the Gaussian mixture model (GMM), which performs well in pattern recognition, to improve the voiced/unvoiced (V/UV) classification of the 3GPP2 selectable mode vocoder (SMV) in nonstationary background noise. We first analyze the features and the classification method adopted in the conventional SMV. Feature vectors that perform well for V/UV classification are then selected from the relevant SMV parameters and used as the input vectors of the GMM. In experiments carried out under various noise environments, the proposed GMM-based method yielded better V/UV classification performance than the conventional SMV scheme.
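The abstract does not state which SMV parameters form the feature vector, how many mixture components are used, or what decision rule is applied. As a rough illustration only, the sketch below assumes one diagonal-covariance GMM per class trained with EM, a log-likelihood ratio test between the two models, and synthetic two-dimensional features (the feature centers, `fit_gmm`, `classify`, and the zero threshold are all hypothetical and are not taken from the paper or from the SMV specification).

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(vals):
    """Numerically stable log(sum(exp(vals)))."""
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))


def gmm_loglik(x, gmm):
    """Log-likelihood of one feature vector under a GMM."""
    weights, means, variances = gmm
    return logsumexp([
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ])


def fit_gmm(data, k=2, iters=30, seed=0, var_floor=1e-3):
    """Fit a k-component diagonal GMM with plain EM (illustrative only)."""
    rng = random.Random(seed)
    n, dim = len(data), len(data[0])
    means = [list(x) for x in rng.sample(data, k)]
    gmean = [sum(x[d] for x in data) / n for d in range(dim)]
    gvar = [max(sum((x[d] - gmean[d]) ** 2 for x in data) / n, var_floor)
            for d in range(dim)]
    variances = [list(gvar) for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each frame.
        resp = []
        for x in data:
            logs = [math.log(weights[j])
                    + log_gauss_diag(x, means[j], variances[j])
                    for j in range(k)]
            norm = logsumexp(logs)
            resp.append([math.exp(v - norm) for v in logs])
        # M-step: re-estimate weights, means, and floored variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-10
            weights[j] = nj / n
            means[j] = [sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                        for d in range(dim)]
            variances[j] = [
                max(sum(r[j] * (x[d] - means[j][d]) ** 2
                        for r, x in zip(resp, data)) / nj, var_floor)
                for d in range(dim)
            ]
    return weights, means, variances


def classify(x, gmm_voiced, gmm_unvoiced, threshold=0.0):
    """Label a frame by the log-likelihood ratio of the two class models."""
    llr = gmm_loglik(x, gmm_voiced) - gmm_loglik(x, gmm_unvoiced)
    return "voiced" if llr > threshold else "unvoiced"


def synth(rng, center, n=200, sd=0.5):
    """Synthetic stand-in for per-frame features (not real SMV parameters)."""
    return [[rng.gauss(c, sd) for c in center] for _ in range(n)]


rng = random.Random(42)
gmm_v = fit_gmm(synth(rng, (2.0, -1.0)))   # hypothetical "voiced" cluster
gmm_u = fit_gmm(synth(rng, (-1.0, 1.5)))   # hypothetical "unvoiced" cluster
print(classify([1.9, -0.8], gmm_v, gmm_u))
print(classify([-1.2, 1.4], gmm_v, gmm_u))
```

In a setup like this, the threshold on the log-likelihood ratio would be the natural knob for trading voiced misses against unvoiced false alarms; whether the paper tunes such a threshold is not stated in the abstract.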
