http://dx.doi.org/10.7776/ASK.2007.26.8.390

Analysis and Implementation of Speech/Music Classification for 3GPP2 SMV Based on GMM  

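Song, Ji-Hyun (School of Electronic and Electrical Engineering, Inha University)
Lee, Kye-Hwan (School of Electronic and Electrical Engineering, Inha University)
Chang, Joon-Hyuk (School of Electronic and Electrical Engineering, Inha University)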
Abstract
In this letter, we propose a novel approach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using a Gaussian mixture model (GMM) based on the expectation-maximization (EM) algorithm. We first present an analysis of the features and the classification method adopted in the conventional SMV. Feature vectors for the GMM are then selected from relevant parameters of the SMV for efficient speech/music classification. The proposed algorithm is evaluated under various conditions and yields better results than the conventional SMV scheme.
Keywords
Speech/music classification algorithm; Selectable mode vocoder (SMV); Gaussian mixture model (GMM)
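
The general idea named in the abstract, fitting one GMM per class with EM and comparing likelihoods, can be illustrated with a minimal sketch. This is not the authors' implementation: the class `DiagGMM`, the helper names, the diagonal-covariance assumption, the zero decision threshold, and the two-dimensional synthetic "features" are all assumptions made for illustration, and no actual SMV parameters are used.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def logsumexp(vals):
    """Numerically stable log(sum(exp(v) for v in vals))."""
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))


class DiagGMM:
    """Hypothetical diagonal-covariance GMM fitted with plain EM."""

    def __init__(self, n_components, seed=0):
        self.k = n_components
        self.rng = random.Random(seed)

    def _component_logs(self, x):
        # log(weight_j) + log N(x | mean_j, var_j) for every component j
        return [math.log(w) + log_gauss(x, m, v)
                for w, m, v in zip(self.weights, self.means, self.vars)]

    def fit(self, data, n_iter=20, var_floor=1e-3):
        n, dim = len(data), len(data[0])
        # Initialise means from random frames, variances from the global variance.
        self.means = [list(x) for x in self.rng.sample(data, self.k)]
        gmean = [sum(x[d] for x in data) / n for d in range(dim)]
        gvar = [max(sum((x[d] - gmean[d]) ** 2 for x in data) / n, var_floor)
                for d in range(dim)]
        self.vars = [list(gvar) for _ in range(self.k)]
        self.weights = [1.0 / self.k] * self.k
        for _ in range(n_iter):
            # E-step: posterior responsibility of each component for each frame.
            resp = []
            for x in data:
                logs = self._component_logs(x)
                norm = logsumexp(logs)
                resp.append([math.exp(lp - norm) for lp in logs])
            # M-step: re-estimate weights, means and variances.
            for j in range(self.k):
                nj = sum(r[j] for r in resp) + 1e-12
                self.weights[j] = nj / n
                self.means[j] = [sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                                 for d in range(dim)]
                self.vars[j] = [
                    max(sum(r[j] * (x[d] - self.means[j][d]) ** 2
                            for r, x in zip(resp, data)) / nj, var_floor)
                    for d in range(dim)]
        return self

    def log_likelihood(self, x):
        return logsumexp(self._component_logs(x))


def classify(x, speech_gmm, music_gmm, threshold=0.0):
    """Label a frame by the log-likelihood ratio of the two class models."""
    llr = speech_gmm.log_likelihood(x) - music_gmm.log_likelihood(x)
    return "speech" if llr > threshold else "music"


def synthetic_frames(center, n, rng):
    """Synthetic stand-in for per-frame feature vectors (not SMV data)."""
    return [[rng.gauss(c, 1.0) for c in center] for _ in range(n)]


rng = random.Random(42)
speech_gmm = DiagGMM(2, seed=1).fit(synthetic_frames([0.0, 0.0], 200, rng))
music_gmm = DiagGMM(2, seed=2).fit(synthetic_frames([4.0, 4.0], 200, rng))

print(classify([0.2, -0.1], speech_gmm, music_gmm))
print(classify([3.8, 4.3], speech_gmm, music_gmm))
```

In a real system the synthetic frames would be replaced by feature vectors extracted from labelled speech and music, and the threshold would be tuned on held-out data rather than fixed at zero.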