Browse > Article
http://dx.doi.org/10.7776/ASK.2008.27.6.319

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training  

Kang, Sang-Ick (인하대학교 전자공학부)
Chang, Joon-Hyuk (인하대학교 전자공학부)
Lee, Seong-Ro (목포대학교 정보공학부)
Abstract
In this paper, we propose a novel approach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using the discriminative weight training which is based on the minimum classification error (MCE) algorithm. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then proposed the speech/music decision rule is expressed as the geometric mean of optimally weighted features which are selected from the SMV. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.
Keywords
Speech/music classification; Minimum classification error; Discriminative weight training; Selectable mode vocoder;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Gao, E. Shlomot, A. Benyassine, J. Thyssen, H.-Y. Su, and C. Murgia, "The SMV algorithm selected by TIA and 3GPP2 for CDMA Applications," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 709 -712, May 2001
2 3GPP2 Spec., "Selectable mode vocoder (SMV) service option for wideband spread spectrum communication systems," 3GPP2 -C.S0030-0, v3.0, Jan. 2004
3 P. Vary and R. Martin, Digital Speech Transmission : enhancement, coding and error concealment, pp.182-187, 2006
4 S.-I. Kang, Q.-H. Jo, J.-H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Processing Letters, 15, 170-173, Feb. 2008   DOI   ScienceOn
5 P. Kabal, R. Prakash and Ramachandran, "The computation of line spectral frequencies using Chebyshey polynomials," IEEE Trans. Acoustics, speech and signal processing, ASSP -34(6), 1419-1426, Dec. 1986   DOI
6 금지수, 임성길, 이현수, "스펙트럼 분석과 신경망을 이용한 음성/음악 분류", 한국음향학회지, 26(5), 207-213, Jul. 2007   과학기술학회마을
7 J. Saunders, "Real-time discrimination of broadcast speech /music," Proc. IEEE International Conference on Acoustics, Speech, and Processing, 2, 993-996, May 1996
8 C. V. Goudar, P. Rabha, M. Deshpande, and A. Rao, "SMVLite: Reduced Complexity Selectable Mode Vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 701-704, May 2006
9 J. Makinen, P. Ojala, and H. Toukomaa, "Performance comparison of source controlled GSM AMR and SMV vocoders," Proc. International Symposium on Intelligent Signal Processing and Communication Systems, 51-154, Nov. 2004
10 W. Q. Wang, W. Gao, and D. W. Ying, "A fast and robust speech/music discrimination approach," Proc. International Conference on Information, Communications and Signal Processing, 3, 1325-1329, Dec. 2003
11 S. Craig Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 953-956, May 2001
12 B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Processing, 5(3), 257-265, May 1997   DOI   ScienceOn
13 3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, v.1.0, Apr. 2005
14 W. M. Fisher, G. R. Doddington and K. M. Goudie-Marshall, "The DARPA speech recognition research database: Specifi-cations and status," Proc. DARPA Workshop Speech Recognition, pp.93-99, Feb. 1986