[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2008.27.6.319

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training

Kang, Sang-Ick (인하대학교 전자공학부)
Chang, Joon-Hyuk (인하대학교 전자공학부)
Lee, Seong-Ro (목포대학교 정보공학부)

Publication Information

The Journal of the Acoustical Society of Korea / v.27, no.6, 2008 , pp. 319-324 More about this Journal

Abstract

In this paper, we propose a novel approach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using the discriminative weight training which is based on the minimum classification error (MCE) algorithm. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then proposed the speech/music decision rule is expressed as the geometric mean of optimally weighted features which are selected from the SMV. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.

Keywords

Speech/music classification; Minimum classification error; Discriminative weight training; Selectable mode vocoder;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	Y. Gao, E. Shlomot, A. Benyassine, J. Thyssen, H.-Y. Su, and C. Murgia, "The SMV algorithm selected by TIA and 3GPP2 for CDMA Applications," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 709 -712, May 2001
2	3GPP2 Spec., "Selectable mode vocoder (SMV) service option for wideband spread spectrum communication systems," 3GPP2 -C.S0030-0, v3.0, Jan. 2004
3	P. Vary and R. Martin, Digital Speech Transmission : enhancement, coding and error concealment, pp.182-187, 2006
4	S.-I. Kang, Q.-H. Jo, J.-H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Processing Letters, 15, 170-173, Feb. 2008 DOI ScienceOn
5	P. Kabal, R. Prakash and Ramachandran, "The computation of line spectral frequencies using Chebyshey polynomials," IEEE Trans. Acoustics, speech and signal processing, ASSP -34(6), 1419-1426, Dec. 1986 DOI
6	금지수, 임성길, 이현수, "스펙트럼 분석과 신경망을 이용한 음성/음악 분류", 한국음향학회지, 26(5), 207-213, Jul. 2007 과학기술학회마을
7	J. Saunders, "Real-time discrimination of broadcast speech /music," Proc. IEEE International Conference on Acoustics, Speech, and Processing, 2, 993-996, May 1996
8	C. V. Goudar, P. Rabha, M. Deshpande, and A. Rao, "SMVLite: Reduced Complexity Selectable Mode Vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 1, 701-704, May 2006
9	J. Makinen, P. Ojala, and H. Toukomaa, "Performance comparison of source controlled GSM AMR and SMV vocoders," Proc. International Symposium on Intelligent Signal Processing and Communication Systems, 51-154, Nov. 2004
10	W. Q. Wang, W. Gao, and D. W. Ying, "A fast and robust speech/music discrimination approach," Proc. International Conference on Information, Communications and Signal Processing, 3, 1325-1329, Dec. 2003
11	S. Craig Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, 2, 953-956, May 2001
12	B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Processing, 5(3), 257-265, May 1997 DOI ScienceOn
13	3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, v.1.0, Apr. 2005
14	W. M. Fisher, G. R. Doddington and K. M. Goudie-Marshall, "The DARPA speech recognition research database: Specifi-cations and status," Proc. DARPA Workshop Speech Recognition, pp.93-99, Feb. 1986

KSCI

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training 변별적 가중치 학습을 이용한 3GPP2 SVM의 실시간 음성/음악 분류 성능 향상

Enhancement of Speech/Music Classification for 3GPP2 SMV Codec Employing Discriminative Weight Training