Browse > Article

Analysis and Implementation of Speech/Music Classification for 3GPP2 SMV Codec Based on Support Vector Machine  

Kim, Sang-Kyun (Department of Electronics Engineering, Inha University)
Chang, Joon-Hyuk (Department of Electronics Engineering, Inha University)
Publication Information
Abstract
In this paper, we propose a novel a roach to improve the performance of speech/music classification for the selectable mode vocoder (SMV) of 3GPP2 using the support vector machine (SVM). The SVM makes it possible to build on an optimal hyperplane that is separated without the error where the distance between the closest vectors and the hyperplane is maximal. We first present an effective analysis of the features and the classification method adopted in the conventional SMV. And then feature vectors which are a lied to the SVM are selected from relevant parameters of the SMV for the efficient speech/music classification. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional scheme of the SMV.
Keywords
Support Vector Machine(SVM); Selectable Mode Vocoder(SMV); Speech/Music Classification Algorithm;
Citations & Related Records
연도 인용수 순위
  • Reference
1 N. Cristianini and J. Shawe-Taylor, An introduction to support vector machines and other kernel-based learning methods. Cambridge Univ. Press, 2000
2 3GPP2 Spec., "Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems," 3GPP2-C.S0030-0, v3.0, Jan. 2004
3 Y. Gao, E. Shlomot, A. Benyassine, J. Thyssen, Huan-yu Su, and C. Murgia, "The SMV Algorithm Selected by TIA and 3GPP2 for CDMA Applications," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 709-712, May 2001
4 S. Craig Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 953-956, May 2001
5 3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, v.1.0, Apr. 2005
6 X. Wang, J. Chen, P. Wang, Z. Huang, "Infrared Human Face Auto Locating Based on SVM and A Smart Thermal Biometrics System," Proc. Sixth International Conference on Intelligent Systems Design and Applications (ISDA'06), vol. 2, pp. 1066-1072, Oct. 2006
7 W. Q. Wang, W. Gao, and D. W. Ying, "A fast and robust speech/music Discrimination Approach," Proc. International Conference on Information, Communications and Signal Processing, vol. 3, pp. 1325-1329, Dec. 2003
8 J. Saunders, "Real-time discrimination of broadcast speech/music," Proc. IEEE International Conference on Acoustics, Speech, and Processing, vol. 2, pp. 93-996, May 1996
9 A. Ganapathiraju, J. E. Hamaker, J. Picone, "Applications of support vector machines to speech recognition," IEEE Trans. Signal Processing, vol. 52, pp. 2348-2355, Aug. 2004   DOI   ScienceOn
10 J. H. Song, K. H. Lee, J.-H. Chang, J. K. Kim, and N. S. Kim, "Analysis and Improvement of Speech/Music Classication for 3GPP2 SMV based on GMM," Proc. IEEE Signal Processing Letters, vol. 15, pp. 103-106, Jan. 2008   DOI   ScienceOn
11 V. N. Vapnik, "An overview of statistical learning theory," IEEE Trans. Neural Networks, vol. 10, no. 5, pp. 988 -999, Sep. 1999   DOI   ScienceOn