Browse > Article

Improving SVM with Second-Order Conditional MAP for Speech/Music Classification  

Lim, Chung-Soo (Mokpo National University)
Chang, Joon-Hyuk (Dep. of Electronic Engineering, Hanyang University)
Publication Information
Abstract
Support vector machines are well known for their outstanding performance in pattern recognition fields. One example of their applications is music/speech classification for a standardized codec such as 3GPP2 selectable mode vocoder. In this paper, we propose a novel scheme that improves the speech/music classification of support vector machines based on the second-order conditional maximum a priori. While conventional support vector machine optimization techniques apply during training phase, the proposed technique can be adopted in classification phase. In this regard, the proposed approach can be developed and employed in parallel with conventional optimizations, resulting in synergistic boost in classification performance. According to experimental results, the proposed algorithm shows its compatibility and potential for improving the performance of support vector machines.
Keywords
Second-order Conditional Maximum a posteriori (Second-order CMAP); Support Vector Machine (SVM); Selectable Mode Vocoder (SMV); Speech/Music Classification Algorithm;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 J. W. Shin, H. J. Kwon, S. H. Jin, and N. S. Kim, "Voice activity detection based on conditional map criterion," IEEE Signal Processing Letters, vol. 15, no. 2, pp. 257-260, February. 2008.   DOI
2 W. M. Fisher, G. R. Doddington and K. M. Goudie-Marshall, "The DARPA speech recognition research database: Specifications and status," in Proc. DARPA Workshop Speech Recognition, pp. 93-99, February 1986.
3 S. -K. Kim and J. -H. Chang, "Discriminative weight training for support vector machine-based speech/music classification in 3GPP2 SMV codec," IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, vol. E93-A, no. 1, pp. 316-319, January 2010.   DOI   ScienceOn
4 임정수, 송지현, 장준혁, "SVM의 미세조정을 통한 음성/음악 분류 성능향상," 전자공학회 논문지 SP편 48권 2호, 141-148쪽, 2011년 3월
5 X. Wang, J. Chen, P Wang, Z. Huang, "Infrared human face auto locating based on SVM and a smart thermal biometrics system," in Proc. Sixth International Conference on Intelligent Systems Design and Applications (ISDA'06) , vol. 2, pp. 1066-1072, October 2006.
6 A. Ganapathiraju, J. E. Hamaker, J. Picone, "Applications of support vector machines to speech recognition," IEEE Trans. Signal Processing, vol. 52, pp. 2348-2355, August 2004.   DOI   ScienceOn
7 S. C. Greer, and A. Dejaco, "Standardization of the selectable mode vocoder," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 953-956, May 2001.
8 C. V. Goudar, P. Rabha, M. Deshpande, and A. Rao, "SMVLite: reduced complexity selectable mode vocoder," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 701-704, May 2006.
9 V. N. Vapnik, "An overview of statistical learning theory," IEEE Trans. Neural Networks, vol. 10, no. 5, pp. 988-999, 1999.   DOI   ScienceOn
10 J. -M. Kum and J. -H. Chang, "Speech enhancement based on minima controlled recursive averaging incorporating second-order conditional MAP criterion," IEEE Signal Processing Letters, Vol. 16, no. 7, pp. 624-627, July 2009.   DOI
11 John C. Platt, "Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods," in Advances in Large Margin Classifiers, MIT Press, pp. 61-74, 1999.
12 3GPP2 Spec., "Source-controlled variable-rate multimedia wideband speech codec (VMR-WB), service option 62 and 63 for spread spectrum systems," 3GPP2-C.S0052-A, vol. 1.0, April. 2005.
13 Y. Gao, E. Shlomot, A. Benyassine, J. Hyssen, Huan-yu Su, and C. Murgia, "The SMV algorithm selected by TIA and 3GPP2 for CDMA appications," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 709-712, May 2001.
14 S. -K. Kim and J. -H. Chang, "Speech/music classification enhancement for 3GPP2 SMV codec based on support vector machine," IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, Vol. E92-A, no. 2, pp. 630-632, February 2009.   DOI   ScienceOn