[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2011.30.6.330

Improvement in Supervector Linear Kernel SVM for Speaker Identification Using Feature Enhancement and Training Length Adjustment

So, Byung-Min (서울시립대학교 컴퓨터과학부)
Kim, Kyung-Wha (대검찰청 음성분석실)
Kim, Min-Seok (LG 전자기술원)
Yang, Il-Ho (서울시립대학교 컴퓨터과학부)
Kim, Myung-Jae (서울시립대학교 컴퓨터과학부)
Yu, Ha-Jin (서울시립대학교 컴퓨터과학부)

Publication Information

The Journal of the Acoustical Society of Korea / v.30, no.6, 2011 , pp. 330-336 More about this Journal

Abstract

In this paper, we propose a new method to improve the performance of supervector linear kernel SVM (Support Vector Machine) for speaker identification. This method is based on splitting one training datum into several pieces of utterances. We use four different databases for evaluating performance and use PCA (Principal Component Analysis), GKPCA (Greedy Kernel PCA) and KMDA (Kernel Multimodal Discriminant Analysis) for feature enhancement. As a result, the proposed method shows improved performance for speaker identification using supervector linear kernel SVM.

Keywords

Speaker identification; SVM; PCA; GKPCA; KMDA;

Citations & Related Records

Reference

1	J.-L. Gauvain and C.-H. Lee, "Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains," IEEE Trans. Speech Audio Proc., vol. 2, no. 2, pp. 291- 298, Apr. 1994. DOI ScienceOn
2	Smith, L. I, "A tutorial on Principal Components Analysis", 2002.
3	김민석, 양일호, 유하진, "Greedy Kernel PCA를 이용한 화자식별", 말소리, 66호, 105-116쪽, 2008.
4	Kim, M-S., Yang, I-H., Yu, H-J., "Kernel multimodal discriminant analysis for speaker verification", In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4498-4501. 2010.
5	B. Scholkopf, A. Smola and K.-R. Muller, "Kernel Principal Component Analysis," In Int. Conf. on Aritificil Neural Networks, pp. 583-588, 1997.
6	W.M. Campbell, D.E. Sturim, D.A. Reynolds, "Support Vector Machines using GMM Supervectors for Speaker Verification," IEEE Signal Processing Letters, vol. 13, no. 5, pp. 308-311, 2006. DOI ScienceOn
7	Douglas A. Reynolds and Richard C. Rose, "Robust Text- Independent Speaker Identification Using Gaussian Mixture Speaker Models," IEEE Trans. Speech Audio Processing, vol. 3, no. 1, pp. 72-83, 1995. DOI ScienceOn
8	Douglas A. Reynolds, Thomas F. Quatieri and Robert B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing., vol. 10, no. 1-3, pp. 19-41, Jan. 2000. DOI ScienceOn

KSCI

Improvement in Supervector Linear Kernel SVM for Speaker Identification Using Feature Enhancement and Training Length Adjustment 특징 강화 기법과 학습 데이터 길이 조절에 의한 Supervector Linear Kernel SVM 화자식별 개선

Improvement in Supervector Linear Kernel SVM for Speaker Identification Using Feature Enhancement and Training Length Adjustment