Browse > Article
http://dx.doi.org/10.7776/ASK.2011.30.6.330

Improvement in Supervector Linear Kernel SVM for Speaker Identification Using Feature Enhancement and Training Length Adjustment  

So, Byung-Min (서울시립대학교 컴퓨터과학부)
Kim, Kyung-Wha (대검찰청 음성분석실)
Kim, Min-Seok (LG 전자기술원)
Yang, Il-Ho (서울시립대학교 컴퓨터과학부)
Kim, Myung-Jae (서울시립대학교 컴퓨터과학부)
Yu, Ha-Jin (서울시립대학교 컴퓨터과학부)
Abstract
In this paper, we propose a new method to improve the performance of supervector linear kernel SVM (Support Vector Machine) for speaker identification. This method is based on splitting one training datum into several pieces of utterances. We use four different databases for evaluating performance and use PCA (Principal Component Analysis), GKPCA (Greedy Kernel PCA) and KMDA (Kernel Multimodal Discriminant Analysis) for feature enhancement. As a result, the proposed method shows improved performance for speaker identification using supervector linear kernel SVM.
Keywords
Speaker identification; SVM; PCA; GKPCA; KMDA;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J.-L. Gauvain and C.-H. Lee, "Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains," IEEE Trans. Speech Audio Proc., vol. 2, no. 2, pp. 291- 298, Apr. 1994.   DOI   ScienceOn
2 Smith, L. I, "A tutorial on Principal Components Analysis", 2002.
3 김민석, 양일호, 유하진, "Greedy Kernel PCA를 이용한 화자식별", 말소리, 66호, 105-116쪽, 2008.
4 Kim, M-S., Yang, I-H., Yu, H-J., "Kernel multimodal discriminant analysis for speaker verification", In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 4498-4501. 2010.
5 B. Scholkopf, A. Smola and K.-R. Muller, "Kernel Principal Component Analysis," In Int. Conf. on Aritificil Neural Networks, pp. 583-588, 1997.
6 W.M. Campbell, D.E. Sturim, D.A. Reynolds, "Support Vector Machines using GMM Supervectors for Speaker Verification," IEEE Signal Processing Letters, vol. 13, no. 5, pp. 308-311, 2006.   DOI   ScienceOn
7 Douglas A. Reynolds and Richard C. Rose, "Robust Text- Independent Speaker Identification Using Gaussian Mixture Speaker Models," IEEE Trans. Speech Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.   DOI   ScienceOn
8 Douglas A. Reynolds, Thomas F. Quatieri and Robert B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing., vol. 10, no. 1-3, pp. 19-41, Jan. 2000.   DOI   ScienceOn