[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.6109/jkiice.2011.15.12.2521

Speaker Recognition using LPC cepstrum Coefficients and Neural Network

Choi, Jae-Seung (신라대학교 전자공학과)

Publication Information

Journal of the Korea Institute of Information and Communication Engineering / v.15, no.12, 2011 , pp. 2521-2526 More about this Journal

Abstract

This paper proposes a speaker recognition algorithm using a perceptron neural network and LPC (Linear Predictive Coding) cepstrum coefficients. The proposed algorithm first detects the voiced sections at each frame. Then, the LPC cepstrum coefficients which have speaker characteristics are obtained by the linear predictive analysis for the detected voiced sections. To classify the obtained LPC cepstrum coefficients, a neural network is trained using the LPC cepstrum coefficients. In this experiment, the performance of the proposed algorithm was evaluated using the speech recognition rates based on the LPC cepstrum coefficients and the neural network.

Keywords

Perceptron neural network; Linear predictive analysis; LPC cepstrum coefficient; Speaker recognition;

Citations & Related Records

Reference

1	H. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions", in Proc. ISCA ITRW ASR2000 on Automatic Speech Recognition: Challenges for the Next Millennium, Paris, France, 2000.
2	A. Revathi, Y. Venkataramani, "Speaker independent continuous speech and isolated digit recognition using VQ and HMM", 2011 International Conference on Communications and Signal Processing, pp. 198-202, 2011.
3	K. Kuah, M. Bodruzzaman, S. Zein-Sabatto, "A neural network-based text independent voice recognition system", Proceedings of the 1994 IEEE Southeastcon 'Creative Technology Transfer - A Global Affair'., pp. 131-135, 1994.
4	B. Lu, J. J. Xu, "Research on Isolated Word Speech Recognition Based on Biomimetic Pattern Recognition", 2009 International Conference on Artificial Intelligence and Computational Intelligence, Vol. 2, pp. 436-439, 2009.
5	D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning representations by back-propagation errors", Nature, Vol. 323, pp. 533-536, 1986. DOI ScienceOn
6	S. K. Pal, S. Mitra, "Multilayer perceptron, fuzzy sets, and classification", IEEE Transaction on Neural Networks, Vol. 3, No. 5, pp. 683-697, 1992. DOI ScienceOn
7	W. G. Knecht, M. E. Schenkel, G. S. Moschytz, "Neural network filters for speech enhancement", IEEE Trans. Speech and Audio Processing, Vol. 3, No. 6, pp. 433-438, 1995. DOI ScienceOn
8	P. B. Patil, "Multilayered network for LPC based speech recognition", IEEE Transactions on Consumer Electronics, Vol. 44, No. 2, pp. 435-438, 1998. DOI ScienceOn

KSCI

Speaker Recognition using LPC cepstrum Coefficients and Neural Network LPC 켑스트럼 계수와 신경회로망을 사용한 화자인식

Speaker Recognition using LPC cepstrum Coefficients and Neural Network