Browse > Article
http://dx.doi.org/10.6109/jkiice.2011.15.12.2521

Speaker Recognition using LPC cepstrum Coefficients and Neural Network  

Choi, Jae-Seung (신라대학교 전자공학과)
Abstract
This paper proposes a speaker recognition algorithm using a perceptron neural network and LPC (Linear Predictive Coding) cepstrum coefficients. The proposed algorithm first detects the voiced sections at each frame. Then, the LPC cepstrum coefficients which have speaker characteristics are obtained by the linear predictive analysis for the detected voiced sections. To classify the obtained LPC cepstrum coefficients, a neural network is trained using the LPC cepstrum coefficients. In this experiment, the performance of the proposed algorithm was evaluated using the speech recognition rates based on the LPC cepstrum coefficients and the neural network.
Keywords
Perceptron neural network; Linear predictive analysis; LPC cepstrum coefficient; Speaker recognition;
Citations & Related Records
연도 인용수 순위
  • Reference
1 H. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions", in Proc. ISCA ITRW ASR2000 on Automatic Speech Recognition: Challenges for the Next Millennium, Paris, France, 2000.
2 A. Revathi, Y. Venkataramani, "Speaker independent continuous speech and isolated digit recognition using VQ and HMM", 2011 International Conference on Communications and Signal Processing, pp. 198-202, 2011.
3 K. Kuah, M. Bodruzzaman, S. Zein-Sabatto, "A neural network-based text independent voice recognition system", Proceedings of the 1994 IEEE Southeastcon 'Creative Technology Transfer - A Global Affair'., pp. 131-135, 1994.
4 B. Lu, J. J. Xu, "Research on Isolated Word Speech Recognition Based on Biomimetic Pattern Recognition", 2009 International Conference on Artificial Intelligence and Computational Intelligence, Vol. 2, pp. 436-439, 2009.
5 D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning representations by back-propagation errors", Nature, Vol. 323, pp. 533-536, 1986.   DOI   ScienceOn
6 S. K. Pal, S. Mitra, "Multilayer perceptron, fuzzy sets, and classification", IEEE Transaction on Neural Networks, Vol. 3, No. 5, pp. 683-697, 1992.   DOI   ScienceOn
7 W. G. Knecht, M. E. Schenkel, G. S. Moschytz, "Neural network filters for speech enhancement", IEEE Trans. Speech and Audio Processing, Vol. 3, No. 6, pp. 433-438, 1995.   DOI   ScienceOn
8 P. B. Patil, "Multilayered network for LPC based speech recognition", IEEE Transactions on Consumer Electronics, Vol. 44, No. 2, pp. 435-438, 1998.   DOI   ScienceOn