Browse > Article
http://dx.doi.org/10.5391/JKIIS.2015.25.3.272

Emotion Recognition using Pitch Parameters of Speech  

Lee, Guehyun (Department of Electrical Engineering, Kunsan National University)
Kim, Weon-Goo (Department of Electrical Engineering, Kunsan National University)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.25, no.3, 2015 , pp. 272-278 More about this Journal
Abstract
This paper studied various parameter extraction methods using pitch information of speech for the development of the emotion recognition system. For this purpose, pitch parameters were extracted from korean speech database containing various emotions using stochastical information and numerical analysis techniques. GMM based emotion recognition system were used to compare the performance of pitch parameters. Sequential feature selection method were used to select the parameters showing the best emotion recognition performance. Experimental results of recognizing four emotions showed 63.5% recognition rate using the combination of 15 parameters out of 56 pitch parameters. Experimental results of detecting the presence of emotion showed 80.3% recognition rate using the combination of 14 parameters.
Keywords
Emotion Recognition; Speech Parameter; Pitch;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Dimitrios Ververidis, Constantine Kotropoulos, Loannis Pitas, "Automatic Emotional Speech Classification", in Proceedings of ICASSP'04, 2004.
2 Carlos Busso, Sungbok Lee, Shrikanth Narayanan, "Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection,", IEEE Trans. Speech and Audio Processing, Vol. 17, No 4, pp. 582-596, May 2009   DOI
3 Janet E. Cahn, "The Generation of Affect in Synthesized Speech", Journal of the American Voice I/0 Society, Vol. 8, pp.1-19 July 1990.
4 K. R. Scherer, D. R. Ladd, and K. E. A. Silverman, "Vocal Cues to Speaker Affect: Testing Two Models", Journal Acoustical Society of America, Vol. 76, No. 5, pp. 1346-1355, Nov 1984.   DOI
5 Iain R. Murray and John L. Arnott, "Toward the Simulation of Emotion in Synthetic Speech: A Review of the Literature on Human Vocal Emotion", Journal Acoustical Society of America, pp.1097-1108, Feb. 1993.
6 Rosalind W. Picard, "Affective Computing", The MIT Press, 1997.
7 V. Kostv and S. Fukuda, "Emotion in User Interface, Voice Interaction System," IEEE International Conference on Systems, Cybernetics Representation, No.2, pp,798-803, 2000
8 T. Moriyama and S. Oazwa, "Emotion Recognition and Synthesis System on Speech," IEEE Intl. Conference on Multimedia Computing and System, , pp.840-844. 1999
9 L. C. Siva and P. C. Ng, "Bimodal Emotion Recognition," in Proceeding of the 4th Intl. Conference on Automatic Face and Gesture Recognition, pp.332-335. 2000
10 Y. G. Kim, Y. C. Bae, "Design of Emotion Recognition Model Using fuzzy Logic" Proceedings of KFIS Spring Conference, 2000.
11 K. B. Sim, C. H. Park, "Analyzing the element of emotion recognition from speech", Journal of Korean Institute of Intelligent Systems, Vol. 11, no. 6, pp.510-515, 2001.
12 P. A. Devijver and J. Kitteler, "Pattern Recognition : A Statistical Approach", London: Prentice-Hall International, 1982
13 P. Boersma and D. Weeninck, "PRAAT, a system for doing phonetics by computer," Inst. Phon. Sci. Univ. of Amsterdam, Amsterdam, Negherlands, Tech. Rep. 132, 1996 [Online]. Available: http://www.praat.org.
14 B. S. Kang, "text-independent emotion recognition algorithm using speech signal," Master thesis, Yonsei University, 2000