Browse > Article
http://dx.doi.org/10.5909/JBE.2013.18.5.713

On the Importance of Tonal Features for Speech Emotion Recognition  

Lee, Jung-In (Dept. of EE at Yonsei university)
Kang, Hong-Goo (Dept. of EE at Yonsei university)
Publication Information
Journal of Broadcast Engineering / v.18, no.5, 2013 , pp. 713-721 More about this Journal
Abstract
This paper describes an efficiency of chroma based tonal features for speech emotion recognition. As the tonality caused by major or minor keys affects to the perception of musical mood, so the speech tonality affects the perception of the emotional states of spoken utterances. In order to justify this assertion with respect to tonality and emotion, subjective hearing tests are carried out by using synthesized signals generated from chroma features, and consequently show that the tonality contributes especially to the perception of the negative emotion such as anger and sad. In automatic emotion recognition tests, the modified chroma-based tonal features are shown to produce noticeable improvement of accuracy when they are supplemented to the conventional log-frequency power coefficient (LFPC)-based spectral features.
Keywords
Emotion recognition; tonality; tonal features; chroma feature;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Cowie, E. Douglas-Cowei, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, and J. G. Taylor, "Emotion Recognition in Human Computer Interaction," IEEE Signal Processing Magazine, pp. 32-80, 2001.
2 D. Ververidis and C. Kotropoulos, "Emotional speech recognition: Resources, features, and methods," Speech Communication, vol. 48(9), pp. 1162-1181, 2006.   DOI   ScienceOn
3 M. E. Ayadi, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases", Pattern Recognition, vol. 44, pp. 572-587, 2011.   DOI   ScienceOn
4 I. Murray, J. Arnott, "Toward the simulation of emotion in synthetic speech: A review of the literature of human vocal emotion," J. Acoust. Soc. Am, vol. 93 (2), pp. 1097-1108, 1993.   DOI   ScienceOn
5 C. E. Williams and K. N. Stevens, "Emotion and speech: Some acoustical correlates", J. Acoust. Soc. Am, vol. 52(4), pp. 1238-1250, 1972.   DOI
6 C. Gobl and A. N. Chasaide, "The role of voice quality in communicating emotion, mood and attitude", Speech Communication, vol. 40, pp. 189-212, 2003.   DOI   ScienceOn
7 M. Goudbeek and K. Scherer, "Beyond arousal: Valence and potency/ control cues in the vocal expression of emotion", J. Acoust. Soc. Am, vol. 128, pp. 1322-1336, 2010.   DOI   ScienceOn
8 S. Yacoub, S. Simske, X. Lin, J. Burns, "Recognition of Emotionsin Interactive Voice Response System," Proceedings of the Eurospeech 2003, Geneva, 2003.
9 T. L. Nwe, S. W. Foo, and et al, "Speech emotion recognition using hidden markov models", Speech Communication, vol. 41(4), pp. 603-623, 2003.   DOI   ScienceOn
10 M. A. Bartsch and G. H. Wakefield, "Audio thumbnailing of popular music using chroma-based representation," IEEE Transactions on Multimedia, vol. 7(1), pp. 96-104, 2005.   DOI   ScienceOn
11 Y. E. Kim, E. M. Schmidt, R. Migneco, B. G. Morton, P. Richardson, J. Scott, J. A. Speck, and D. Turnbull, "Music emotion recognition: A state of the art review", Proc. 11th Int. Soc. Music Information Retrieval Conf.(ISMIR), pp. 255-266, 2010.
12 M. Muller, F. Kurth, and M. Clausen, "Audio matching via chroma- based statistical features", Proc. 5th Int. Soc. Music Information Retrieval Conf.(ISMIR), pp. 288-295, 2005.
13 T. F. Quatieri, Discrete-Time Speech Signal Processing, Prentice-Hall, NJ, 2002.
14 H. Purwins, "Profiles of Pitch Classes: Circularity of Relative Pitch and Key: Experiments, Models, Computational Music Analysis, and Perspectives," Ph. D. dissertation, Berlin Univ. of Technol., Berlin, Germany, 2005.
15 T. Lan, D. Erdogmus, U. Ozertem, and Y. Huang, "Estimating mutual information using Gaussian mixture model for feature ranking and selection", Proc. Int. Joint Conf. on Neural Networks, pp. 5034-5039, 2006.
16 B.-S. Kang, "Text-independent emotion recognition algorithm using speech signal," M. S. Thesis, Yonsei university, Electrical and Electronic Engineering Department, 2000.
17 C.-C. Chang and C.-J. Lin, "Libsvm: a library for support vector machines", ACM Transactions on Intelligent Systems and Technology, vol. 2, 27:1-26, 2011.
18 P. Shen, Z. Changjun, X. CHen, "Automatic Speech Emotion Recognition using Support Vector Machine," Int. Conf. on Electronic and Mechanical Engineering and Information Technology (EMEIT), vol. 2, pp. 621-625, 2011.