http://dx.doi.org/10.9708/jksci.2014.19.11.159

How to Express Emotion: Role of Prosody and Voice Quality Parameters  

Lee, Sang-Min (Ethical Leader Path College, Catholic University of Korea)
Lee, Ho-Joon (Dept. of Smart IT, Youngdong University)
Abstract
In this paper, we examine the role of emotional acoustic cues, including both prosody and voice quality parameters, in the modification of a word sense. To extract the prosody and voice quality parameters, we used 60 pieces of speech data spoken by six speakers in five different emotional states. We analyzed eight different emotional acoustic cues and used a discriminant analysis technique to find the dominant sequence of acoustic cues. As a result, we found that anger has a close relation with intensity level and 2nd formant bandwidth range; joy has a relative relation with the position of 2nd and 3rd formant values and intensity level; sadness has a strong relation only with prosody cues such as intensity level and pitch level; and fear has a relation with pitch level and 2nd formant value with its bandwidth range. These findings can be used as a guideline for fine-tuning an emotional spoken language generation system, because these distinct sequences of acoustic cues reveal the subtle characteristics of each emotional state.
Keywords
Emotional Acoustic Cues; Emotional Prosody Structure; Emotional Voice Quality Parameter; Korean Emotional Speech Analysis; Emotional Speech Recognition; Emotional Speech Generation