Browse > Article
http://dx.doi.org/10.7776/ASK.2006.25.6.269

A Study on Robust Speech Emotion Feature Extraction Under the Mobile Communication Environment  

Cho Youn-Ho (단국대학교 정보.컴퓨터과학과)
Park Kyu-Sik (단국대학교 정보. 컴퓨터과학과)
Abstract
In this paper, we propose an emotion recognition system that can discriminate human emotional state into neutral or anger from the speech captured by a cellular-phone in real time. In general. the speech through the mobile network contains environment noise and network noise, thus it can causes serious System performance degradation due to the distortion in emotional features of the query speech. In order to minimize the effect of these noise and so improve the system performance, we adopt a simple MA (Moving Average) filter which has relatively simple structure and low computational complexity, to alleviate the distortion in the emotional feature vector. Then a SFS (Sequential Forward Selection) feature optimization method is implemented to further improve and stabilize the system performance. Two pattern recognition method such as k-NN and SVM is compared for emotional state classification. The experimental results indicate that the proposed method provides very stable and successful emotional classification performance such as 86.5%. so that it will be very useful in application areas such as customer call-center.
Keywords
Mobile communication; Speech emotion recognition; MA filtering; SFS; Call center;
Citations & Related Records
연도 인용수 순위
  • Reference
1 P. de la Cuadra, A. Master and C. Sapp, 'Efficient Pitch Detection Techniques for Interactive Music', International Computer Music Conference, 403-406, Havana, Cuba, September, 2001
2 C. M. Lee, and S. S. Narayanan, 'Towards Detecting Emotions in Spoken Dialogs,' in IEEE Transactions on Speech and Audio Processing, 13 (2) 2005
3 Xuejing Sun, 'A Pitch Determination Algorithm Based On Subharmonic-to-Harmonic Ratio', International Conference on Spoken Language Processing '2000, 676-679, 2000
4 Noam Amir, 'Classifying Emotion in Speech: a Comparison of Methods', Proceedings of Euro Speech'2001, 1 127-130, Aalborg, Denmark, 2001
5 A. Nogueiras, A. Moreno, A. Bonafonte, and J. B. Marino, 'Speech Emotion Recognition Using Hidden Markov Models,' presented at Eurospeech 2001, Scandinavia, 2001
6 Anil Jain and Douglas Zongker, 'Feature Selection : Evaluation, Application, and Small Sample Performance', IEEE Pattern Analysis and Machine Intelligence, 19 (2) 153-158, 1997   DOI   ScienceOn
7 Guojun Zhou, John H. L. Hansen, and James F. Kaiser, 'Nonlinear Feature Based Classification of Speech Under Stress' IEEE Transactions on Speech and Audio Processing, 9 (3) 2001
8 T. Moriyama and Oazwa, 'Emotion Recognition and Synthesis System on Speech', IEEE International Conference on Multimedia Computing and Systems, 1 840-844, Florence, Italy, 1999
9 Lingyun Gu and Stephen A. Zahorian, 'A New Robust Algorithm for Isolated Word Endpoint Detection,' IV-4161 International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, 13-17, 2002
10 M. Liu and C. Wan, 'A Study on Content-based Classification Retrieval of Audio Database,' Proc. of the International Database Engineering & Applications Symposium, 339-345. 2001
11 M.J. Ross, H.L. Shaer, A. Cohen, R. Freudberg, and H. J. Manley, 'Average Magnitude Difference Function Pitch Extractor', Acoustics, Speech, and Signal Processing (see also IEEE Transactions on Signal Processing), IEEE Transactions on 22 (5) 353-362, Oct. 1974   DOI
12 강봉석, '음성 신호를 이용한 문장독립 감정 인식 시스템', 석사학위 논문, 연세대학교, 2001
13 F. Dellaert, T. Polzin, and A. Waibel, 'Recognizing Emotion in Speech', In Proc. International Conf. on Spoken Language Processing, 1970-1973, 1996