Speech Emotion Recognition by Speech Signals on a Simulated Intelligent Robot

모의 지능로봇에서 음성신호에 의한 감정인식

  • Jang, Kwang-Dong (Department of Control and Instrumentation Engineering, Chungbuk National University) ;
  • Kwon, Oh-Wook (Department of Control and Instrumentation Engineering, Chungbuk National University)
  • 장광동 (충북대학교 제어계측공학과) ;
  • 권오욱 (충북대학교 제어계측공학과)
  • Published : 2005.11.17

Abstract

We propose a speech emotion recognition method for natural human-robot interface. In the proposed method, emotion is classified into 6 classes: Angry, bored, happy, neutral, sad and surprised. Features for an input utterance are extracted from statistics of phonetic and prosodic information. Phonetic information includes log energy, shimmer, formant frequencies, and Teager energy; Prosodic information includes pitch, jitter, duration, and rate of speech. Finally a patten classifier based on Gaussian support vector machines decides the emotion class of the utterance. We record speech commands and dialogs uttered at 2m away from microphones in 5different directions. Experimental results show that the proposed method yields 59% classification accuracy while human classifiers give about 50%accuracy, which confirms that the proposed method achieves performance comparable to a human.

Keywords