Feature Vector Processing for Speech Emotion Recognition in Noisy Environments

Park, Jeong-Sik; Oh, Yung-Hwan

Phonetics and Speech Sciences (말소리와 음성과학)

Volume 2, Issue 1, Pages 77-85, 2010; pISSN 2005-8063, eISSN 2586-5854

Korean Society of Speech Sciences (한국음성학회)


Jeong-Sik Park (Department of Computer Science, KAIST);
Yung-Hwan Oh (Department of Computer Science, KAIST)

Received: 2009.11.01
Accepted: 2010.01.16
Published: 2010.03


Abstract

This paper proposes an efficient feature vector processing technique that protects a Speech Emotion Recognition (SER) system against a variety of noises. In the proposed approach, emotional feature vectors are extracted from speech processed by comb filtering, and these vectors are then used to construct robust models based on feature vector classification. We modify conventional comb filtering by using the speech presence probability to minimize the drawbacks caused by incorrect pitch estimation under background noise. The modified comb filtering can correctly enhance the harmonics, which are an important factor in SER. The feature vector classification technique categorizes feature vectors as either discriminative or non-discriminative based on a log-likelihood criterion. This method can successfully select the discriminative vectors while preserving correct emotional characteristics, so robust emotion models can be constructed using only such discriminative vectors. In SER experiments using an emotional speech corpus contaminated by various noises, our approach outperformed the baseline system.
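The abstract does not give the exact log-likelihood criterion, so the following is only a minimal sketch of one plausible reading: a training vector counts as discriminative when its log-likelihood under the model of its own emotion exceeds that of every competing emotion model by some margin. The diagonal-covariance Gaussian models, the `margin` threshold, the function names, and the toy data are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def gaussian_loglik(x, mean, var):
    """Per-vector log-likelihood under a diagonal-covariance Gaussian (assumed model)."""
    x = np.atleast_2d(x)
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var, axis=1)

def select_discriminative(features, label, models, margin=0.0):
    """Boolean mask: True where the labelled emotion's model beats every
    rival model's log-likelihood by more than `margin` (hypothetical criterion)."""
    scores = {name: gaussian_loglik(features, m, v) for name, (m, v) in models.items()}
    own = scores[label]
    rivals = np.max([s for name, s in scores.items() if name != label], axis=0)
    return (own - rivals) > margin

# Toy two-emotion setup; means and variances are made up for illustration.
models = {
    "angry": (np.array([2.0, 2.0]), np.array([1.0, 1.0])),
    "neutral": (np.array([-2.0, -2.0]), np.array([1.0, 1.0])),
}
# Three "angry"-labelled vectors: one clear, one ambiguous, one closer to neutral.
features = np.array([[2.1, 1.9], [0.2, 0.1], [-1.5, -2.2]])
mask = select_discriminative(features, "angry", models, margin=2.0)
discriminative = features[mask]  # vectors that would be kept for model training
```

In this sketch only the first vector passes the margin test, so a robust "angry" model would be re-estimated from `discriminative` alone; the ambiguous and mislabelled-looking vectors are set aside as non-discriminative.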
