Acoustic parameters for induced emotion categorizing and dimensional approach

Park, Ji-Eun;Park, Jeong-Sik;Sohn, Jin-Hun;

감성과학 (Science of Emotion and Sensibility)

제16권1호
/
Pages.117-124
/
2013
/
1226-8593(pISSN)
/
2383-613X(eISSN)

한국감성과학회 (Korean Society for Emotion and Sensibility)

자연스러운 정서 반응의 범주 및 차원 분류에 적합한 음성 파라미터

Acoustic parameters for induced emotion categorizing and dimensional approach

박지은 (충남대학교 심리학과/뇌과학연구소) ;
박정식 (목원대학교 공과대학 지능로봇공학과) ;
손진훈 (충남대학교 심리학과/뇌과학연구소)

Park, Ji-Eun (Department of Psychology, Brain Research Institute, Chungnam National University) ;
Park, Jeong-Sik (Department of Intelligent Robot Engineering, Mokwon University) ;
Sohn, Jin-Hun (Department of Psychology, Brain Research Institute, Chungnam National University)

투고 : 2013.02.27
심사 : 2013.03.29
발행 : 2013.03.30

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

본 연구는 음성 인식기에서 일반적으로 사용되는 음향적 특징인 MFCC, LPC, 에너지, 피치 관련 파라미터들을 이용하여 자연스러운 음성의 정서를 범주 및 차원으로 얼마나 잘 인식할 수 있는지 살펴보았다. 자연스러운 정서 반응 데이터를 얻기 위해 선행 연구에서 이미 타당도와 효과성이 밝혀진 정서 유발 자극을 사용하였고, 110명의 대학생들에게 7가지 정서 유발 자극을 제시한 후 유발된 음성 반응을 녹음하여 분석에 사용하였다. 각 음성 데이터에서 추출한 파라미터들을 독립변인으로 하여 선형 판별 분석(LDA)으로 7가지 정서 범주를 분류하였고, 범주 분류의 한계를 극복하기 위해 단계별 다중회귀(stepwise multiple regression) 모형을 도출하여 4가지 정서 차원(valence, arousal, intensity, potency)을 가장 잘 예측하는 음성 특징 파라미터를 산출하였다. 7가지 정서 범주 판별율은 평균 62.7%이었고, 4 차원 예측 회귀모형들도 p<.001수준에서 통계적으로 유의하였다. 결론적으로, 본 연구 결과는 자연스러운 감정의 음성 반응을 분류하는데 유용한 파라미터들을 선정하여 정서의 범주와 차원적 접근으로 정서 분류 가능성을 보였으며 논의에 본 연구의 개선방향에 대해 기술하였다.

This study examined that how precisely MFCC, LPC, energy, and pitch related parameters of the speech data, which have been used mainly for voice recognition system could predict the vocal emotion categories as well as dimensions of vocal emotion. 110 college students participated in this experiment. For more realistic emotional response, we used well defined emotion-inducing stimuli. This study analyzed the relationship between the parameters of MFCC, LPC, energy, and pitch of the speech data and four emotional dimensions (valence, arousal, intensity, and potency). Because dimensional approach is more useful for realistic emotion classification. It results in the best vocal cue parameters for predicting each of dimensions by stepwise multiple regression analysis. Emotion categorizing accuracy analyzed by LDA is 62.7%, and four dimension regression models are statistically significant, p<.001. Consequently, this result showed the possibility that the parameters could also be applied to spontaneous vocal emotion recognition.

감성과학 (Science of Emotion and Sensibility)

자연스러운 정서 반응의 범주 및 차원 분류에 적합한 음성 파라미터

Acoustic parameters for induced emotion categorizing and dimensional approach

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)