Annual Conference of KIPS (한국정보처리학회:학술대회논문집)
- 2023.11a
- /
- Pages.526-527
- /
- 2023
- /
- 2005-0011(pISSN)
- /
- 2671-7298(eISSN)
DOI QR Code
Speech and Textual Data Fusion for Emotion Detection: A Multimodal Deep Learning Approach
감정 인지를 위한 음성 및 텍스트 데이터 퓨전: 다중 모달 딥 러닝 접근법
- Edward Dwijayanto Cahyadi (School of Smart IT , Semyung University) ;
- Mi-Hwa Song (School of Smart IT , Semyung University)
- Published : 2023.11.02
Abstract
Speech emotion recognition(SER) is one of the interesting topics in the machine learning field. By developing multi-modal speech emotion recognition system, we can get numerous benefits. This paper explain about fusing BERT as the text recognizer and CNN as the speech recognizer to built a multi-modal SER system.
Keywords