CASA-based Front-end Using Two-channel Speech for the Performance Improvement of Speech Recognition in Noisy Environments

  • Park, Ji-Hun (Department of Information and Communication, Gwangju Institute of Science and Technology)
  • Yoon, Jae-Sam (Department of Information and Communication, Gwangju Institute of Science and Technology)
  • Kim, Hong-Kook (Department of Information and Communication, Gwangju Institute of Science and Technology)
  • Published: 2007.07.11

Abstract

To improve speech recognition performance in the presence of noise, we propose a noise-robust front-end that uses two-channel speech signals and separates speech from noise based on computational auditory scene analysis (CASA). The main cues for the separation are the interaural time difference (ITD) and the interaural level difference (ILD) between the two channel signals. From the separated speech components, 39 cepstral coefficients are extracted. Speech recognition experiments show that the proposed front-end outperforms the ETSI front-end with single-channel speech.
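
To make the two separation cues concrete, the following is a minimal sketch of how an ITD and an ILD could be estimated for a single frame of two-channel audio. It is not the authors' implementation: the abstract does not describe the filterbank, frame length, or sampling rate, so the sketch works on the full-band signal, and the names `cross_correlation`, `estimate_itd_ild`, and `demo_frame`, as well as the 8 kHz sampling rate, are assumptions made purely for illustration. A CASA front-end would typically compute such cues per frequency channel and time frame rather than once per frame.

```python
import math
import random


def cross_correlation(left, right, lag):
    """Sum of left[n + lag] * right[n] over the samples where both exist."""
    total = 0.0
    for n in range(len(right)):
        m = n + lag
        if 0 <= m < len(left):
            total += left[m] * right[n]
    return total


def estimate_itd_ild(left, right, sample_rate, max_lag):
    """Return (itd_seconds, ild_db) for one frame of two-channel audio.

    A positive ITD means the left channel lags the right channel.
    A positive ILD means the left channel carries more energy.
    """
    # ITD: the lag (in samples) that maximizes the cross-correlation.
    best_lag = max(
        range(-max_lag, max_lag + 1),
        key=lambda lag: cross_correlation(left, right, lag),
    )
    itd = best_lag / sample_rate

    # ILD: energy ratio of the two channels in decibels.
    eps = 1e-12  # guards against log(0) on silent frames
    energy_left = sum(x * x for x in left) + eps
    energy_right = sum(x * x for x in right) + eps
    ild = 10.0 * math.log10(energy_left / energy_right)
    return itd, ild


def demo_frame(num_samples=256, delay=3, gain=0.5, seed=0):
    """Hypothetical test frame: left is right delayed by `delay` samples
    and scaled by `gain` (synthetic noise, not data from the paper)."""
    rng = random.Random(seed)
    source = [rng.uniform(-1.0, 1.0) for _ in range(num_samples + delay)]
    right = source[delay:]
    left = [gain * x for x in source[:num_samples]]
    return left, right


if __name__ == "__main__":
    left, right = demo_frame()
    # 8000 Hz is an assumed sampling rate for this illustration only.
    itd, ild = estimate_itd_ild(left, right, sample_rate=8000, max_lag=8)
    print(f"ITD = {itd * 1e3:.3f} ms, ILD = {ild:.2f} dB")
```

On the synthetic frame, the estimated lag is 3 samples (the left channel lags the right) and the ILD comes out near -6 dB, reflecting the 0.5 gain. In a separation setting, time-frequency regions whose cues match the target direction would be kept and the rest attenuated, but the grouping rule used in the paper is not given in the abstract.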

Keywords