Robust Speech Recognition by Utilizing Class Histogram Equalization

클래스 히스토그램 등화 기법에 의한 강인한 음성 인식

  • 서영주 (한국정보통신대학교 공학부) ;
  • 김회린 (한국정보통신대학교 공학부) ;
  • 이윤근 (한국전자통신연구원 음성언어정보연구센터 음성처리연구팀)
  • Published : 2006.12.30

Abstract

This paper proposes class histogram equalization (CHEQ) to compensate noisy acoustic features for robust speech recognition. CHEQ aims to compensate for the acoustic mismatch between training and test speech recognition environments as well as to reduce the limitations of the conventional histogram equalization (HEQ). In contrast to HEQ, CHEQ adopts multiple class-specific distribution functions for training and test environments and equalizes the features by using their class-specific training and test distributions. According to the class-information extraction methods, CHEQ is further classified into two forms such as hard-CHEQ based on vector quantization and soft-CHEQ using the Gaussian mixture model. Experiments on the Aurora 2 database confirmed the effectiveness of CHEQ by producing a relative word error reduction of 61.17% over the baseline met-cepstral features and that of 19.62% over the conventional HEQ.

Keywords