Rapid Speaker Adaptation for Continuous Speech Recognition Using Merging Eigenvoices

Eigenvoice 병합을 이용한 연속 음성 인식 시스템의 고속 화자 적응

  • 최동진 (한국과학기술원(KAIST) 전자전산학과 전산학전공 음성언어연구실) ;
  • 오영환 (한국과학기술원(KAIST) 전자전산학과 전산학전공 음성언어연구실)
  • Published : 2005.03.01

Abstract

Speaker adaptation in eigenvoice space is a popular method for rapid speaker adaptation. To improve the performance of the method, the number of speaker dependent models should be increased and eigenvoices should be re-estimated. However, principal component analysis takes much time to find eigenvoices, especially in a continuous speech recognition system. This paper describes a method to reduce computation time to estimate eigenvoices only for supplementary speaker dependent models and to merge them with the used eigenvoices. Experiment results show that the computation time is reduced by 73.7% while the performance is almost the same in case that the number of speaker dependent models is the same as used ones.

Keywords