Unsupervised Speaker Adaptation Based on Sufficient HMM Statistics

SUFFICIENT HMM 통계치에 기반한 UNSUPERVISED 화자 적응

  • 고봉옥 (전북대학교 전자정보공학부) ;
  • 김종교 (전북대학교 전자정보공학부)
  • Published : 2003.05.01

Abstract

This paper describes an efficient method for unsupervised speaker adaptation. This method is based on selecting a subset of speakers who are acoustically close to a test speaker, and calculating adapted model parameters according to the previously stored sufficient HMM statistics of the selected speakers' data. In this method, only a few unsupervised test speaker's data are required for the adaptation. Also, by using the sufficient HMM statistics of the selected speakers' data, a quick adaptation can be done. Compared with a pre-clustering method, the proposed method can obtain a more optimal speaker cluster because the clustering result is determined according to test speaker's data on-line. Experiment results show that the proposed method attains better improvement than MLLR from the speaker independent model. Moreover the proposed method utilizes only one unsupervised sentence utterance, while MLLR usually utilizes more than ten supervised sentence utterances.

Keywords