A study on the speaker adaptation in CDHMM usling variable number of mixtures in each state

CDHMM의 상태당 가지 수를 가변시키는 화자적응에 관한 연구

  • 김광태 (상주산업대학교 전자전기공학과) ;
  • 서정일 (경북대학교 전자전기공학부) ;
  • 홍재근 (경북대학교 전자전기공학부)
  • Published : 1998.03.01

Abstract

When we make a speaker adapted model using MAPE (maximum a posteriori estimation), the adapted model has one mixture in each state. This is because we cannot estimate a number of a priori distribution from a speaker-independent model in each state. If the model is represented by one mixture in each state, it is not well adadpted to specific speaker because it is difficult to represent various speech informationof the speaker with one mixture. In this paper, we suggest the method using several mixtures to well represent various speech information of the speaker in each state. But, because speaker-specific training dat is not sufficient, this method can't be used in every state. So, we make the number of mixtures in each state variable in proportion to the number of frames and to the determinant ofthe variance matrix in the state. Using the proposed method, we reduced the error rate than methods using one branch in each state.

Keywords