Browse > Article

Gaussian Density Selection Method of CDHMM in Speaker Recognition  

서창우 ((주)인스모바일 기술연구소)
이주헌 (동아방송대학 인터넷방송과)
임재열 (한국기술교육대학교 정보기술공학부)
이기용 (숭실대학교 정보통신전자공학부)
Abstract
This paper proposes the method to select the number of optimal mixtures in each state in Continuous Density HMM (Hidden Markov Models), Previously, researchers used the same number of mixture components in each state of HMM regardless spectral characteristic of speaker, To model each speaker as accurately as possible, we propose to use a different number of mixture components for each state, Selection of mixture components considered the probability value of mixture by each state that affects much parameter estimation of continuous density HMM, Also, we use PCA (principal component analysis) to reduce the correlation and obtain the system' stability when it is reduced the number of mixture components, We experiment it when the proposed method used average 10% small mixture components than the conventional HMM, When experiment result is only applied selection of mixture components, the proposed method could get the similar performance, When we used principal component analysis, the feature vector of the 16 order could get the performance decrease of average 0,35% and the 25 order performance improvement of average 0.65%.
Keywords
Speaker recognition; CDHMM; Gaussian density; Principal component analysis;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. E. Rosenberg, C. H. Lee, and F. K. Soong, 'Subword talker verification using hidden Markov models,' IEEE ICASSP, 269-272, 1990
2 K. F. Lee, Automatic Speech Recognition: the development of the SPHINX system, Kluwer Academic, 1989
3 R. C . Vasko, A. EI-Jarudi, J. R. Boston, 'An algorithm to determine hidden Markov model topology,' ICASS 96, 6, 3577-3580, May, 1996
4 Y. Ariki, S. Tagashira, and M. Nishijima, 'Speaker recognition and speaker normalization by projection to speaker subspace,' ICASSP 96, Atlanta, USA, 319-322, 1996
5 A. Biem, H. Jin-Young, and J. Subrahmonia, 'A bayesian model selection criterion for HMM topology optimization,' ICASSP 02, 1, 989-992, May 2002
6 X. D. Huang, Y. Ariki, and M. A. Jack, Hidden Markov Models for Speech Recognition. Edinburgh University, 1990
7 S. Furui, 'An overview of speaker recognition technology,' ESCA workshop on Automatic Speaker Recognition Identification Verification, 1-9, 1994
8 L. Liu and J. He, 'On the use of orthogonal GMM in speaker recognition,' Pro. ICASSP, 845-849, 1999