
Self-Adaptation Algorithm Based on Maximum A Posteriori Eigenvoice for Korean Connected Digit Recognition  

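Kim Dong Kook (School of Electronics, Computer, Information and Communication Engineering, Chonnam National University)
Jeon Hyung Bae (Electronics and Telecommunications Research Institute)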
Abstract
This paper presents a new self-adaptation algorithm based on maximum a posteriori (MAP) eigenvoice for Korean connected digit recognition. The proposed MAP eigenvoice is developed by introducing a probability density model for the eigenvoice coefficients. The proposed approach provides a unified framework that incorporates the prior model into conventional eigenvoice estimation. In a self-adaptation system, the only adaptation data is the single utterance that is to be recognized, so we use the MAP eigenvoice, which is the most robust adaptation method. In a series of self-adaptation experiments on the Korean connected digit recognition task, we demonstrate that the proposed approach outperforms the conventional eigenvoice algorithm when only a small amount of adaptation data is available.
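The abstract does not give the estimation formulas, so the following is only a minimal sketch of the general idea, under assumptions that are not taken from the paper: the adapted Gaussian means are modeled as a speaker-independent mean plus a weighted sum of eigenvoices, the covariances are diagonal, and the prior on the eigenvoice coefficients is a single Gaussian. The function name `estimate_eigenvoice_weights` and all of its parameters are hypothetical. Without a prior, the function returns a maximum-likelihood estimate; with a prior, it returns the corresponding MAP estimate.

```python
import numpy as np


def estimate_eigenvoice_weights(gamma, first_order, mu0, precision, eigvoices,
                                prior_mean=None, prior_precision=None):
    """Estimate eigenvoice coefficients w for means mu_m = mu0_m + sum_k w_k * e_{k,m}.

    Illustrative sketch only (assumed Gaussian prior, diagonal covariances).

    gamma           : (M,)      occupancy count of each Gaussian
    first_order     : (M, d)    sum over frames of gamma_m(t) * o_t
    mu0             : (M, d)    speaker-independent means
    precision       : (M, d)    inverse diagonal variances
    eigvoices       : (K, M, d) eigenvoice basis
    prior_mean      : (K,)      assumed prior mean of w (optional)
    prior_precision : (K, K)    assumed prior inverse covariance of w (optional)
    """
    # Statistics centered on the speaker-independent means.
    centered = first_order - gamma[:, None] * mu0
    # Normal equations A w = b of the weighted least-squares (ML) problem.
    b = np.einsum('kmd,md,md->k', eigvoices, precision, centered)
    A = np.einsum('kmd,md,m,jmd->kj', eigvoices, precision, gamma, eigvoices)
    if prior_mean is not None and prior_precision is not None:
        # A Gaussian prior adds a quadratic penalty, which regularizes A
        # and pulls the solution toward the prior mean.
        A = A + prior_precision
        b = b + prior_precision @ prior_mean
    return np.linalg.solve(A, b)
```

Under these assumptions, the prior term dominates when the occupancy counts are small, as with a single short utterance, and the estimate stays close to the prior mean. As the counts grow, the data term dominates and the result approaches the maximum-likelihood solution.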
Keywords
Maximum a Posteriori; Speaker Adaptation; Self-adaptation; Speech Recognition