[KSCI] Korea Science Citation Index Service

Self-Adaptation Algorithm Based on Maximum A Posteriori Eigenvoice for Korean Connected Digit Recognition

Kim Dong Kook (전남대학교 전자컴퓨터정보통신공학부)
Jeon Hyung Bae (한국전자통신연구원)

Publication Information

The Journal of the Acoustical Society of Korea / v.23, no.8, 2004 , pp. 590-596 More about this Journal

Abstract

This paper Presents a new self-adaptation algorithm based on maximum a posteriori (MAP) eigenvoice for Korean connected digit recognition. The proposed MAP eigenvoice is developed by introducing a probability density model for the eigenvoice coefficients. The Proposed approach provides a unified framework that incorporates the Prior model into the conventional eigenvoice estimation. In self-adaptation system we use only one adaptation utterance that will be recognized, we use MAP eigenvoice that is most robust adaptation. In series of self-adaptation experiments on the Korean connected digit recognition task. we demonstrate that the performance of the proposed approach is better than that of the conventional eigenvoice algorithm for a small amount of adaptation data.

Keywords

Maximum a Posteriori; Speaker Adaptation; Self-adaptation; Speech Recognition;

Citations & Related Records

Reference

1	D. K. Kim, Y. J. Kim, W H. Lim, and N. S. Kim, 'Online adaptation using transformation space model evolution,' in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2003
2	P. Nguyen, 'Speaker adaptation: Modeling variabilities,' Ph.D. thesis, 2002
3	R. Kuhn, F. Perronnin and J. -C. Junqua, 'Time is money: why very rapid adaPtation matters,' in Proc. Adaptation Methods for Speech Recognition, ISCA ITR-Workshop, Sophia-Antipolis, France, 33-36, 2001
4	C. J. Leggetter and P. C. Woodland, 'Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models.' Computer Speech and Language, 9, 171-185, 1995 DOI ScienceOn
5	W Chou, 'Maximum a posteriori linear regression with elliPtically symmetric matrix priors,' in Proc. Euro. Conf. Speech Commun., Technology, 1, 1-4, 1999
6	R. Kuhn, J. -C. Junqua, P. Nguyen and N. Niedzielski, 'Rapid speaker adaptation in Eigenvoice Space,' IEEE Trans. Speech and Audio Proc.. 8(6), 695-707, 2000 DOI ScienceOn
7	Ho-Young Jung, Mansoo Park, Hoi-Rin Kim, and Minsoo Hahn, 'Speaker Adaptation Using ICA-Based Feature Transfomation,' ETRI J., 24(6), pp,469-472, Dec. 2002 DOI ScienceOn
8	J .L. Gauvain and C. -H. Lee, 'Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,' IEEE Trans. Speech and Audio Proc., 2, 291-298, 1994 DOI ScienceOn
9	P. C. Woodland. 'Speaker adaPtation for continuous density HMMs; a review,' in Proc. AdaPtation Methods for Speech Recognition, ISCA ITR-Workshop, Sophia-Antipolis, France, pp. 11-19, 2001
10	I. T. Jolliffe, Principal Component Analysis. Springer-Verlag, 1986
11	전형배, 김동국, '연결 숫자음 인식에서의 고속 화자 적응', 제 20회 음성통신 및 신호처리 학술대회 논문집, pp 441-444, 2003
12	K. -T. Chen, W -W Liau, H. -M. Wang, and L. -So Lee, 'Fast speaker adaptation using eigenspace-based maximum likelihood linear regression,' in Proc. Int. Conf. Spoken Language Processing, Beijing, China, 742-745, Oct. 2000
13	D. K. Kim and N. S. Kim, 'Rapid speaker adaptation using probabilistic principal component analysis,' IEEE Signal Processing Letters. 8(6), 180-183, June 2001 DOI ScienceOn

KSCI

Self-Adaptation Algorithm Based on Maximum A Posteriori Eigenvoice for Korean Connected Digit Recognition 한국어 연결 숫자음 인식을 일한 최대 사후 Eigenvoice에 근거한 자기적응 기법

Self-Adaptation Algorithm Based on Maximum A Posteriori Eigenvoice for Korean Connected Digit Recognition