Browse > Article

A Study on the Speaker Adaptation in CDHMM  

Kim, Gwang-Tae (Dept.of Electronics Electric Engineering, Sangmyung University)
Publication Information
Abstract
A new approach to improve the speaker adaptation algorithm by means of the variable number of observation density functions for CDHMM speech recognizer has been proposed. The proposed method uses the observation density function with more than one mixture in each state to represent speech characteristics in detail. The number of mixtures in each state is determined by the number of frames and the determinant of the variance, respectively. The each MAP Parameter is extracted in every mixture determined by these two methods. In addition, the state segmentation method requiring speaker adaptation can segment the adapting speech more Precisely by using speaker-independent model trained from sufficient database as a priori knowledge. And the state duration distribution is used lot adapting the speech duration information owing to speaker's utterance habit and speed. The recognition rate of the proposed methods are significantly higher than that of the conventional method using one mixture in each state.
Keywords
Speaker adaptation; speech recognition; speaker dependent; speaker independent; Viterbi algorithm;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs /
[ K.Ohkura;M.Sugiyama;S.Sagayama ] / Proc. ICSLP
2 Predictive speaker adaptation in speech recongnition /
[ S.Cox ] / Computer Speech and Language   DOI   ScienceOn
3 Studies in transformation-based adaptation /
[ V.Nagesha;L.Gillick ] / Proc. ICASSP97
4 K. F. Lee and R. Rebby, Automatic speech recognition, Kluwer Academic, 1989
5 Telephone line characteristics adaptation using vector field smoothing technique /
[ J.Takahashi;S.Sagayama ] / Proc. ICSLP
6 CDHMM의 상태당 가지수를 가변시키는 화자적응에 관한 연구 /
[ 김광태;서정일;홍재근 ] / 대한전자공학회 논문집   과학기술학회마을
7 L. R. Rabiner and B. H. Juang, 'An introduction to hidden Markov models,' IEEE ASSP Mag., pp. 4-16, Jan. 1986
8 B. S. Atal and L. R. Rabiner, 'Speech research directions,' AT&T Tech. J., vol. 65, pp. 75-88, Sep.-Oct., 1986   DOI
9 H. Sakoe and S. Chiba, 'Dynamic programming algorithm optimization for spoken word recognition,' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, pp, 43-49, Feb. 1978
10 'A neural network approach to speech recognition,' Tech. Report, ETRI. Advanced Research Dept., Mar. 1990
11 C. H. Lee, C. H. Lin, and B. H. Juang, 'A study on speaker adaptation of the parameters of continuous density hidden Markov models,' IEEE Trans. on Signal Processing, vol. 39, no. 4, pp. 803-814, Apr. 1991   DOI
12 V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, 'Speaker adaptation using cons-trained estimation of Gaussian mixtures,' IEEE Trans. on Speech and Audio Processing, vol. 3, no. 5, pp. 357-365, Sep, 1995   DOI   ScienceOn
13 Y. Shiraki and M. Honda, 'Speaker adaptation algorithms for segment vocoder,' IEICE, vol. SP87-67, pp. 49-56, Oct. 1987
14 Y. Hao and D. Fang, 'Speech recognition using speaker adaptation by system parameter transformation,' IEEE Trans. on Speech and Audio Processing, vol. 2, no. 1, pp. 63-68, Jan. 1994   DOI   ScienceOn
15 S. Furui, 'Unsupervised speaker adaptation method based on hierarchical spectral clus-tering,' Proc. ICASSP89, pp. 286-289, May 1989   DOI
16 K. Shikano, K. F. Lee, and R. Reddy, 'Speaker adaptation through vector quantization,' Proc. ICASSP86, pp. 2643-2646, Apr. 1986
17 P. F. Brown, C. H. Lee, and J. C. Spohrer, 'Bayesian adaptation in speech recognition,' Proc. ICASSP83, pp. 761-764, Apr. 1983
18 J. Chien, H. Wang, and C. Lee, 'Improved Bayesian learning of hidden Markov models for speaker adaptation,' Proc. ICASSP97, pp. 1027-1030, Apr. 1997   DOI
19 S. Cox, 'Predictive speaker adaptation in speech recognition', Computer Speech and Language, vol. 9, pp. 1-17, 1995   DOI   ScienceOn
20 V. Nagesha and L. Gillick, 'Studies in transfor-mation-based adaptation,' Proc. ICASSP97, pp. 1031-1034, Apr. 1997
21 R. M. Stern and M. J. Lasry, 'Dynamic speaker adaptation for feature-based isolated word recognition,' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, no. 6, June 1987
22 K. Ohkura, M. Sugiyama, and S. Sagayama, 'Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs,' Proc. ICSLP, pp. 369-372, 1992
23 김광태, 서정일, 홍재근, 'CDHMM의 상태당 가지수를 가변시키는 화자적응에 관한 연구,' 대한전자공학회 논문집, 제35C권 제3호, pp. 166-175, 1998
24 J. Takahashi and S. Sagayama, 'Telephone line characteristic adaptation using vector field smoothing technique,' Proc. ICSLP, pp. 991-994, 1994