Browse > Article
http://dx.doi.org/10.7776/ASK.2009.28.6.566

Simultaneous Speaker and Environment Adaptation by Environment Clustering in Various Noise Environments  

Kim, Young-Kuk (LG전자기술원)
Song, Hwa-Jeon (부산대학교 전자전기공학부)
Kim, Hyung-Soon (부산대학교 전자전기공학부)
Abstract
This paper proposes noise-robust fast speaker adaptation method based on the eigenvoice framework in various noisy environments. The proposed method is focused on de-noising and environment clustering. Since the de-noised adaptation DB still has residual noise in itself, environment clustering divides the noisy adaptation data into similar environments by a clustering method using the cepstral mean of non-speech segments as a feature vector. Then each adaptation data in the same cluster is used to build an environment-clustered speaker adapted (SA) model. After selecting multiple environmentally clustered SA models which are similar to test environment, the speaker adaptation based on an appropriate linear combination of clustered SA models is conducted. According to our experiments, we observe that the proposed method provides error rate reduction of $40{\sim}59%$ over baseline with speaker independent model.
Keywords
Speech recognition; Speaker adaptation; Environment clustering; Eigenvoice; Eigen-environment;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Kuhn, P. Nguyen, J. C. Jungua, L. Goldwasser, N. Nied-zielski, S. Finche, K. Field, and M. Contolini, "Eigenvoices for speaker adaptation," in Proc. ICSLP, pp 1771-1774, Nov. 30-Dec. 4, 1998
2 C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol.9, no.1, pp.171-185, 1995   DOI   ScienceOn
3 C. H. Lee, C. H. Lin, and B. H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Signal Processing, vol.39, no.4, pp.806-814, 1991   DOI
4 H. J. Song and H. S. Kim, “Eigen-environment based noise compensation method for robust speech recognition,” In Proc. Eurospeech, pp.981-984, Sep. 2005
5 H. J. Song and H. S. Kim, "Simultaneous estimation of weights of eigenvoices and bias compensation vector for rapid speaker adaptation," In Proc. ICSLP, pp.2945-2948, Oct. 2004
6 ITU recommendation P.56, Objective measurement of active speech IeveI, Mar., 1993