Browse > Article

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments  

Yoon, Jang-Hyuk (Department of Electronics Engineering, Keimyung University)
Chung, Young-Joo (Department of Electronics Engineering, Keimyung University)
Abstract
Multiple-model speech recognizer has been shown to be quite successful in noisy speech recognition. However, its performance has usually been tested using the general speech front-ends which do not incorporate any noise adaptive algorithms. For the accurate evaluation of the effectiveness of the multiple-model frame in noisy speech recognition, we used the state-of-the-art front-ends and compared its performance with the well-known multi-style training method. In addition, we improved the multiple-model speech recognizer by employing N-best reference HMMs for interpolation and using multiple SNR levels for training each of the reference HMM.
Keywords
HMM; Multi-model Speech Recognizer; Noise Robustness;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. Macho, L. Mauuary, B. Noe, Y. Cheng, D. Eahey, D. Jouvet, H. Kelleher, D. Pearce, F. Saadoun, "Evaluation of a noise-robust DSR front-end on Aurora databases", in Proc. ICSLP, pp. 17-20, 2002.
2 B. H. Juang and L. R. Rabiner, "A Probabilistic Distance Measure for Hidden Markov Models", AT&T Technology Journal, pp. 391-408, 1984.
3 D. Pearce and H. Hirsch, The Aurora experimental framework for the performance evaluation of speech recognition systems under conditions", in Proc. ICSLP, pp.29-32, 2000.
4 S. F. Ball. "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. 27, pp.113-120, 1979.   DOI
5 ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced Front-end feature extraction algorithm; Compression algorithm, ETSI Standard ES 202 050. 2002.
6 ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm Compression algorithm, ETSI Standard ES 202 108., 2000.
7 H. Xu, Z.-H. Tan, P. Dalsgaard and B. Lindberg, "Robust Speech Recognition on Noise and SNR Classification-a Multiple-Model Framework", in Proc. Interspeech, 2005.
8 M. J. F. Gales, "Model based techniques for noise-robust speech recognition", Ph.D. Dissertation, University of Cambridge, 1995.
9 P. J. Moreno, "Speech recognition in noisy environments", Ph.D. Dissertation, Carnegie Mellon University, 1996.