[KSCI] Korea Science Citation Index Service

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments

Yoon, Jang-Hyuk (Department of Electronics Engineering, Keimyung University)
Chung, Young-Joo (Department of Electronics Engineering, Keimyung University)

Publication Information

The Journal of the Acoustical Society of Korea / v.29, no.2E, 2010 , pp. 100-106 More about this Journal

Abstract

Multiple-model speech recognizer has been shown to be quite successful in noisy speech recognition. However, its performance has usually been tested using the general speech front-ends which do not incorporate any noise adaptive algorithms. For the accurate evaluation of the effectiveness of the multiple-model frame in noisy speech recognition, we used the state-of-the-art front-ends and compared its performance with the well-known multi-style training method. In addition, we improved the multiple-model speech recognizer by employing N-best reference HMMs for interpolation and using multiple SNR levels for training each of the reference HMM.

Keywords

HMM; Multi-model Speech Recognizer; Noise Robustness;

Citations & Related Records

Reference

1	D. Macho, L. Mauuary, B. Noe, Y. Cheng, D. Eahey, D. Jouvet, H. Kelleher, D. Pearce, F. Saadoun, "Evaluation of a noise-robust DSR front-end on Aurora databases", in Proc. ICSLP, pp. 17-20, 2002.
2	B. H. Juang and L. R. Rabiner, "A Probabilistic Distance Measure for Hidden Markov Models", AT&T Technology Journal, pp. 391-408, 1984.
3	D. Pearce and H. Hirsch, The Aurora experimental framework for the performance evaluation of speech recognition systems under conditions", in Proc. ICSLP, pp.29-32, 2000.
4	S. F. Ball. "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. 27, pp.113-120, 1979. DOI
5	ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced Front-end feature extraction algorithm; Compression algorithm, ETSI Standard ES 202 050. 2002.
6	ETSI draft standard doc. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm Compression algorithm, ETSI Standard ES 202 108., 2000.
7	H. Xu, Z.-H. Tan, P. Dalsgaard and B. Lindberg, "Robust Speech Recognition on Noise and SNR Classification-a Multiple-Model Framework", in Proc. Interspeech, 2005.
8	M. J. F. Gales, "Model based techniques for noise-robust speech recognition", Ph.D. Dissertation, University of Cambridge, 1995.
9	P. J. Moreno, "Speech recognition in noisy environments", Ph.D. Dissertation, Carnegie Mellon University, 1996.

KSCI

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments 잡음 환경하에서의 다 모델 기반인식기와 다 스타일 학습방법과의 성능비교

Performance Comparison of Multiple-Model Speech Recognizer with Multi-Style Training Method Under Noisy Environments