Browse > Article

Robust Distributed Speech Recognition under noise environment using MESS and EH-VAD  

Choi, Gab-Keun (Computer Engineering Department, Kwangwoon University)
Kim, Soon-Hyob (Computer Engineering Department, Kwangwoon University)
Publication Information
Abstract
The background noises and distortions by channel are major factors that disturb the practical use of speech recognition. Usually, noise reduce the performance of speech recognition system DSR(Distributed Speech Recognition) based speech recognition also bas difficulty of improving performance for this reason. Therefore, to improve DSR-based speech recognition under noisy environment, this paper proposes a method which detects accurate speech region to extract accurate features. The proposed method distinguish speech and noise by using entropy and detection of spectral energy of speech. The speech detection by the spectral energy of speech shows good performance under relatively high SNR(SNR 15dB). But when the noise environment varies, the threshold between speech and noise also varies, and speech detection performance reduces under low SNR(SNR 0dB) environment. The proposed method uses the spectral entropy and harmonics of speech for better speech detection. Also, the performance of AFE is increased by precise speech detections. According to the result of experiment, the proposed method shows better recognition performance under noise environment.
Keywords
Noisy speech recognition; Distributed Speech Recognition; speech recognition system;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 ETSI standard document, "Speech Processing, Transmission and Quality aspects(STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms," ETSI ES 202 050 v.1.1.3 (2003-11), Nov. 2003.
2 Scart, P., Filho, J. "Speech enhancement based on a priori signal to noise estimation", Proc. IEEE Int. Conf. Acoust., Speech Signal Process., pp. 629-632, 2002.
3 Ephraim, Y., Malah, D. "Speech enhancement Using a minimum mean square error short-time spectral amplitude estimator", IEEE Trans. Acoust., Speech Signal Process., Vol 32, pp. 1109-1121, 1984.   DOI
4 최갑근, 김순협, "엔트로피와 하모닉 검출을 이용한 잡음환경에 강인한 음성검출", 전자공학회논문지, 제47권 SP편 1호, 169-174쪽, 1229-6384, 2010년.
5 Rabiner, L. R., M. R. Sambur, "An Algorithm for Determining the Endpoints of Isolated Utterances", The Bell System Technical Journal, Vol. 54, No. 2, pp. 297-315, 1975.
6 Abdallah I., Montresor S., Baudry M, "Robust speech/non-speech detection in adverse conditions using an entropy based estimator" Digital Signal Processing Proceedings 1997, pp. 752-760, Santorini Greece, Jul 1997.
7 Ramalho, M.A. Mammone, R.J. "New speech enhancement techniques using the pitch mode modulation model" Circuits and Systems, 1993 Proceedings of the 36th Midwest Symposium, pp. 1531-1534, Detroit, USA, Aug 1993.
8 Ball, S, F., "Supression of acoustic noise in speech using spectral subtraction", IEEE Trans., Speech Signal Process., Vol. 27, pp 113-120, 1979.   DOI
9 Berouti, M., Schwartz, R., Makhoul, J. "Enhancement of speech corrupted by acoustic noise", Proc. IEEE Int. Conf. Acoustic, Speech Signal Process., pp 208-211., 1979.
10 Kamarth, S., Loizou, P. "A multi-Band spectral subtraction method for enhancing speech corrupted by colored noise", Proc. IEEE Int, Conf, Acoustic Speech Signal Process., pp 101-111,2002.
11 A. Varga and H. J. M. Steeneken, "Assessmentfor automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, vol 12, no. 3, pp. 247-251, July 1993.   DOI   ScienceOn
12 ETSI standard document, "Speech Processing, Transmission and Quality aspects(STQ); Distributed speech recognition; Front-end feature extraction algorithm; Compression algorithms," ETSI ES 201 108 v.1.1.1 (2000-02), Feb. 2002.