[KSCI] Korea Science Citation Index Service

Spectral Subtraction Using Spectral Harmonics for Robust Speech Recognition in Car Environments

Beh, Jounghoon (Dept. of Electronics and Computer Engineering, Korea University)
Ko, Hanseok (Dept. of Electronics and Computer Engineering, Korea University)

Publication Information

The Journal of the Acoustical Society of Korea / v.22, no.2E, 2003 , pp. 62-68 More about this Journal

Abstract

This paper addresses a novel noise-compensation scheme to solve the mismatch problem between training and testing condition for the automatic speech recognition (ASR) system, specifically in car environment. The conventional spectral subtraction schemes rely on the signal-to-noise ratio (SNR) such that attenuation is imposed on that part of the spectrum that appears to have low SNR, and accentuation is made on that part of high SNR. However, these schemes are based on the postulation that the power spectrum of noise is in general at the lower level in magnitude than that of speech. Therefore, while such postulation is adequate for high SNR environment, it is grossly inadequate for low SNR scenarios such as that of car environment. This paper proposes an efficient spectral subtraction scheme focused specifically to low SNR noisy environment by extracting harmonics distinctively in speech spectrum. Representative experiments confirm the superior performance of the proposed method over conventional methods. The experiments are conducted using car noise-corrupted utterances of Aurora2 corpus.

Keywords

Robust speech recognition; Spectral subtraction;

Citations & Related Records

Reference

1	J. Jensen, and J. Hansen, 'Speech enhancement using a constrained iterative sinusoidal model,' IEEE Transactions on Speech and Audio Processing, 9 (7), 731-740, 2001 DOI ScienceOn
2	D. Ealey, H. Kellher, and D. Pearce, 'Harmonic tunneling:t rack-ing non-stationary noises during speech,' Eurospeech, 437-440, 2001
3	N. Virag, 'Single channel speech enhancement based on masking properties of the human auditory system,' IEEE Transactions on Speech and Audio Processing, 7(2), 126-137, 1999 DOI ScienceOn
4	L. Rabiner, and R. Schafer, Digital Processing of Speech Signals, Prentice-Hall, 1978
5	P. Lockwood, and J. Boudy, 'Experiments with a Nonlinear Spectral Subtractor (NSS), hidden markov models and the projection, for robust speech recognition in cars,' Speech Communication, 11, 215-228, 1992 DOI ScienceOn
6	S. F. Boll, 'Suppression of acoustic noise in speech using spectral subtraction,' IEEE Transaction on Acoustics, Speech and Signal Processing, 27(2), 113-120, 1979 DOI
7	M. Berouti, R. Schwartz, and J. Makhoul, 'Enhancement of speech corrupted by additive noise,' Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing, 208-211, 1979
8	W. Hess, Pitch Determination of Speech Signals, Springer Verlag, 1983

1	Implementation of Embedded Speech Recognition System for Supporting Voice Commander to Control an Audio and a Video on Telematics Terminals / [Kwon, Oh-Il;Lee, Heung-Kyu;] / Journal of the Institute of Electronics Engineers of Korea TC
2	Preprocessing Technique for Improvement of Speech Recognition in a Car / [Kim, Hyun-Tae;Park, Jang-Sik;] / The Journal of the Korea Contents Association
3	Footstep Detection in Noisy Environment via Non-Linear Spectral Subtraction and Cross-Correlation / [Kim, Tae-Bok;Ko, Hanseok;] / The Journal of Korean Institute of Communications and Information Sciences