[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2006.25.8.383

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement

Park, Yun-Sik (인하대학교 전자전기공학부)
Chang, Joon-Hyuk (인하대학교 전자전기공학부)

Publication Information

The Journal of the Acoustical Society of Korea / v.25, no.8, 2006 , pp. 383-388 More about this Journal

Abstract

This Paper presents a novel approach to single channel microphone speech enhancement in noisy environments. Widely used noise reduction techniques based on the spectral subtraction are generally expressed as a spectral gam depending on the signal-to-noise ratio (SNR). The well-known decision-directed(DD) estimator of Ephraim and Malah efficiently reduces musical noise under the background noise conditions, but generates the delay of the a prioiri SNR because the DD weights the speech spectrum component of the Previous frame in the speech signal. Therefore, the noise suppression gain which is affected by the delay of the a priori SNR, which is estimated by the DD matches the previous frame rather than the current one, so after noise suppression. this degrades the noise reduction performance during speech transient periods. We propose a computationally simple but effective speech enhancement technique based on the sigmoid type function for the weight Parameter of the DD. The proposed approach solves the delay problem about the main parameter, the a priori SNR of the DD while maintaining the benefits of the DD. Performances of the proposed enhancement algorithm are evaluated by ITU-T p.862 Perceptual Evaluation of Speech duality (PESQ). the Mean Opinion Score (MOS) and the speech spectrogram under various noise environments and yields better results compared with the fixed weight parameter of the DD.

Keywords

a Priori SNR; Decision-Directed; Speech Enhancement; Sigmoid Type;

Citations & Related Records

Reference

1	O. Cappe, 'Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor,' IEEE Trans Speech Audio Process., 2(2) 345-349, Apr. 1994 DOI ScienceOn
2	C. You, S. N. Koh, and S. Rahardja 'Signal subspace speech enhancement for audible noise reduction', in Proc IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 1 145-148, Mar. 2005
3	N. S. Kim, J.-H. Chang, 'Spectral enhancement based on global soft decision,' IEEE Signal Processing Letters, 7(5) May 2000, 108-110 DOI ScienceOn
4	C. Plapous, C. Marro, P. Scalart, and L. Mauuary, 'A two-step noise reduction technique, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, 1 289--292
5	R. J. McAualy and M. L. Malpass, 'Speech enhancement using a soft-decision noise suppression filter,' IEEE Trans Acoust., Speech, Signal Processing, vol. ASSP-28, 137-145, Apr. 1980 DOI
6	J. Sohn, N. S. Kim, W. Sung, 'A statistical model-based voice activity detection,' IEEE Signal Processing Letters, 6(1) 1-3, Jan. 1999 DOI ScienceOn
7	I. Cohen, 'Speech enhancement using a noncausal a priori SNR estimator,' IEEE Signal Processing Letters, 11 (9) Sept. 2004. 725-728 DOI ScienceOn
8	S. F. Boll, 'Suppression of acoustic noise in speech using spectral subtraction,' IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, 2 113-120, Apr. 1979
9	N. Virag, 'Single channel speech enhancement based on masking properties of the human auditory system,' IEEE Trans. Speech and Audio Processing, 7(2) 126-137, Mar. 1999 DOI ScienceOn
10	Y. Ephraim and D. Malah, 'Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,' IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, 6 1109--1121, Dec. 1984
11	N. Ma, M. Bouchard and R. Goubran, 'Perceptual Kalman filtering for speech enhancement in colored noise,' in Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, 1 717-720, Montreal, May 2004

KSCI

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement 음성 향상에서 강인한 새로운 선행 SNR 추정 기법에 관한 연구

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement