Browse > Article
http://dx.doi.org/10.7776/ASK.2006.25.8.383

A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement  

Park, Yun-Sik (인하대학교 전자전기공학부)
Chang, Joon-Hyuk (인하대학교 전자전기공학부)
Abstract
This Paper presents a novel approach to single channel microphone speech enhancement in noisy environments. Widely used noise reduction techniques based on the spectral subtraction are generally expressed as a spectral gam depending on the signal-to-noise ratio (SNR). The well-known decision-directed(DD) estimator of Ephraim and Malah efficiently reduces musical noise under the background noise conditions, but generates the delay of the a prioiri SNR because the DD weights the speech spectrum component of the Previous frame in the speech signal. Therefore, the noise suppression gain which is affected by the delay of the a priori SNR, which is estimated by the DD matches the previous frame rather than the current one, so after noise suppression. this degrades the noise reduction performance during speech transient periods. We propose a computationally simple but effective speech enhancement technique based on the sigmoid type function for the weight Parameter of the DD. The proposed approach solves the delay problem about the main parameter, the a priori SNR of the DD while maintaining the benefits of the DD. Performances of the proposed enhancement algorithm are evaluated by ITU-T p.862 Perceptual Evaluation of Speech duality (PESQ). the Mean Opinion Score (MOS) and the speech spectrogram under various noise environments and yields better results compared with the fixed weight parameter of the DD.
Keywords
a Priori SNR; Decision-Directed; Speech Enhancement; Sigmoid Type;
Citations & Related Records
연도 인용수 순위
  • Reference
1 O. Cappe, 'Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor,' IEEE Trans Speech Audio Process., 2(2) 345-349, Apr. 1994   DOI   ScienceOn
2 C. You, S. N. Koh, and S. Rahardja 'Signal subspace speech enhancement for audible noise reduction', in Proc IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 1 145-148, Mar. 2005
3 N. S. Kim, J.-H. Chang, 'Spectral enhancement based on global soft decision,' IEEE Signal Processing Letters, 7(5) May 2000, 108-110   DOI   ScienceOn
4 C. Plapous, C. Marro, P. Scalart, and L. Mauuary, 'A two-step noise reduction technique, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, 1 289--292
5 R. J. McAualy and M. L. Malpass, 'Speech enhancement using a soft-decision noise suppression filter,' IEEE Trans Acoust., Speech, Signal Processing, vol. ASSP-28, 137-145, Apr. 1980   DOI
6 J. Sohn, N. S. Kim, W. Sung, 'A statistical model-based voice activity detection,' IEEE Signal Processing Letters, 6(1) 1-3, Jan. 1999   DOI   ScienceOn
7 I. Cohen, 'Speech enhancement using a noncausal a priori SNR estimator,' IEEE Signal Processing Letters, 11 (9) Sept. 2004. 725-728   DOI   ScienceOn
8 S. F. Boll, 'Suppression of acoustic noise in speech using spectral subtraction,' IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, 2 113-120, Apr. 1979
9 N. Virag, 'Single channel speech enhancement based on masking properties of the human auditory system,' IEEE Trans. Speech and Audio Processing, 7(2) 126-137, Mar. 1999   DOI   ScienceOn
10 Y. Ephraim and D. Malah, 'Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,' IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-32, 6 1109--1121, Dec. 1984
11 N. Ma, M. Bouchard and R. Goubran, 'Perceptual Kalman filtering for speech enhancement in colored noise,' in Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, 1 717-720, Montreal, May 2004