Browse > Article

Speech Enhancement Based on Improved Minima Controlled Recursive Averaging Incorporating GSAP  

Song, Ji-Hyun ((Department of Electronic Engineering, Inha University)
Bang, Dong-Hyeouck ((Department of Electronic Engineering, Inha University)
Lee, Sang-Min ((Department of Electronic Engineering, Inha University)
Publication Information
Abstract
In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA). From an examination for various noise environment, it is shown that the IMCRA has a fundamental drawback for the noise power estimate at the offset region of continuity speech signals. Espectially, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. To overcome the drawback, we apply the global speech absence probability (GSAP) conditioned on both a priori SNR and a posteriori SNR to the speech detection algorithm of IMCRA. With the performance criteria of the ITU-T P.862 perceptual evaluation of speech quality (PESQ) and a composite measure test, we show that the proposed algorithm yields better results compared to the conventional IMCRA-based scheme under various noise environments. In particular, in the case of babble 5 dB, the proposed method produced a remarkable improvement compared to the IMCRA ( PESQ = 0.026, composite measure = 0.029 ).
Keywords
Citations & Related Records
연도 인용수 순위
  • Reference
1 S. F. Boll. "Suppression of acousitc noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-27(2), pp.113-120, Apr. 1979.
2 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, pp.113-120, Apr. 1979.
3 I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environment," Signal Processing, pp.2403-2418, Nov. 2001.
4 G. Doblinger, "Computationally efficient speech enhancement by spectral minima tracking in subbands," Proc. 4th European Conf. Speech, Communication and Technology, EUROSPEECH'95, pp.1513-1516, Sep. 1995.
5 R. Martin, "Spectral subtraction based on minimum statistics," Proceeding of 7th EUSIPCO'94, Edinburgh, U.K., pp.1182-1185, Sep. 1994.
6 I. Cohen and B. Berdugo, "Spectral enhancement by tracking speech presence probability in subbands," Proc. IEEE Workshop on Hands Free Speech Communication, HSC'01, Kyoto, Japan, pp.95-98, Apr. 2001.
7 I. Cohen and B. Berdugo, " Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, pp.12-15, Jan. 2002
8 I. Cohen, "Noise spectrum estimation in adverse environments : improved minima controlled recursive averaging," IEEE Transactions on Speech and Audio Processing, pp.466-475, Sep. 2003.
9 R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans Acoustic, Speech and Audio Processing, pp.504-512, Jul. 2001
10 S. Rangachari, P. C. Loizou and Y. Hu, "A noise estimation algorithm with rapid adaptation for highly nonstationary environments," IEEE Conf. Acoustic, Speech Signal Processing, pp.305-308. May 2004.
11 N. S. Kim and J. H. Chang, "Spectral enhancement based on global soft decision," IEEE Siganl Processing Letters, pp.108-110, May. 2000.
12 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(6), pp.1109-1121, Dec. 1984.
13 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(2), pp.443-445, Apr. 1985.
14 Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Transactions on Audio, Speech and Language Processing, pp.229-238 Jan. 2008.