Browse > Article
http://dx.doi.org/10.7776/ASK.2011.30.3.123

Improved Minimum Statistics Based on Environment-Awareness for Noise Power Estimation  

Son, Young-Ho (인하대학교 전자공학부)
Choi, Jae-Hun (인하대학교 전자공학부)
Chang, Joon-Hyuk (한양대학교 융합전자공학부)
Abstract
In this paper, we propose the improved noise power estimation in speech enhancement under various noise environments. The previous MS algorithm tracking the minimum value of finite search window uses the optimal power spectrum of signal for smoothing and adopts minimum probability. From the investigation of the previous MS-based methods it can be seen that a fixed size of the minimum search window is assumed regardless of the various environment. To achieve the different search window size, we use the noise classification algorithm based on the Gaussian mixture model (GMM). Performance of the proposed enhancement algorithm is evaluated by ITU-T P.862 perceptual evaluation of speech quality (PESQ) under various noise environments. Based on this, we show that the proposed algorithm yields better result compared to the conventional MS method.
Keywords
Minimum Statistics (MS); Gaussian mixture model (GMM); Environment-awareness;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J.-H. Chang and N. S. Kim, "Voice activity detection based on complex Laplacian. model," IEE Electronics Letters, vol. 39, no. 7, pp. 632-634, Apr. 2003.   DOI   ScienceOn
2 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
3 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. ASSP, vol. 33, no. 2, pp 443-445, Apr. 1985.   DOI
4 B. L. Sim, Y. C. Tong, J. S. Chang, and C. T. Tan, "A parametric formulation of the generalized spectral subtraction method," IEEE Trans. on Speech and Audio Processing, vol. 6, no. 4, pp. 328-336, July 1998.   DOI   ScienceOn
5 R. J. McAualy and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 137-145, Apr. 1980.
6 R. Martin, "Spectral subtraction based on minimum statistics," in Proc. Eur. Signal Processing Conf., pp. 1182-1185, 1994.
7 J. Sohn, W. Sung, "A voice activity detector employing soft decision based noise spectrum adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, pp. 365-368, 1998.
8 G. Doblinger, "Computationally efficient speech enhancement by spectral minima tracking in subbands," in Proc. EUROSPEECH, vol. 2, pp. 1513-1516, 1995.
9 J. W. Shin, J.-H. Chang, and N. S. Kim, "Statistical modeling of speech signals based on generalized gamma distribution," IEEE Signal Processing Letters, vol. 12, no. 3, pp. 258-261, Mar. 2005.   DOI   ScienceOn
10 I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, vol. 9, no. 1, pp. 12-15, Jan. 2002.   DOI   ScienceOn
11 R. Martin, "Noise Power Spectral Density Estimation Based. on Optimal Smoothing and Minimum Statistics," IEEE. Trans. on Speech and Audio Processing, vol. 9, no. 5, pp. 504-512, Jul. 2001.   DOI   ScienceOn
12 G. Xuan, W.Zhang, and P. Chai, "EM algorithm of gaussian mixture model and hidden Markov model," in Proc. IEEE International Conference on Image Processing, vol. 1, pp. 145-148, Oct. 2001.
13 D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, Jan. 2000.   DOI   ScienceOn
14 N. S. Kim and J.-H. Chang, "Spectral enhancement based on global soft decision," IEEE Signal Processing Letters, vol. 7, no. 5, pp. 108-110, May 2000.   DOI   ScienceOn