Browse > Article
http://dx.doi.org/10.5370/KIEE.2012.61.12.1920

Speech Enhancement Based on IMCRA Incorporating noise classification algorithm  

Song, Ji-Hyun (Dept. of Electrical Engineering, Inha University)
Park, Gyu-Seok (Dept. of Electrical Engineering, Inha University)
An, Hong-Sub (Dept. of Electrical Engineering, Inha University)
Lee, Sang-Min (Dept. of Electrical Engineering, Inha University)
Publication Information
The Transactions of The Korean Institute of Electrical Engineers / v.61, no.12, 2012 , pp. 1920-1925 More about this Journal
Abstract
In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA) in non-stationary noisy environment. The conventional IMCRA algorithm efficiently estimate the noise power by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. Since the minimum of smoothing parameter is defined as 0.85, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. For this reason, we proposed the modified IMCRA, which adaptively estimate and updata the noise power according to the noise type classified by the Gaussian mixture model (GMM). The performances of the proposed method are evaluated by perceptual evaluation of speech quality (PESQ) and composite measure under various environments and better results compared with the conventional method are obtained.
Keywords
Improved minima controlled recursive averaging(IMCRA); Speech enhancement; GMM;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(2), pp.443-445, Apr. 1985.
2 Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Transactions on Acoustics, Speech and Siganl Processing, ASSP-32(6), pp.1109-1121, Dec. 1984.
3 Y. Hu and P. C. Loizou, "Evaluation of objective quality measures for speech enhancement," IEEE Transactions on Audio, Speech and Language Processing, pp.229-238 Jan. 2008.
4 I. Cohen and B. Berdugo, " Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, pp.12-15, Jan. 2002
5 I. Cohen, "Noise spectrum estimation in adverse environments : improved minima controlled recursive averaging," IEEE Transactions on Speech and Audio Processing, pp.466-475, Sep. 2003.
6 P. Renevey and A, Drygajlo, "Entropy based voice activity detection in very noisy conditions." In Eurospeech 2001, pp. 1887-1890, Sep. 2001.
7 M. Asgari, A. Sayadian, M. Farhadloo and E. A. Mehrizi, "Voice activity detection using entropy in spectral domain," Telecommunication Networks and Applications Conference 2008, pp. 407-410, Dec. 2008.
8 J. Lei, J. Wang and Z, Yang, "A robust voice activity detection algorithm in nonstationary noise," International Conference on Industrial and Information Systems 2009, pp. 195-198, Dec. 2009.
9 Y. H. Son, S. M. Lee, "Improved speech absence probability estimation based on environmental noise classification," Journal of Central South University., Vol. 19, No. 9, pp 2548-2553, September. 2012.   DOI   ScienceOn
10 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Transactions on Acoustics, Speech and Signal Processing, pp.113-120, Apr. 1979.
11 I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environment," Signal Processing, pp.2403-2418, Nov. 2001.
12 G. Doblinger, "Computationally efficient speech enhancement by spectral minima tracking in subbands," Proc. 4th European Conf. Speech, Communication and Technology, EUROSPEECH'95, pp.1513-1516, Sep. 1995.
13 R. Martin, "Spectral subtraction based on minimum statistics," Proceeding of 7th EUSIPCO'94, Edinburgh, U.K., pp.1182-1185, Sep. 1994.
14 I. Cohen and B. Berdugo, "Spectral enhancement by tracking speech presence probability in subbands," Proc. IEEE Workshop on Hands Free Speech Communication, HSC'01, Kyoto, Japan, pp.95-98, Apr. 2001.