Browse > Article
http://dx.doi.org/10.7776/ASK.2010.29.3.200

Non-Stationary/Mixed Noise Estimation Algorithm Based on Minimum Statistics and Codebook Driven Short-Term Predictor Parameter Estimation  

Lee, Myeong-Seok (세종대학교 정보통신공학과)
Noh, Myung-Hoon (세종대학교 정보통신공학과)
Park, Sung-Joo (전자부품연구원 디지털미디어연구센터)
Lee, Seok-Pil (전자부품연구원 디지털미디어연구센터)
Kim, Moo-Young (세종대학교 정보통신공학과)
Abstract
In this work, the minimum statistics (MS) algorithm is combined with the codebook driven short-term predictor parameter estimation (CDSTP) to design a speech enhancement algorithm that is robust against various background noise environments. The MS algorithm functions well for the stationary noise but relatively not for the non-stationary noise. The CDSTP works efficiently for the non-stationary noise, but not for the noise that was not considered in the training stage. Thus, we propose to combine CDSTP and MS. Compared with the single use of MS and CDSTP, the proposed method produces better perceptual evaluation of speech quality (PESQ) score, and especially works excellent for the mixed background noise between stationary and non-stationary noises.
Keywords
Minimum statistics; noise estimation; speech enhancement; non-stationary noise; mixed noise;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. Speech Audio processing. vol. 9, no. 5, pp. 504-512, 2001.   DOI   ScienceOn
2 S. Kamath, and P. Loizou, "A multi-band spectral subtraction method for enhancing speech corrupted by colored noise," Proc. IEEE Int. Conf. Acousf. Speech Signal Processing, 2002.
3 L. Singh, and S. Sridharan, "Speech enhancement using critical band spectral subtraction," in Proc, Intern. Conf. Spoken Lang, Processing, pp. 2827-2830, 1998.
4 S. Srinivasan, J. Samuelsson, and W. B. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Trans. Speech Audio processing, vol. 14, issue 1. pp.163-176, 2006.   DOI
5 S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 27, no. 2, pp. 113-120, 1979.   DOI
6 ETSI ES 202 050, Speech Processing, Transmission and Quality Aspects (STQ); Distributed speech recognition; Advanced ront-end feature extraction algorithm; Compression algorithms, 2007.
7 A. Rix, J. Beerends, M. Hollier, and A. Hekstra, "Perceptual evaluation of speech quality (PESQ) - A new method for speech quality assessment of telephone networks and codecs," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, col. 2, pp. 749-752, 2001
8 R. M. Gray, A. Buzo, A. H. G Jr. and Y. Matsuyama, "Distortion measures for speech processing," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 28, no. 4, pp. 367-376, 1980   DOI
9 T. Sreenicas and P. Kirnapure, "Codebook constrained Wiener filtering for speech enhancement," IEEE Trans. Speech Audio Processing, col. 4, no. 5, pp. 383-389, 1996.   DOI   ScienceOn
10 박윤식, 장준혁, "강인한 음성향상을 위한 Minimum Statistics와 Soft Decision의 확률적 결합의 새로운 잡음전력 추정기법," 한국음향학회지, 26권, 4호, 153-158쪽, 2007.   과학기술학회마을
11 M. Kuropatwinski and W. B. Kleijn, "Estimation of the excitation variances of speech and noise AR-models for enhanced speech coding," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, col. 1, pp. 669-672, 2001