Browse > Article

Design of the Noise Suppressor Using the Perceptual Model and Wavelet Packet Transform  

Kim, Mi-Seon (충북대학교 전파공학과)
Park, Seo-Young (충북대학교 전파공학과)
Kim, Young-Ju (충북대학교 전파공학과)
Lee, In-Sung (충북대학교 전파공학과)
Abstract
In this paper. we Propose the noise suppressor with the Perceptual model and wavelet packet transform. The objective is to enhance speech corrupted colored or non-stationary noise. If corrupted noise is colored. subband approach would be more efficient than whole band one. To avoid serious residual noise and speech distortion, we must adjust the Wavelet Coefficient Threshold (WCT). In this Paper. the subband is designed matching with the critical band and WCT is adapted noise masking threshold (NMT) and segmental signal to noise ratio (seg_SNR). Consequently. it has similar Performance with EVRC in PESQ-MOS. But it's better than wavelet packet transform using universal threshold about 0.289 in PESQ-MOS. The important thing is that it's more useful than EVRC in coded speech. In coded speech. PESQ-MOS is higher than EVRC about 0.23.
Keywords
Speech enhancement; Noise suppression; Wavelet Packet transform; Perceptual model;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 윤석현, 유창동, '시간-주파수 영역에서 음성/잡음 우세 결정에 의한 새로운 잡음처리', 한국음향학회지 20 48-55, 2001
2 S. Yoon, C. D. Yoo. 'Speech/Noise- dominant detection for speech enhancement.', In: Proc. European Conf. on Speech Communication and Technology, 1941-1944,2001
3 M. Bahoura, J. Rouat, 'Wavelet speech enhancement based on the teaqer energy operator.', IEEE Signal Process. Lett. 8 10 -12. 2001   DOI   ScienceOn
4 N. Virag, 'Single channel speech enhancement based on masking properties of the human auditory system', IEEE Trans. Signal Process. Mag. 7 126-137, 1999
5 S. W. Chang, S. I. Jung. Y. H. Kwon, 'Adaptive Wavelet Based Speech Enhancement with Robust VAD in Non-stationary Noise Environment', The Journal of the Acoustical Society of Korea 22 161-166, 2003
6 N. R. Chong, I. S. Burnett, J. F. Chicharo, 'A new waveform interpolation coding scheme based on pitch synchronous wavelet transform decomposition', IEEE Trans. speech Audio Process. 8 345-348, 2000   DOI   ScienceOn
7 D. L. Donoho, I. M. Johnston, 'Ideal spatial adaptation by wavelet shrinkage.', Biometrika, 81 425-455, 1994   DOI   ScienceOn
8 D. L. Donoho, 'De-noising by soft- thresholding', IEEE Trans. Inform. Theory 41 613-627, 1995   DOI   ScienceOn
9 J. Lukasiak, I. S. Burnett, 'Exploiting simultaneously masked linear prediction in a WI speech coder.', In: Int. Conf. on Acoustics, Speech and Signal Processing(ICASSP), 11 -13, Istanbul, June, 2000
10 H. Sheikhzadeh, H. R. Abutalebi, 'An improved waveletbased speech enhancement system.', In: Proc. European Conf. on Speech Communication and Technology, 1855 -1858, Aalbors, September, 2001
11 D. Sinha, A. H. Tewfik, 'Low bit rate transparent audio compression using adapted wavelets.', IEEE Trans. Signal Process. 41 3463-3479, 1993   DOI   ScienceOn
12 B. Carnero, A. Drygajlo, 'Perceptual speech coding and enhancement using frame-synchronized fast wavelet packet transform algorithm.', IEEE Trans. Signal Process. 47 1622 -1635, 1999
13 C. T. Lu, H. C. Wang, 'Enhancement of single channel speech based on masking property and wavelet transform', Speech Commun. Mag. 41 409-427, 2003   DOI
14 P. Srinivasan, L. Jamieson, 'High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling.', IEEE Trans. Signal Process. 46 1085 -1093, 1998   DOI   ScienceOn
15 J. D. Johnston, 'Transform coding of audio signal using perceptual noise criteria.', IEEE J. Select. Area Commun. 6, 314-323, 1988   DOI   ScienceOn
16 S. Mallat, A Wavelet Tour of Signal Processing(A Harcout Science and Technology. Academic-Press, SanDiego, 1999)