Critical Banded Wavelet Packet-Based Spectral Subtractions for Speech Enhancement

Chang, Sung-Wook;Yang, Sung-Il;

The Journal of the Acoustical Society of Korea

제23권4E호
/
Pages.125-133
/
2004
/
1225-4428(pISSN)

한국음향학회 (The Acoustical Society of Korea)

음성신호개선을 위한 임계대역 웨이블렛 패킷 기반의 스펙트럼 차감법

Critical Banded Wavelet Packet-Based Spectral Subtractions for Speech Enhancement

Chang, Sung-Wook (School of Electrical and Computer Engineering, Hanyang University) ;
Yang, Sung-Il (School of Electrical and Computer Engineering, Hanyang University)

발행 : 2004.12.01

PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

In this paper, we propose a critical banded wavelet packet-based spectral subtraction for speech enhancement. Critical banded wavelet packet, which reflects the human auditory system, may lead to minimization of intelligibility loss and quality improvement of the enhanced speech in the spectral domain, when combined with an appropriate spectral subtraction gain function. The proposed method shows better performance than the conventional one in comparative assessments. We also show that, for effective evaluation of enhanced speech, it is essential to consider the characteristics of speech quality measures.

키워드

참고문헌

Y. Ephraim and H. L. Van Trees, 'A Signal Subspace Approach forSpeech Enhancement,' IEEE Trans. on Speech and Audio Processing,3(4),pp.251-266,JuIy, 1995 https://doi.org/10.1109/89.397090
M. Klein and P. Kabal, 'Signal Subspace Speech Enhancement withPerceptual Post filtering,' Proc. IEEE Int. Conf. Acoustics Speechand Signal Processing, pp. 537-540 May 2002
F. Jabloun and B. Champagne, 'A Perceptual Signal SubspaceApproach for Speech Enhancement in Colored Noise,' Proc IEEE Int.Conf. Acoustics, Speech and Signal Processing, 1 pp. 569-572 2002
Y. Hu and P. Loizou, 'Perceptual Weighting Motivated Subspacebased Speech Enhancement Approach,' Proc. of Int Conf onSpoken Language Processing, pp. 1797-1800, 2002
S. Chang, S. Jung, Y. Kwon, and S. Yang, 'Speech Enhancementusing Wavelet Packet Transform,' Proc. of Int. Conf. on SpokenLanguage Processing, pp. 1809-1812, 2002
S. Chang, Y. Kwon, S. Jung, S. Yang and K. Lee, 'SpeechEnhancement using Level Adapted Wavelet Packet with AdaptiveNoise Estimation,' The Joumal of the Acoustical Society of Korea22(2E), pp. 87-92, 2003
S. Chang, Y. Kwon, SU Jung and S. Yang, 'Adaptive Wavelet basedSpeech Enhancement wlth Robust VAD in NonStationary NoiseEnvironment,' The Journal of the Acoustlcal Society of Korea, 22(4E)2003
M. V. Wickerhauser, Adapted Wavelet Analysis from Theory to Software, AK Peters, 1994
I. Cohen, 'Enhancement of Speech using Bark-ScaIed Wavelet Packet Decomposition,' EUROSPEECH 2001 pp 3-7 2001
H. G. Hirsch, 'Estimation of Noise Spectrum and its Application toSNR Estimation and Speech Enhancement,' Technical Report TR-93-012, International Computer Science Institute, Berkeley USA 1993
P. Noll, 'Adaptive Quantization in Speech Coding Systems,' Proc.Int. Zurich Seminar on Dlgital Communications, pp. B3.1-B3.6 Oct.1974
S. R. Quackenbush, T. P. Barnwell, M. A. Clements, ObjectiveMeasures of Speech Quality, Prentice-Hall, NJ, 1988
S. Wang and A. Gersho, 'An Objective Measure for PredictingSubjective Quality of Speech Coders,' IEEE Journal on SelectedAreas in Communications, 10(5), June, 1992
D. G. Jamieson, L. Deng, M. Price, V. Parsa and J. Till, 'Interactionof Speech Disorders with Speech Coders: Effects on SpeechIntelligibility,' Proc. of Int. Conf. on Spoken Language Processing, pp.737-740, 1996
J. H. L. Hansen and B. L. Pellom, 'An Effective Quality EvaluationProtocol for Speech Enhancement Algorithms,'Proc. of Int. Conf. onSpoken Language Processing , pp. 2819-2822, 1998
J. Deller, J. Proakis, J. H. L. Hansen, Discrete-Time Processing of Speech Signals, McMillan Series for Prentice Hall, New York, NY,1993
T. P. Barnwell and W. D. Voiers, 'An analysis of objective measuresfor user acceptance of voice communication systems,' DCA FinalTechnical Report, No. DCA100-78-C-0003, Sept. 1979
T. P. Barnwell, M. A. Clements, S. R. Quackenbush et. al, 'Improvedobjective measures for speech quality testing,' DCA Final Technical Report, No DCA100-83-C-0027, Sept. 1984
T. P. Barnwell, 'Improved objective quality for low bit speechcompression,' National Science Foundation, Final Technical Report, ECS-8016712, 1985
D Klatt, 'Prediction of Perceived Phonetic Distance from CriticalBand Spectra: A First Step,' Proc. IEEE Int. Conf. Acoustics, Speechand Signal Processing, pp. 1278-1281, 1982
ITU-T Recommendation P.862, Perceptual Evaluation of Speech Quality (PESQ), International Telecommunication Union, Feb. 2001
P. Kabal, 'Demo and Matlab Toolbox for [2],' http://www.tsp.ece.mcgill.ca/Kabal/Papers/2002/K1einC2002-demo/
B. Pellom, 'Matlab toolbox for Objective Speech QualityAssessment,' CSLU Robust Speech Processing Laboratory, http://cslr.colorado.edu/rspl/rspl_software.html
P. E. Papamichalis, Practical Approaches to Speech Coding,Prentice Hall, England Cliffs, NJ, 1987

The Journal of the Acoustical Society of Korea

음성신호개선을 위한 임계대역 웨이블렛 패킷 기반의 스펙트럼 차감법

Critical Banded Wavelet Packet-Based Spectral Subtractions for Speech Enhancement

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)