Noisy Speech Enhancement Based on Complex Laplacian Probability Density Function

Park, Yun-Sik; Jo, Q-Haing; Chang, Joon-Hyuk

Journal of the Institute of Electronics Engineers of Korea SP (대한전자공학회논문지SP)

Volume 44, Issue 6, pp. 111-117, 2007, pISSN 1229-6384

The Institute of Electronics and Information Engineers (대한전자공학회)

복소 라플라시안 확률 밀도 함수에 기반한 음성 향상 기법

Park, Yun-Sik (School of Electronic and Electrical Engineering, Inha University) ;
Jo, Q-Haing (School of Electronic and Electrical Engineering, Inha University) ;
Chang, Joon-Hyuk (School of Electronic and Electrical Engineering, Inha University)

박윤식 (인하대학교 전자전기공학부) ;
조규행 (인하대학교 전자전기공학부) ;
장준혁 (인하대학교 전자전기공학부)

Published : 2007.11.25

Abstract

This paper presents a novel approach to speech enhancement based on a complex Laplacian probability density function (pdf). Using a goodness-of-fit (GOF) test, we show that the complex Laplacian pdf describes the distribution of noisy speech more accurately than the conventional Gaussian pdf. The likelihood ratio (LR) is then applied to derive the speech absence probability used in the speech enhancement algorithm. The proposed algorithm is evaluated with objective tests and yields better results than the conventional Gaussian pdf-based scheme.
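The two ingredients named in the abstract, a goodness-of-fit comparison and a likelihood-ratio-based speech absence probability, can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes that the real and imaginary parts of a spectral coefficient are independent zero-mean Laplacian variables, that the same pdf family is used under both the speech-absent and speech-present hypotheses, and that a Kolmogorov-Smirnov statistic stands in for the GOF test (the test used in the paper may differ). All function and parameter names are hypothetical.

```python
import math
import random


def laplacian_log_pdf(x, var):
    """Log-pdf of a zero-mean complex Laplacian with total variance `var`.

    Assumption: real and imaginary parts are independent Laplacian variables,
    each with variance var / 2, which gives
    p(x) = (1 / var) * exp(-2 * (|Re x| + |Im x|) / sqrt(var)).
    """
    return -math.log(var) - 2.0 * (abs(x.real) + abs(x.imag)) / math.sqrt(var)


def gaussian_log_pdf(x, var):
    """Log-pdf of a zero-mean circular complex Gaussian with variance `var`."""
    return -math.log(math.pi * var) - abs(x) ** 2 / var


def speech_absence_probability(x, noise_var, speech_var, log_pdf, prior_ratio=1.0):
    """P(H0 | x) = 1 / (1 + q * LR), where LR = p(x | H1) / p(x | H0).

    H0: noise only (variance noise_var); H1: speech plus noise.
    prior_ratio is q = P(H1) / P(H0), an assumed tuning parameter.
    """
    log_lr = log_pdf(x, noise_var + speech_var) - log_pdf(x, noise_var)
    z = math.log(prior_ratio) + log_lr
    if z > 700.0:  # exp(z) would overflow; the probability is effectively 0
        return 0.0
    return 1.0 / (1.0 + math.exp(z))


def ks_statistic(samples, cdf):
    """Kolmogorov-Smirnov distance between the empirical CDF and `cdf`."""
    xs = sorted(samples)
    n = len(xs)
    return max(max((i + 1) / n - cdf(x), cdf(x) - i / n) for i, x in enumerate(xs))


def laplace_cdf(x, b):
    """CDF of a zero-mean Laplacian with scale b."""
    return 0.5 * math.exp(x / b) if x < 0 else 1.0 - 0.5 * math.exp(-x / b)


def gauss_cdf(x, sigma):
    """CDF of a zero-mean Gaussian with standard deviation sigma."""
    return 0.5 * (1.0 + math.erf(x / (sigma * math.sqrt(2.0))))


def compare_fits(samples):
    """Return (KS distance to fitted Laplacian, KS distance to fitted Gaussian)."""
    n = len(samples)
    b = sum(abs(x) for x in samples) / n               # Laplacian scale (ML estimate)
    sigma = math.sqrt(sum(x * x for x in samples) / n)  # Gaussian std (zero mean)
    return (ks_statistic(samples, lambda x: laplace_cdf(x, b)),
            ks_statistic(samples, lambda x: gauss_cdf(x, sigma)))


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-in for one real-valued spectral component: the difference
    # of two unit-rate exponentials is Laplacian with scale 1.
    data = [rng.expovariate(1.0) - rng.expovariate(1.0) for _ in range(5000)]
    print("KS (Laplacian, Gaussian):", compare_fits(data))
    print("SAP, Laplacian model:",
          speech_absence_probability(1 + 1j, 1.0, 4.0, laplacian_log_pdf))
    print("SAP, Gaussian model:",
          speech_absence_probability(1 + 1j, 1.0, 4.0, gaussian_log_pdf))
```

In this sketch, a smaller KS distance means a better fit, so the first function pair mirrors the abstract's comparison of the two pdfs, while `speech_absence_probability` shows how the same LR machinery can be driven by either model simply by swapping the `log_pdf` argument. On real data the samples would be the real or imaginary parts of the noisy-speech spectrum rather than synthetic draws.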
