Noisy Speech Enhancement Based on Complex Laplacian Probability Density Function

복소 라플라시안 확률 밀도 함수에 기반한 음성 향상 기법

  • Park, Yun-Sik (School of Electronic and Electrical Engineering, Inha University) ;
  • Jo, Q-Haing (School of Electronic and Electrical Engineering, Inha University) ;
  • Chang, Joon-Hyuk (School of Electronic and Electrical Engineering, Inha University)
  • 박윤식 (인하대학교 전자전기공학부) ;
  • 조규행 (인하대학교 전자전기공학부) ;
  • 장준혁 (인하대학교 전자전기공학부)
  • Published : 2007.11.25

Abstract

This paper presents a novel approach to speech enhancement based on a complex Laplacian probability density function (pdf). With a use of goodness-of-fit (GOF) test we show that the complex Laplacian pdf is more suitable to describe the conventional Gaussian pdf. The likelihood ratio (LR) is applied to derive the speech absence probability in the speech enhancement algorithm. The performance of the proposed algorithm is evaluated by the objective test and yields better results compared with the conventional Gaussian pdf-based scheme.

본 논문에서는 복소 라플라시안 확률밀도함수 (PDF, Probability Density Function)에 기반한 새로운 음성 향상 기법을 제시한다. 적용된 복소 라플라시안 PDF가 기존의 가우시안 PDF보다 오염된 음성 분포를 정확하게 표현한다는 것을 Goodness-of-Fit (GOF) 테스트로 확인하였고, 음성 향상 알고리즘의 음성부재확률을 위해 우도비 (LR, Likelihood Ratio)를 적용하였다. 제시된 알고리즘의 성능은 객관적 테스트에 의해 평가하였고 기존의 가우시안 PDF보다 개선된 음성 향상 결과를 나타내었다.

Keywords

References

  1. N. S. Kim and J.-H. Chang, 'Spectral enhancement based on global soft decision,' IEEE Signal Processing Letters, vol. 7, no. 5, pp. 108-110, May 2000 https://doi.org/10.1109/97.841154
  2. J.-H. Chang and N. S. Kim, 'Speech enhancement : new approaches to soft decision,' em IEICE Trans. Inf. and Syst., vol. 27, E84-D, pp. 1231-1240, Sep. 2001
  3. J.-H. Chang and N. S. Kim, 'Voice activity detection based on complex Laplacian model,' Electronics Letters, vol. 39, no. 7, pp. 632-634, Apr. 2003 https://doi.org/10.1049/el:20030392
  4. J.-H. Chang, N. S. Kim and S. K. Mitra, 'Voice activity detection based on multiple statistical models,' IEEE Trans. Signal Processing, June 2006
  5. J.-H. Chang and N. S. Kim, 'A new structural approach in system identification with generalized analysis-by-synthesis for Robust Speech Coding,' IEEE Trans. Speech and Audio Processing, vol. 14, no. 3, pp. 747-751, May 2006 https://doi.org/10.1109/TSA.2005.858069
  6. J.-H. Chang, 'Perceptual weighting filter for robust speech modification,' Signal Processing, vol. 86, Issue 5, pp. 1089-1093, May 2006 https://doi.org/10.1016/j.sigpro.2005.07.025
  7. J.-H. Chang, 'Warped discrete cosine transform -based noisy speech enhancement,' IEEE Trans. Circuit and Systems II, vol. 52, issue 9, pp. 535-539, Sept. 2005
  8. J.-H. Chang and N. S. Kim, 'Distorted speech rejection for automatic speech recogntion in wireless communication,' IEICE Trans. Info. and Syst., vol. E87-D, no. 7, pp. 1978-1981, July 2004
  9. Y. Ephraim and D. Malah, 'Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator,' IEEE Trans. Acoust., Speech, Signal Processing, vol. 32, no. 6, pp. 1109-1121, Dec. 1984 https://doi.org/10.1109/TASSP.1984.1164453
  10. J. Sohn and W. Sung, 'A voice activity detector employing soft decision based noise spectrum adaptation,' in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, pp. 365-368, 1998
  11. J. Sohn, N. S. Kim and W. Sung, 'A statistical model-based voice activity detection,' IEEE Signal Processing Letters, vol. 6, no. 1, pp. 1-3, Jan. 1999
  12. I. Cohen and B. Berdugo, 'Speech enhancement for nonstationary noise environments,' Signal Processing, vol 81, pp. 2403-2418, Nov. 2001 https://doi.org/10.1016/S0165-1684(01)00128-1
  13. I. Cohen and B. Berdugo, 'Noise estimation by minima controlled recursive averaging for robust speech enhancement,' IEEE Signal Processing Letters, vol. 9, no. 1, pp. 12-15, Jan. 2002 https://doi.org/10.1109/97.988717
  14. I. Cohen, 'Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator,' IEEE Signal Processing Letters, vol. 9, no. 4, pp. 113-116, Apr. 2002 https://doi.org/10.1109/97.1001645
  15. A. G. Glen, L. M. Leemis, and D. R. Barr, 'Order statistics in goodness-of-fit testin,' IEEE Trans. Reliability., vol. 50, no. 2, pp. 209-213, June 2001 https://doi.org/10.1109/24.963129
  16. R. C. Reininger and J. D. Gibson, 'Distributions of the two dimensional DCT coefficients for images,' IEEE Trans. Commnuications., vol. Com-31, no. 6, pp. 835-839, June 1983
  17. D. R. Brillinger, Time Series: Data Analysis and Theory, New York: Holden-Day, 1981
  18. TIA/EIA/IS-127, 'Enhanced variable rate codec, speech service option 3 for wideband spectrum digital systems,' 1996
  19. R. J. McAulary and M. L. Malpass, 'Speech enhancement using a soft-decision noise suppression filter,' IEEE Trans. Acoust., Speech, Signal Processing, vol.28, pp. 137-145, Apr. 1980 https://doi.org/10.1109/TASSP.1980.1163394
  20. O. Cappe, 'Elimination of musical noise phenomenon with the Ephraim and Malah noise suppressor,' IEEE Trans. Speech and Audio Processing, vol. 2, no. 2, pp. 345-349, Apr. 1994 https://doi.org/10.1109/89.279283
  21. R. Martin, 'Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors,' Proc. of IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, pp. I253-I256, Orlando, FL. , May 2002
  22. S. Gazor and W. Zhang, 'Speech probability distribution,' IEEE Signal Processing Letters, vol. 10, no. 7, pp. 204-207, July 2003 https://doi.org/10.1109/LSP.2003.813679
  23. R. Martin, 'Noise power spctral density estimation based on optimal smoothing and minimum statistics,' IEEE Trans. Speech and Audio Processing, vol. 9, no. 5, pp. 504-512, July 2001 https://doi.org/10.1109/89.928915
  24. A. Varga and H. J. M. Steeneken, 'Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems,' Speech Communication, vol 12, no. 3, pp. 247-251, July 1993 https://doi.org/10.1016/0167-6393(93)90095-3
  25. N. Ma, M. Bouchard and R. Goubran, 'Perceptual Kalman filtering for speech enhancement in colored noise,' in Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 1, pp. 717-720, Montreal, May 2004