DOI QR코드

DOI QR Code

Transient Noise Reduction in Speech Signal Utilizing a Long-term Predictor

장구간 예측 필터를 이용한 음성 신호에서의 돌발 잡음 제거

  • 최민석 (연세대학교 전기전자공학과) ;
  • 강홍구 (연세대학교 전기전자공학과)
  • Received : 2011.09.29
  • Accepted : 2011.11.23
  • Published : 2012.01.31

Abstract

This paper presents a transient noise reduction system in a speech signal. The proposed transient noise reduction system utilizes a median filter to reduce the transient noise. Since the median filter can distort speech during the noise reduction, a long-term prediction (LTP) filter is adopted as a pre-processor to minimize speech distortion. The speech information preserved by the LTP filter is re-synthesized after reducing the noise. This paper verifies the weakness of a linear prediction (LP) filter and the superiority of the LTP filter for preserving the speech component in transient noise presence environment. Applying the proposed system, the signal-to-noise ratio (SNR) of output is improved by 8dB in both speech and noise presence region, and PESQ score is increased by 1 point comparing with noisy input.

본 논문에서는 음성 신호에 더해진 돌발 잡음을 제거하는 시스템을 제안한다. 제안한 돌발 잡음 제거 시스템은 중앙값 필터를 이용하여 돌발 잡음을 제거한다. 중앙값 필터는 잡음을 제거하는 과정에서 음성을 왜곡시킬 수 있기 때문에, 음성의 왜곡을 최소화하기 위하여 장구간 예측 필터를 전처리단으로 사용한다. 장구간 예측 필터로 보존된 음성 정보는 잡음이 제거된 후 다시 합성된다. 본 논문에서는 돌발 잡음이 존재하는 환경에서 음성의 정보를 보존하는데 있어 단구간 예측 필터의 문제점을 밝히고 장구간 예측 필터의 우수함을 보인다. 제안한 돌발 잡음 제거 시스템의 출력 신호는 입력 신호에 비해 음성이 존재하는 구간에서 신호 대 잡음비가 약 8dB 향상 되었으며, PESQ 점수가 약 1점 증가하였다.

Keywords

References

  1. S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. on Acoust., Speech, and Signal Process., vol. 27, no. 2, pp. 113- 120, 1979. https://doi.org/10.1109/TASSP.1979.1163209
  2. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. on Acoust., Speech, and Signal Process., vol. 33, no. 2, pp. 443-445, 1985. https://doi.org/10.1109/TASSP.1985.1164550
  3. P. C. Loizou, Speech enhancement, Theory and practice, CRC Press, 2007.
  4. I. Cohen and B. Berdugo, "Speech enhancement for non-stationary noise environments," Signal Process., vol. 81, Issue 11, pp. 2401-4218, 2001.
  5. S. V. Vaseghi, Advanced digital signal processing and noise reduction, 2nd ed., John Wiley & Sons, 2000.
  6. T. Kasparis and J. Lane, "Suppression of impulsive disturbances from audio signals," Electronics letters, vol. 29, no. 22, pp. 1926-1927, 1993. https://doi.org/10.1049/el:19931282
  7. A. J. Efron and H. Jeen, "Detection in impulsive noise based on robust whitening," IEEE Trans. on Signal Process., vol. 42, no. 6, pp. 1572-1576, 1994. https://doi.org/10.1109/78.286980
  8. S. R. Kim and A. Efron, "Adaptive robust impulse noise filtering," IEEE Trans. on Signal Process., vol. 43, no. 8, pp. 1855-1866, 1995. https://doi.org/10.1109/78.403344
  9. I. Kauppinen, "Methods for detecting impulsive noise in speech and audio signals," in Proc. IEEE Int. Conf. on Digital Signal Process. 2002, vol. 2, pp. 967-970, 2002.
  10. A. M. Kondoz, Digital speech - coding for low bit rate communication systems, John Wiley & Sons, 1994.
  11. ITU-T, ITU-T recommendation G. 729, 1996.
  12. R. Talmin, I. Cohen, and S. Gannot, "Speech enhancement in transient noise environment using diffusion filtering," in Proc. IEEE Int. Conf. on Acoust., Speech, Signal Process. 2010, pp. 4782-4785, 2010.
  13. J. Beh, K. Kim and H. Ko, "Noise estimation for robust speech enhancement in transient noise environment," KSCSP 2007, vol. 24, no. 1, pp. 35-36, 2007.
  14. M. S. Choi, H. S. Shin, and Y. S. Hwang, and H. G. Kang, "Time-frequency domain impulsive noise detection system in speech signal," The journal of the Acoust. Society of Korea, vol. 30, no. 2, pp. 73-79, 2011. https://doi.org/10.7776/ASK.2011.30.2.073
  15. A. Papoulis and S. U. Piillai, Probability, random variables and stochastic processes, forth edition, Mc Graw Hill, 2002.
  16. J. Benesty, S. Makino, and J. Chen, Speech enhancement, Springer, 2005.
  17. ITU-T, ITU-T Recommendation P.862. Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs, 2001.

Cited by

  1. Transient noise reduction in speech signal with a modified long-term predictor vol.2011, pp.1, 2011, https://doi.org/10.1186/1687-6180-2011-141