An Adaptive Speech Enhancement System Using Lateral Inhibition and Time-Delay Neural Network

  • Published: 2008.03.25

Abstract

This paper proposes an adaptive speech enhancement system, modeled on the auditory system, for enhancing speech degraded by various background noises. The proposed system detects voiced and unvoiced sections, adaptively adjusts the lateral inhibition coefficients and the amplitude component adjustment coefficient for each input frame according to the detected section, and then reduces the noise signal using a time-delay neural network. Experiments measuring the signal-to-noise ratio confirm that the proposed system is effective for speech degraded by various noises.
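The frame-wise adaptation described in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the energy and zero-crossing rule in `classify_frame`, the thresholds, the three-tap kernel in `lateral_inhibition`, and the coefficient values in `INHIBITION`. The amplitude component adjustment and the time-delay neural network stage are omitted.

```python
# Hypothetical per-class inhibition strengths. The paper adapts its
# coefficients per frame, but the abstract gives no numeric values.
INHIBITION = {"voiced": 0.2, "unvoiced": 0.05}


def classify_frame(frame, energy_thresh=0.01, zcr_thresh=0.25):
    """Label a frame 'voiced' or 'unvoiced' using an assumed energy/ZCR rule."""
    energy = sum(x * x for x in frame) / len(frame)
    # Count adjacent sample pairs whose signs differ (zero crossings).
    crossings = sum((a < 0) != (b < 0) for a, b in zip(frame, frame[1:]))
    zcr = crossings / (len(frame) - 1)
    if energy >= energy_thresh and zcr <= zcr_thresh:
        return "voiced"
    return "unvoiced"


def lateral_inhibition(magnitude, c):
    """Apply the kernel [-c, 1 + 2c, -c] across frequency bins.

    The kernel sums to 1, so a flat spectrum is unchanged while a peak is
    raised and its neighbors are suppressed. Edge bins reuse their own value
    for the missing neighbor, and negative results are clipped to zero.
    """
    padded = [magnitude[0], *magnitude, magnitude[-1]]
    return [
        max(0.0, (1 + 2 * c) * padded[i] - c * (padded[i - 1] + padded[i + 1]))
        for i in range(1, len(padded) - 1)
    ]


def sharpen(frame, magnitude):
    """Pick the coefficient from the frame class, then inhibit the spectrum."""
    label = classify_frame(frame)
    return label, lateral_inhibition(magnitude, INHIBITION[label])
```

Here `frame` is a list of time-domain samples and `magnitude` is that frame's magnitude spectrum, however it was obtained. A stronger coefficient for voiced frames is one plausible choice, since voiced speech has pronounced spectral peaks worth sharpening, but the actual adaptation rule would have to come from the paper body.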

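This paper proposes an adaptive speech enhancement system based on the auditory system to enhance speech degraded by various background noises. The proposed system first detects voiced and unvoiced sections, then adaptively adjusts the lateral inhibition coefficient and the amplitude component adjustment coefficient according to the detection result for each input frame. Finally, it removes the noise signal using a time-delay neural network. Experimental results, evaluated by signal-to-noise ratio, show that the system is effective for speech signals degraded by various noises, both white noise and colored noise.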
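The signal-to-noise ratio evaluation mentioned in the abstract can be sketched as below. This is a minimal example assuming a global (whole-utterance) SNR computed against a clean reference signal. The abstract does not say whether the paper uses a global or a segmental measure, and the function names `snr_db` and `snr_improvement_db` are hypothetical.

```python
import math


def snr_db(clean, processed):
    """SNR in dB: clean-signal power over the power of (clean - processed)."""
    signal = sum(s * s for s in clean)
    error = sum((s - p) ** 2 for s, p in zip(clean, processed))
    if error == 0:
        return math.inf  # processed signal matches the reference exactly
    return 10 * math.log10(signal / error)


def snr_improvement_db(clean, noisy, enhanced):
    """Gain in SNR (dB) of the enhanced signal over the noisy input."""
    return snr_db(clean, enhanced) - snr_db(clean, noisy)
```

Under this definition, a positive `snr_improvement_db` means the enhanced signal is closer to the clean reference than the noisy input was. The same comparison could be repeated for each noise type, for example white and colored noise.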
