DOI QR코드

DOI QR Code

공간 필터와 결합된 음성 왜곡 가중 다채널 위너 필터에서의 신호 대 잡음 비에 의한 가중치 결정 방법

SNR-based Weight Control for the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filtering

  • Kim, Gibak (School of Electrical Engineering, Soongsil University)
  • 투고 : 2013.01.17
  • 심사 : 2013.04.15
  • 발행 : 2013.05.30

초록

본 논문에서는 여러 개의 마이크를 이용하여 잡음을 제거하는 방법인 공간 필터로 전처리된 신호를 입력으로 하는 음성 왜곡 가중 다채널 위너 필터 (Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filter: SP-SDW-MWF)에 대해 소개하고, 가중치를 결정하는 방법을 제안한다. SP-SDW-MWF는 마이크로폰 어레이를 이용한 잡음 제거 알고리즘으로서 마이크로폰 불일치와 같은 오차에 강인한 것으로 알려져 있다. SP-SDW-MWF는 필터 계수를 최적화할 때 음성 왜곡과 잡음 제거에 대한 기준으로 나누어 가중치를 두고 있다. 이러한 가중치를 결정하기 위해, 본 논문에서는 전력 스펙트럼 밀도 오차를 평가 척도로 사용하여 마이크로폰으로부터 입력된 음성 신호와 잡음의 전력 스펙트럼 밀도의 비 (a priori SNR)를 이용하는 방법을 제안한다. 실험결과에서 나타난 바와 같이 a priori SNR에 따라 가변적인 가중치를 사용하는 것이 고정된 값을 가중치로 사용하는 것보다 향상된 성능을 보임을 알 수 있다.

This paper introduces the Spatially Preprocessed Speech Distortion Weighted Multi-channel Wiener Filter (SP-SDW-MWF) for multi-microphone noise reduction and proposes a method to determine the speech distortion weights. The SP-SDW-MWF is known as a robust noise reduction algorithm against the error caused by the mismatch in microphones. The SP-SDW-MWF adopts weights which determine the amount of noise reduction at the expense of introducing speech distortion in the noise-suppressed speech. In this paper, we use the error of power spectral density between the estimated signal and the desired signal as the evaluation measure. Thus the a priori SNR is used to control the speech distortion weights in the frequency domain. In the experimental results, the proposed method yields better result in terms of MFCC distortion compared to the conventional method.

키워드

참고문헌

  1. J. S. Lim and a. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," Proceedings of the IEEE, vol. 67, no. 12, pp. 1586-1604, 1979. https://doi.org/10.1109/PROC.1979.11540
  2. S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-27, no. 2, pp. 113-120, 1979.
  3. Y. Ephraim and D. Malah, "Speech enhancement using a minimum- mean square error short-time spectral amplitude estimator," IEEE Trans. on Acoustics, Speech, and Signal Processing, vol. ASSP-32, no. 6, pp. 1109-1121, 1984.
  4. Y. Ephraim and H. Van Trees, "A signal subspace approach for speech enhancement," IEEE Trans. on Speech and Audio Processing, vol. 3, no. 4, pp. 251-266, 1995. https://doi.org/10.1109/89.397090
  5. J. Huang and Y. Zhao, "An energy-constrained signal subspace method for speech enhancement and recognition in white and colored noises," Speech Communication, vol. 26, no. 3, pp. 165-181, Nov. 1998. https://doi.org/10.1016/S0167-6393(98)00041-7
  6. K. Hermus, P. Wambacq, and H. Hamme, "A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition," EURASIP Journal on Advances in Signal Processing, vol. 2007, no. 1, p. 045821, 2007. https://doi.org/10.1155/2007/45821
  7. M. Brandstein and D. Ward (Eds.), Microphone Arrays, Springer-Verlag, 2001.
  8. S. Doclo and M. Moonen, "GSVD-based optimal filtering for multimicrophone speech enhancement," IEEE Trans. on Signal Processing, vol. 50, no. 9, pp. 2230-2244, Sep. 2002. https://doi.org/10.1109/TSP.2002.801937
  9. G. Rombouts and M. Moonen, "QRD-based unconstrained optimal filtering for acoustic noise reduction," Signal Processing, vol. 83, no. 9, pp. 1889-1904, Sep. 2003. https://doi.org/10.1016/S0165-1684(03)00107-5
  10. A. Spriet, M. Moonen, and J. Wouters, "Spatially preprocessed speech distortion weighted multi-channel Wiener filtering for noise reduction," Signal Processing, vol. 84, no. 12, pp. 2367-2387, Dec. 2004. https://doi.org/10.1016/j.sigpro.2004.07.028
  11. L. Griffiths and C. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Trans. Antennas Propagation, vol. AP-30, no. 1, pp. 27-34, Jan. 1982.
  12. J. Benesty, S. Makino, J. Chen (Eds.), Speech Enhancement, Springer, 2005.
  13. L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, 1993.
  14. J. Allen and D. Berkely, "Image method for efficiently simulating small-room acoustics," Journal of the Acoustical Society of America, vol. 65, pp. 943-950, April 1979. https://doi.org/10.1121/1.382599