Speech Reinforcement Based on Soft Decision Under Far-End Noise Environments

A New Speech Reinforcement Technique Based on Soft Decision Under Far-End Noise Environments

  • 최재훈 (School of Electronic Engineering, Inha University)
  • 장준혁 (School of Electronic Engineering, Inha University)
  • Published : 2008.10.31

Abstract

In this paper, we propose an effective speech reinforcement technique for near-end and far-end noise environments. Because the intelligibility of far-end speech for the near-end listener is significantly reduced when the near end is noisy, a far-end speech reinforcement approach is needed to counter this phenomenon. Specifically, based on the estimated near-end background noise spectrum, we reinforce the far-end speech spectrum, taking into account the more general case in which background noise is present. We also propose a novel approach that reinforces only the actual speech signal, excluding the noise, in the far-end noisy speech signal. The performance of the proposed algorithm is evaluated under various noise environments with the CCR (Comparison Category Rating) test, one of the methods for subjective determination of transmission quality in ITU-T P.800, and the algorithm shows better performance than the conventional method.

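This paper presents an effective speech reinforcement technique for near-end and far-end noise environments. In general, the intelligibility of the far-end talker's speech received in a near-end environment with background noise is greatly reduced, so a far-end speech reinforcement technique is needed to overcome this. Specifically, the far-end talker's speech power is reinforced based on the estimated near-end background noise power. In particular, considering the general case in which noise is also present in the near-end environment, we propose an improved algorithm that reinforces only the actual speech signal, excluding the noise, in the noise-corrupted far-end speech signal. The performance of the proposed speech reinforcement technique was evaluated under various noise environments with the CCR (Comparison Category Rating) test, a subjective quality measurement method in ITU-T P.800, and it showed superior performance compared with the conventional speech reinforcement technique.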
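
The abstract gives no equations, so the following is only a rough sketch of how a soft-decision reinforcement gain of the kind described could be computed per frequency bin. It is not the authors' algorithm. The speech presence probability uses the standard complex-Gaussian likelihood ratio; the boost rule (raise the estimated far-end speech power toward a target ratio over the near-end noise power) and the names `target_snr`, `max_gain`, and `prior_ratio` are assumptions introduced purely for illustration.

```python
import math


def speech_presence_prob(noisy_power, noise_power, prior_snr, prior_ratio=1.0):
    """Per-bin probability that far-end speech is present.

    Uses the complex-Gaussian likelihood ratio
    L = exp(gamma * xi / (1 + xi)) / (1 + xi),
    where gamma is the a posteriori SNR and xi the a priori SNR.
    prior_ratio = P(speech) / P(no speech) is an assumed value.
    """
    probs = []
    for y, lam_d, xi in zip(noisy_power, noise_power, prior_snr):
        gamma = y / lam_d
        # Clamp the exponent so math.exp cannot overflow.
        exponent = min(gamma * xi / (1.0 + xi), 50.0)
        ratio = prior_ratio * math.exp(exponent) / (1.0 + xi)
        probs.append(ratio / (1.0 + ratio))
    return probs


def soft_reinforcement_gain(far_noisy_power, far_noise_power, near_noise_power,
                            prior_snr, target_snr=4.0, max_gain=4.0):
    """Amplitude gain per bin: boost speech, leave far-end noise alone.

    Hypothetical rule: the boost lifts the estimated far-end speech
    power toward target_snr times the near-end noise power, limited
    to the range [1, max_gain]. The soft decision blends that boost
    (speech present) with unity gain (speech absent).
    """
    probs = speech_presence_prob(far_noisy_power, far_noise_power, prior_snr)
    gains = []
    for y, lam_far, lam_near, p in zip(far_noisy_power, far_noise_power,
                                       near_noise_power, probs):
        speech_power = max(y - lam_far, 1e-12)  # crude speech power estimate
        boost = math.sqrt(target_snr * lam_near / speech_power)
        boost = min(max(boost, 1.0), max_gain)
        gains.append(p * boost + (1.0 - p) * 1.0)
    return gains


if __name__ == "__main__":
    # One bin with strong far-end speech, under quiet and loud near-end noise.
    quiet = soft_reinforcement_gain([100.0], [1.0], [1.0], [50.0])
    loud = soft_reinforcement_gain([100.0], [1.0], [100.0], [50.0])
    print(quiet, loud)
```

In this sketch the gain stays at unity when the near end is quiet and grows as the near-end noise power rises, while bins judged unlikely to contain speech are pulled back toward unity so that far-end noise is not amplified along with the speech.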
