DOI QR코드

DOI QR Code

스테레오 음향학적 에코 제거를 위한 Soft Decision 기반 필터 확장 기법

Spectro-Temporal Filtering Based on Soft Decision for Stereophonic Acoustic Echo Suppression

  • Lee, Chul Min (Seoul National University, Department of Electrical and Computer Engineering and Institute of New Media and Communications) ;
  • Bae, Soo Hyun (Seoul National University, Department of Electrical and Computer Engineering and Institute of New Media and Communications) ;
  • Kim, Jeung Hun (Seoul National University, Department of Electrical and Computer Engineering and Institute of New Media and Communications) ;
  • Kim, Nam Soo (Seoul National University, Department of Electrical and Computer Engineering and Institute of New Media and Communications)
  • 투고 : 2014.09.30
  • 심사 : 2014.11.06
  • 발행 : 2014.12.31

초록

본 논문은 스테레오 환경에서 발생하는 음향학적 에코 신호를 효율적으로 제거하기 위하여 시간 및 주파수 상관관계 (spectro-temporal correlation) 를 고려한 필터링을 제안하였다. 기존의 에코 패스를 직접 추정하는 방식에서 벗어나 동시 통화 검출 기법 없이 에코 스펙트럼을 추정하는 음향학적 에코 억제 기법 (acoustic echo suppression, AES) 을 적용하였다. 개선된 에코 추정을 위해 확장된 파워 스펙트럼 밀도 행렬 (extended power spectrum density matrix) 과 에코 과추정 조절 행렬 (echo overestimation control matrix) 을 도입하였다. 또한, 주파수 영역에서 음성이 존재하지 않을 확률을 적용한 soft decision 기반의 본 기법을 통해 스테레오 환경에서의 음향학적 에코 신호 제거 성능이 기존 기법에 비해 보다 향상됨을 확인하였다.

We propose a novel approach for stereophonic acoustic echo suppression using spectro-temporal filtering based on soft decision. Unlike the conventional approaches estimating the echo pathes directly, the proposed technique can estimate stereo echo spectra without any double-talk detector. In order to improve the estimation of echo spectra, the extended power spectrum density matrix and echo overestimation control matrix are applied on this method. In addition, this echo suppression technique is based on soft decision technique using speech absence probability in STFT domain. Experimental results show that the proposed method improves compared with the conventional approaches.

키워드

참고문헌

  1. H. M. Yoon and H. W. Lee, "A new adaptive algorithm for the acoustic echo canceller," J. Commun. Networks (JCN), vol. 26, no. 6, pp. 77-81, Jun. 2001.
  2. C. M. Lee, Y. G. Jin, T. G. Kang, and N. S. Kim, "A study of acoustic echo cancellation using subband adaptive Kalman filtering in frequency domain," in Proc. KICS Int. Conf. Commun. 2012 (KICS ICC 2012), pp. 346- 347, Jeju Island, Korea, Jun. 2012.
  3. J. Benesty, D. R. Morgan, and M. M. Sondhi, "A better understanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation," IEEE Trans. Speech, Audio. Process., vol. 6, no. 2, pp. 156-165, Mar. 1998. https://doi.org/10.1109/89.661474
  4. F. Yang, M. Wu, and J. Yang, "Stereophonic acoustic echo suppression based on Wiener filter in the short-time Fourier transform domain," IEEE Signal Process. Lett., vol. 19, no. 4, pp. 227-230, Apr. 2012. https://doi.org/10.1109/LSP.2012.2187446
  5. Y. S. Park and J. H. Chang, "Residual echo suppression based on tracking echo-presence uncertainty," J. Commun. Networks (JCN), vol. 34, no. 10, pp. 955-960, Oct. 2009.
  6. S. Y. Lee and N. S. Kim, "A statistical model based residual echo suppression," IEEE Signal Process. Lett., vol. 14, no. 10, pp. 758-761, Oct. 2007. https://doi.org/10.1109/LSP.2007.896452
  7. C. M. Lee, J. W. Shin, and N. S. Kim, "Stereophonic acoustic echo suppression incorporating spectro-temporal correlations," IEEE Signal Process. Lett., vol. 21, no. 3, pp. 316-320, Mar. 2014 https://doi.org/10.1109/LSP.2014.2302438
  8. J. Bergstra and Y. Bengio, "Random search for hyper-parameter optimization," J. Machine Learning Research, vol. 13, pp. 281-305, 2012.
  9. ITU-T, "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs," ITU-T Rec., p. 862, 2000.