Acoustic Echo Cancellation using Time-Frequency Masking and Higher-order Statistics

시간-주파수 마스킹과 고차 신호 통계를 이용한 음향 반향신호 제거

  • 김경재 (한양대학교 전자통신컴퓨터학과) ;
  • 남상원 (한양대학교 전자통신컴퓨터학과)
  • Published : 2007.03.01

Abstract

In hands-free full-duplex communication systems, acoustic signals picked up by the microphones can be mixed with echo signals as well as noises, which may result in poor performance of the corresponding communication system. Also, the system performance may decrease further if the reverberation occurs since it is harder to estimate the impulse response of the demixing system. For blind source separation (BSS) in such cases, a time-frequency masking approach can be employed to separate undesired echo signals and noises, but, permutation ambiguities also should be solved for the echo cancellation. In this paper, we propose a new acoustic echo cancellation (AEC) approach utilizing the time-frequency masking and higher-order statistics, whereby a desired signal selection, based on coherence and third-order statistics (i.e., kurtosis), is introduced along with output signal normalization. Simulation results demonstrate that the proposed approach yields better echo and noise cancellation performances than the conventional AEC approaches.

Keywords

References

  1. E. Hansler and G. Schmidt, Acoustic Echo and Noise Control: A Practical Approach, New Jersey, Wiley, 2004
  2. S. Haykin, Adaptive Filter Theory, 4th Ed., New Jersey, Prentice-Hall, 2002
  3. A. Hyvarinen, J Karhunen, and E. Oja, Independent Component Analysis. New York: Wiley, 2001
  4. D. Kim, H. Choi, and H. Bae, 'Acoustic echo cancellation using blind source separation,' Proc. IEEE Workshop on SIPS2003, pp. 241-244, Aug. 2003
  5. A. Jourjine, S. Rickard, and O. Yilmaz, 'Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures,' Proc. ICASSP2000, vol. 5, pp. 2985-2988, Jun. 2CXXl
  6. S. Araki, S. Makino, A. Blin, R. Mukai, and H. Sawada, 'Underdetermined blind separation for speech in real environments with sparseness and ICA,' Proc. ICASSP2004, vol. 3, pp. 881-884, May 2004
  7. O. Yilmaz and S. Rickard, 'Blind separation of speech mixtures via time-frequency,' IEEE Trans. on Audio, Speech, and Language Process., vol. 52, no. 7, pp. 1830-1847, Jul. 2005
  8. S.Y. Low, S. Nordholm, and R. Togneri, 'Convolutive blind signal separation with post-processing,' IEEE Trans. on Speech and Audio Process., vol. 12, no. 5, pp. 539-548, Sept. 2004 https://doi.org/10.1109/TSA.2004.832993
  9. T. Gansler, M Hansson, C.J. Ivarsson, and G. Salomonsson, 'A double-talk detector based on coherence,' IEEE Trans. on Communications, vol. 44, no. 11, pp. 1421-1427, Nov. 1996 https://doi.org/10.1109/26.544458