Browse > Article
http://dx.doi.org/10.7776/ASK.2009.28.4.401

Frequency Domain Double-Talk Detector Based on Gaussian Mixture Model  

Lee, Kyu-Ho (인하대학교 전자공학부)
Chang, Joon-Hyuk (인하대학교 전자공학부)
Abstract
In this paper, we propose a novel method for the cross-correlation based double-talk detection (DTD), which employing the Gaussian Mixture Model (GMM) in the frequency domain. The proposed algorithm transforms the cross correlation coefficient used in the time domain into 16 channels in the frequency domain using the discrete fourier transform (DFT). The channels are then selected into seven feature vectors for GMM and we identify three different regions such as far-end, double-talk and near-end speech using the likelihood comparison based on those feature vectors. The presented DTD algorithm detects efficiently the double-talk regions without Voice Activity Detector which has been used in conventional cross correlation based double-talk detection. The performance of the proposed algorithm is evaluated under various conditions and yields better results compared with the conventional schemes. especially, show the robustness against detection errors resulting from the background noises or echo path change which one of the key issues in practical DTD.
Keywords
Gaussian mixture model; Double-talk detection; Cross-correlation coefficient; Likelihood; Voice activity detector;
Citations & Related Records
연도 인용수 순위
  • Reference