Browse > Article
http://dx.doi.org/10.6109/jkiice.2015.19.1.61

Target Speech Detection Using Gaussian Mixture Model of Frequency Bandwise Power Ratio for GSC-Based Beamforming  

Chang, Hyungwook (Department of Electronics Engineering, Gyeongsang National University)
Kim, Youngil (Department of Electronics Engineering, Gyeongsang National University)
Jeong, Sangbae (Department of Electronics Engineering, Gyeongsang National University)
Abstract
Noise reduction is necessary to compensate for the degradation of recognition performance by various types of noises. Among many noise reduction techniques using microphone array, generalized sidelobe canceller (GSC) has been widely applied to reduce nonstationary noises. The performance of GSC is directly affected by its adaptation mode controller (AMC). That is, accurate target speech detection is essential to guarantee the sufficient noise reduction in pure noise intervals and the less distortion in target speech intervals. Thus, this paper proposes an improved AMC design technique in which the power ratio of the output of fixed beamforming to that of blocking matrix is calculated frequency bandwise and probabilistically modeled by mixture Gaussians for each class. Experimental results show that the proposed algorithm outperforms conventional AMCs in receiver operating curves (ROC) and output SNRs.
Keywords
noise reduction; microphone array; generalized sidelobe canceller;
Citations & Related Records
연도 인용수 순위
  • Reference
1 ETSI ES 202 212, Speech processing, transmission and quality aspects (STQ), v.1.1.2, 2005.
2 S. Jeong and M. Hahn, "Speech quality and recognition rate improvement in car noise environments," Electronics Letters, Vol.37, No.12, pp. 801-802, 2001.
3 A. Hyvarinen and E. Oja, "Independent component analysis: Algorithms and applications," Neural Networks, vol. 13, no. 4, pp. 411-430, 2000.   DOI
4 O. Frost, "An algorithm for linearly constrained adaptive array processing," Proceedings of the IEEE, Vol 60, No. 8, pp. 926-935, 1972.   DOI
5 S. Gannot et al., "Signal enhancement using beamforming and nonstationarity with applications to speech," IEEE Trans. Signal Process., Vol. 49, No. 8, pp. 1614-1626, 2001.   DOI
6 Y. Jung, H. Kang, C. Lee, D. Youn, C. Choi, and J. Kim, "Adaptive microphone array system with two-stage adaptation mode controller," IEICE Trans. Fund., vol. E88-A, no. 4, pp. 972-977, Apr. 2005.   DOI
7 O. Hoshuyama, A. Sugiyama, and A. Hirano, "A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters," IEEE Trans. Signal Process., Vol 47, No. 10, pp. 2677-2684, 1999.   DOI
8 L. Rabiner and B. Juang, Fundamentals of Speech Recognitions, Prentice Hall, 1993.
9 M. Hayes, Statistical Digital Signal Processing and Modeling, John Wiley & Sons, 1996.
10 F. Tom, "An introduction to ROC analysis", Pattern Recognition Letters, Vol. 27, pp. 861-874. 2006.   DOI