Performance Improvement of Microphone Array Speech Recognition Using Features Weighted Mahalanobis Distance

가중특징 Mahalanobis거리를 이용한 마이크 어레이 음석인식의 성능향상

  • Received : 2009.09.14
  • Accepted : 2009.11.12
  • Published : 2010.03.31

Abstract

In this paper, we present the use of the Features Weighted Mahalanobis Distance (FWMD) in improving the performance of Likelihood Maximizing Beamforming (Limabeam) algorithm in speech recognition for microphone array. The proposed approach is based on the replacement of the traditional distance measure in a Gaussian classifier with adding weight for different features in the Mahalanobis distance according to their distances after the variance normalization. By using Features Weighted Mahalanobis Distance for Limabeam algorithm (FWMD-Limabeam), we obtained correct word recognition rate of 90.26% for calibrate Limabeam and 87.23% for unsupervised Limabeam, resulting in a higher rate of 3% and 6% respectively than those produced by the original Limabearn. By implementing a HM-Net speech recognition strategy alternatively, we could save memory and reduce computation complexity.

Keywords

References

  1. L. J. Griffiths and C. W. Jim, "An alternative approach to linearly constrained adaptive beamforming," IEEE Transaction on Antennas and Propagation, vol, AP-30, no. 1, pp. 27-34, January 1982.
  2. R. Zelinski, "A microphone array with adaptive post-filtering for noise reduction in reverberant room." ICASSP-88, vol. 5, pp. 2578-2581, 1988.
  3. I. A. McCowan and H. Bourlard, "Microphone array post-filter for diffuse noise filed," ICASSP 2002, vol. 1, pp. 905-908, 2002.
  4. S. Leukimmiatis, "An Optimum Microphone Array Post-Filter for Speech Application," ICSLP-INTERSPEECH, pp. 2142-2145, Sep. 2006.
  5. M. Seltzer, "Microphone array processing for robust speech recognition,' Doctoral dissertation, Carnegie Mellon University, Pittsburgh, PA, 2003.
  6. M. Wolfel and H. K. Ekenel, "Feature weighted Mahalanobis distance: improved robustness for Gaussian classifiers," 13th European Signal Processing conference EUSIPCO2006, Antalya, Turkey, Sep. 2005.
  7. C. H. Knapp and C. Carter, "The generalized correlation method for estimation of time delay," IEEE Transactions on Information Theory, vol. 13, no. 2, pp. 260-269, April 1967.
  8. A. J. Viterbi, "Error bounds for convolution codes and an asymptotically optimum decoding algorithm," IEEE Transactions on Information Theory, Vol. 13, inssue 2. pp.260-267, April 1967.
  9. M. Suzuki, S. Makino, A. Ito, H. Aso and H. Shimodaira, "A new HMnet construction algorithm requiring no contextual factors," IEICE Trans, On Information System, Vol. E78-D, No 6, pp. 662-669, 1995.
  10. N. T. Hieu, "Robust Speech Recognition using Microphone Array in Adverse Environment," M,S Thesis, Yeungnam University, 2006.