Improved Acoustic Modeling Based on Selective Data-driven PMC

  • Kim, Woo-Il (Dept. of Electronics and Computer Engineering, Korea University) ;
  • Kang, Sun-Mee (Dept. of Computer Science, Seokyeong University) ;
  • Ko, Han-Seok (Dept. of Electronics and Computer Engineering, Korea University, Dept. of Electrical and Computer Engineering, Johns Hopkins University)
  • Published : 2002.03.01

Abstract

This paper proposes an effective method to remedy the acoustic modeling problem inherent in the usual log-normal Parallel Model Composition intended for achieving robust speech recognition. In particular, the Gaussian kernels under the prescribed log-normal PMC cannot sufficiently express the corrupted speech distributions. The proposed scheme corrects this deficiency by judiciously selecting the 'fairly' corrupted component and by re-estimating it as a mixture of two distributions using data-driven PMC. As a result, some components become merged while equal number of components split. The determination for splitting or merging is achieved by means of measuring the similarity of the corrupted speech model to those of the clean model and the noise model. The experimental results indicate that the suggested algorithm is effective in representing the corrupted speech distributions and attains consistent improvement over various SNR and noise cases.

Keywords