Filtering of Filter-Bank Energies for Robust Speech Recognition

  • Submitted : 2003.11.03
  • Published : 2004.06.30

Abstract

We propose a novel feature processing technique that provides a cepstral liftering effect in the log-spectral domain. Cepstral liftering aims to equalize the variance of the cepstral coefficients for distance-based speech recognizers and, as a result, provides robustness to additive noise and speaker variability. However, in the popular hidden Markov model based framework, cepstral liftering has no effect on recognition performance. We derive a filtering method in the log-spectral domain that corresponds to cepstral liftering. The proposed method performs high-pass filtering based on the decorrelation of filter-bank energies. We show that, in noisy speech recognition, the proposed method reduces the error rate by 52.7% relative to the conventional features.
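
As a rough illustration of high-pass filtering filter-bank energies along the frequency axis, the following minimal sketch applies a first-order FIR filter, y[k] = x[k] - alpha * x[k-1], across the band index of one frame of log filter-bank energies. The abstract does not specify the filter that the paper derives, so the filter form, the function name `highpass_log_energies`, and the coefficient `alpha` are all assumptions made here for illustration.

```python
def highpass_log_energies(frame, alpha=0.5):
    """Filter one frame of log filter-bank energies along the band index.

    Computes y[k] = x[k] - alpha * x[k-1], treating x[-1] as 0.
    Both the filter form and alpha are illustrative assumptions,
    not values taken from the paper.
    """
    out = []
    prev = 0.0  # assumed zero value before the first band
    for x in frame:
        out.append(x - alpha * prev)
        prev = x
    return out


# Hypothetical log filter-bank energies for a single frame (4 bands).
frame = [1.0, 2.0, 3.0, 4.0]
print(highpass_log_energies(frame))  # [1.0, 1.5, 2.0, 2.5]
```

With `alpha=1.0` the filter reduces to a plain first difference between adjacent bands, which removes any offset common to all bands except in the first output value.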
