Browse > Article
http://dx.doi.org/10.7776/ASK.2006.25.8.370

Mel-Frequency Cepstral Coefficients Using Formants-Based Gaussian Distribution Filterbank  

Son, Young-Woo (경북대학교 전자공학과)
Hong, Jae-Keun (경북대학교 전자공학과)
Abstract
Mel-frequency cepstral coefficients are widely used as the feature for speech recognition. In FMCC extraction process. the spectrum. obtained by Fourier transform of input speech signal is divided by met-frequency bands, and each band energy is extracted for the each frequency band. The coefficients are extracted by the discrete cosine transform of the obtained band energy. In this Paper. we calculate the output energy for each bandpass filter by taking the weighting function when applying met-frequency scaled bandpass filter. The weighting function is Gaussian distributed function whose center is at the formant frequency In the experiments, we can see the comparative performance with the standard MFCC in clean condition. and the better Performance in worse condition by the method proposed here.
Keywords
MFCC (met-frequency cepstral coefficients); Formant; Gaussian distribution; Speech recognition;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. J. F. Gales and S. J. Young, 'Cepstral parameter compensation for HMM recognition in noise', Speech Communication, 12 231-239, 1993   DOI   ScienceOn
2 S. Young, D. Kershaw,J. Odell,D. Ollason,and P. Woodland,The HTK Book version 3.2. I, 2002
3 H. Hermansky, 'Perceptual linear predictive (PLP) analysis of speech', J. Acoust. Soc. Am 87 1738-1752, April 1990   DOI
4 K. K. Chu, S. H. Leung, 'Feature extraction based on perceptually non-uniform spectral compression for speech recognition', Proc. ISCAP 2003, 726-729, 2003
5 L. R. Rabiner, 'A tutorial on hidden Markov models and selected applications in speech recognition,' Proc. IEEE, 77(2) 257-286, Feb. 1989
6 P. Lockwood and J. Boudy, 'Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars', Speech Communication, 11 215-228, June 1992   DOI   ScienceOn
7 L. Welling and H. Ney, 'Formant estimation for speech recognition', IEEE Trans. On Speech and Audio Processing, 6(1) Jan. 1998
8 K. K. Chu, S. H. Leung and C. S. Yip, 'Perceptually non-uniform spectral compression for noisy speech recognition', Proc. ICASSP 2003, 404-407 2003
9 K. K. Chu and S. H. Leung, 'SNR-dependent non-uniform spectral compression for noisy speech recognition', Proc ICASSP 2004, 973-976, 2004