[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2006.25.8.370

Mel-Frequency Cepstral Coefficients Using Formants-Based Gaussian Distribution Filterbank

Son, Young-Woo (경북대학교 전자공학과)
Hong, Jae-Keun (경북대학교 전자공학과)

Publication Information

The Journal of the Acoustical Society of Korea / v.25, no.8, 2006 , pp. 370-374 More about this Journal

Abstract

Mel-frequency cepstral coefficients are widely used as the feature for speech recognition. In FMCC extraction process. the spectrum. obtained by Fourier transform of input speech signal is divided by met-frequency bands, and each band energy is extracted for the each frequency band. The coefficients are extracted by the discrete cosine transform of the obtained band energy. In this Paper. we calculate the output energy for each bandpass filter by taking the weighting function when applying met-frequency scaled bandpass filter. The weighting function is Gaussian distributed function whose center is at the formant frequency In the experiments, we can see the comparative performance with the standard MFCC in clean condition. and the better Performance in worse condition by the method proposed here.

Keywords

MFCC (met-frequency cepstral coefficients); Formant; Gaussian distribution; Speech recognition;

Citations & Related Records

Reference

1	M. J. F. Gales and S. J. Young, 'Cepstral parameter compensation for HMM recognition in noise', Speech Communication, 12 231-239, 1993 DOI ScienceOn
2	S. Young, D. Kershaw,J. Odell,D. Ollason,and P. Woodland,The HTK Book version 3.2. I, 2002
3	H. Hermansky, 'Perceptual linear predictive (PLP) analysis of speech', J. Acoust. Soc. Am 87 1738-1752, April 1990 DOI
4	K. K. Chu, S. H. Leung, 'Feature extraction based on perceptually non-uniform spectral compression for speech recognition', Proc. ISCAP 2003, 726-729, 2003
5	L. R. Rabiner, 'A tutorial on hidden Markov models and selected applications in speech recognition,' Proc. IEEE, 77(2) 257-286, Feb. 1989
6	P. Lockwood and J. Boudy, 'Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection for robust speech recognition in cars', Speech Communication, 11 215-228, June 1992 DOI ScienceOn
7	L. Welling and H. Ney, 'Formant estimation for speech recognition', IEEE Trans. On Speech and Audio Processing, 6(1) Jan. 1998
8	K. K. Chu, S. H. Leung and C. S. Yip, 'Perceptually non-uniform spectral compression for noisy speech recognition', Proc. ICASSP 2003, 404-407 2003
9	K. K. Chu and S. H. Leung, 'SNR-dependent non-uniform spectral compression for noisy speech recognition', Proc ICASSP 2004, 973-976, 2004

KSCI

Mel-Frequency Cepstral Coefficients Using Formants-Based Gaussian Distribution Filterbank 포만트 기반의 가우시안 분포를 가지는 필터뱅크를 이용한 멜-주파수 켑스트럴 계수

Mel-Frequency Cepstral Coefficients Using Formants-Based Gaussian Distribution Filterbank