Browse > Article

Extraction of MFCC feature parameters based on the PCA-optimized filter bank and Korean connected 4-digit telephone speech recognition  

정성윤 (경북대학교)
김민성 (경북대학교)
손종목 (경북대학교)
배건성 (경북대학교)
Publication Information
Abstract
In general, triangular shape filters are used in the filter bank when we extract MFCC feature parameters from the spectrum of the speech signal. A different approach, which uses specific filter shapes in the filter bank that are optimized to the spectrum of training speech data, is proposed by Lee et al. to improve the recognition rate. A principal component analysis method is used to get the optimized filter coefficients. Using a large amount of 4-digit telephone speech database, in this paper, we get the MFCCs based on the PCA-optimized filter bank and compare the recognition performance with conventional MFCCs and direct weighted filter bank based MFCCs. Experimental results have shown that the MFCC based on the PCA-optimized filter bank give slight improvement in recognition rate compared to the conventional MFCCs but fail to achieve better performance than the MFCCs based on the direct weighted filter bank analysis. Experimental results are discussed with our findings.
Keywords
PCA-optimized 필터뱅크;4연숫자 전화음성인식;MFCC특징파라미터 추출;
Citations & Related Records
연도 인용수 순위
  • Reference
1 정성윤, 김민성, 손종목, 배건성, 김상훈, '한국어 연속숫자음 전화음성의 인식성능 개선,' 대한전자공학회 추계학술대회 논문집, 제 25권 2호, 582-585쪽, 2002
2 I. T. Jolliffe, Principal component analysis, Springer Verlag, 2002
3 http://www.sitec.or.kr/index.asp
4 Steve Young, Gunnar Evermann and D. Kershaw, The HTK Book (HTK Version 3.0), Cambridge, 2000
5 정성윤, 김민성, 손종목, 배건성, 김상훈, '채널보상기법 및 특징파라미터추출 방법에 따른 연속숫자음 전화음성의 인식성능향상,' 대한음성학회 정기총회 및 학술발표대회 논문집, 201-203쪽, 2002
6 김성탁, 김상진, 정호영, 김회린, 한민수, '전화망 환경에서의 연속숫자음 인식 성능평가,' 한국음향학회 논문집, 제 21권 1호, 253-256쪽, 2002
7 A.Biern, S.Katagiri, E.McDermott and B.H.Juang, 'An application of discriminative feature extraction to filter-bank based speech recognition,' IEEE Transaction on Speech and Audio Processing, Vol.9, no.2, Feb. 2001   DOI   ScienceOn
8 C. Benitez, L. Burget, H.Hermansky, P.Jain, and N.Morgan, 'Robust ASR front-end spectral-based and discriminant features : experiments on the Aurora tasks,' Proc. Eurospeech, 2001
9 S. M. Lee, S. H. Fang, J. Hung, and L. S. Lee, 'Improved mfcc feature extraction by pca-optimized filter-bank for speech recognition,' Automatic Speech Recognition and Understanding, pp. 49-52, 2001