Browse > Article
http://dx.doi.org/10.7776/ASK.2006.25.1.036

A PCA-based MFDWC Feature Parameter for Speaker Verification System  

Hahm Seong-Jun (영남대학교 정보통신공학과)
Jung Ho-Youl (영남대학교 정보통신공학과)
Chung Hyun-Yeol (영남대학교 정보통신공학과)
Abstract
A Principal component analysis (PCA)-based Mel-Frequency Discrete Wavelet Coefficients (MFDWC) feature Parameters for speaker verification system is Presented in this Paper In this method, we used the 1st-eigenvector obtained from PCA to calculate the energy of each node of level that was approximated by. met-scale. This eigenvector satisfies the constraint of general weighting function that the squared sum of each component of weighting function is unity and is considered to represent speaker's characteristic closely because the 1st-eigenvector of each speaker is fairly different from the others. For verification. we used Universal Background Model (UBM) approach that compares claimed speaker s model with UBM on frame-level. We performed experiments to test the effectiveness of PCA-based parameter and found that our Proposed Parameters could obtain improved average Performance of $0.80\%$compared to MFCC. $5.14\%$ to LPCC and 6.69 to existing MFDWC.
Keywords
Speaker Verification; Principal Component Analysis; Wavelet Transform; Universal Background Model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Gowdy. J.N. Tufekci. Z., 'Mel-Scaled discrete Wavelet Coefficients for Speech Recognition,' ICASSP. 3. 1351-1354, 2000
2 Bourlard, H. and Dupont, S., 'A New ASR Approach Based on Independent Processing and Recombination of Partial Frequency Bands', ICSLP, 1996
3 Goswami. J. & all., 'Fundamental of Wavelets'. Wiley, 1999
4 S. Furui. 'An overview of speaker recognition technol-ogy.' in Acoustic speech and speaker recognition(C.-H Lee. F. K. Soong, and K. K. Paliwal, eds.), Ch. 2, 31-56, Kluwer Acad. Pub., 1996
5 Dat Tran. Michael Wagner. 'A Proposed Likelihood Transformation for Speaker Verification'. in proc. IEEE International Conference on. 2, 1069-1072, 2000
6 A. E. Rosenberg, J. Delong. C.-H. Lee, B.-H.Juang and F. K. Soong, 'The use of cohort normalized scores for speaker verification' in Proc. Inter. Conf. on Spoken Language Processing (ICSLP'92). 599-602. 1992
7 Gish, H. and Schmidt, M., 'Text-lndependent Speaker recognitioin', IEEE Sig. Proc. Magazine, 18-32, 1994
8 Farooq, O. and Datta, S., 'Mel Filter-Like Admissible Wavelet Packet Structure for Speech Recognition', IEEE Signal Processing Letters, 8 (7), 196-198. 2001   DOI   ScienceOn
9 A. L. Higgins, L. Bahler and J. Porter, 'Speaker Verification using Randomized Phrase Prompting', Digital Signal Processing. 1.89-106. 1991   DOI   ScienceOn
10 G. Strang. T. Nguyen, Wavelets and Filter Banks. (Wellesley-Cambridge Press, 1997)
11 Jolliffe. l.T. Principal Component Analvsis, (Springer Verlag). 1986
12 C. Tadj, A. Benlahouar, 'Speaker Characterization Using Principal Component Analysis and Wavelet Transform for Speaker Verification', in Proceedings of Eurospeech, 2981-2984. 2003
13 S. B. Davis, P. Mermelstein. 'Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences,' IEEE Trans. Acoustics, Speech, and Signal Processing, 28, 357-366. 1980   DOI
14 T. Matsui and S. Furui, 'Concatenated Phoneme Models for Text Variable Speaker Heconnlton'. in Proc. IEEE Inter. Conf. on Acoustics, Speech. and Signal Procssing (ICASSP'93), 391-392, 1993
15 함성준, 박하중, 김민정, 김주곤, 정호열, 정현열, 'Wavelet 변환을 이용한 GMIM 기반 문액독립 화자검증 시스템의 성능평가', 한국음향학회 학술발표대회 논문집 24 1(s), 한국음향학회, 297-300. 2005
16 D. A. Reynolds. 'Comparison of Background Normalization Methods for Text-Independent Speaker Verification,' in Proc. Eurospeech97. 963-967. 1997