Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2003.10B.1.067

A Semi-Noniterative VQ Design Algorithm for Text Dependent Speaker Recognition  

Lim, Dong-Chul (아주대학교 대학원 전자공학부)
Lee, Haing-Sei (아주대학교)
Abstract
In this paper, we study the enhancement of VQ (Vector Quantization) design for text dependent speaker recognition. In a concrete way, we present the non-Iterative method which makes a vector quantization codebook and this method Is nut Iterative learning so that the computational complexity is epochally reduced. The proposed semi-noniterative VQ design method contrasts with the existing design method which uses the iterative learning algorithm for every training speaker. The characteristics of a semi-noniterative VQ design is as follows. First, the proposed method performs the iterative learning only for the reference speaker, but the existing method performs the iterative learning for every speaker. Second, the quantization region of the non-reference speaker is equivalent for a quantization region of the reference speaker. And the quantization point of the non-reference speaker is the optimal point for the statistical distribution of the non-reference speaker In the numerical experiment, we use the 12th met-cepstrum feature vectors of 20 speakers and compare it with the existing method, changing the codebook size from 2 to 32. The recognition rate of the proposed method is 100% for suitable codebook size and adequate training data. It is equal to the recognition rate of the existing method. Therefore the proposed semi-noniterative VQ design method is, reducing computational complexity and maintaining the recognition rate, new alternative proposal.
Keywords
Vector Quantization; Clustering; Speaker Recognition; Computational Complexity; Learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. Kinnunen, T. Kilpelinen and P. Frnti, 'Comparison of clustering algorithms in speaker identification,' Proc, IASTED Int. Conf. Signal Processing and Communications (SPC 2000), Marbella, Spain, pp.222-227, 2000
2 H. Gish and M. Schmidt, 'Text-independent speaker identification,' IEEE Signal Processing Mag., Vol.11, p.1832, 1994   DOI   ScienceOn
3 Y. Linde, A. Buzo and Gray R. M., 'An algorithm for vector quantizer design,' IEEE Trans. On Communications, 28(1), pp.84-95, January, 1980   DOI
4 A. K. Jain, R. P. W. Duin and J. Mao, 'Statistical pattern recognition: A review,' IEEE Trans. Pattern Anal. Machine Intell., Vol.22, p.437, Jan., 2000   DOI   ScienceOn
5 D. A Reynolds,   DOI
6 T. Kinnunen, I. Krkkinen and P. Frnti, 'Is speech data clustered? - Statistical analysis of cepstral features,' Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, Vol.4, pp.2627-2630, 2001
7 D. A Reynolds and Robust, 'Text-independent speaker identification using gaussian mixture speaker models,' IEEE Transactions on speech and audio processing, Vol.3, No.1, January, 1995   DOI   ScienceOn
8 S. Furui, 'Cepstral analysis technique for automatic speaker verification.,' IEEE Trans. on Acoustics, Speech and Signal Processing, 29(2), pp.254-272, 1981   DOI