[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3745/KIPSTB.2003.10B.1.067

A Semi-Noniterative VQ Design Algorithm for Text Dependent Speaker Recognition

Lim, Dong-Chul (아주대학교 대학원 전자공학부)
Lee, Haing-Sei (아주대학교)

Publication Information

The KIPS Transactions:PartB / v.10B, no.1, 2003 , pp. 67-72 More about this Journal

Abstract

In this paper, we study the enhancement of VQ (Vector Quantization) design for text dependent speaker recognition. In a concrete way, we present the non-Iterative method which makes a vector quantization codebook and this method Is nut Iterative learning so that the computational complexity is epochally reduced. The proposed semi-noniterative VQ design method contrasts with the existing design method which uses the iterative learning algorithm for every training speaker. The characteristics of a semi-noniterative VQ design is as follows. First, the proposed method performs the iterative learning only for the reference speaker, but the existing method performs the iterative learning for every speaker. Second, the quantization region of the non-reference speaker is equivalent for a quantization region of the reference speaker. And the quantization point of the non-reference speaker is the optimal point for the statistical distribution of the non-reference speaker In the numerical experiment, we use the 12th met-cepstrum feature vectors of 20 speakers and compare it with the existing method, changing the codebook size from 2 to 32. The recognition rate of the proposed method is 100% for suitable codebook size and adequate training data. It is equal to the recognition rate of the existing method. Therefore the proposed semi-noniterative VQ design method is, reducing computational complexity and maintaining the recognition rate, new alternative proposal.

Keywords

Vector Quantization; Clustering; Speaker Recognition; Computational Complexity; Learning;

Citations & Related Records

Reference

1	T. Kinnunen, T. Kilpelinen and P. Frnti, 'Comparison of clustering algorithms in speaker identification,' Proc, IASTED Int. Conf. Signal Processing and Communications (SPC 2000), Marbella, Spain, pp.222-227, 2000
2	H. Gish and M. Schmidt, 'Text-independent speaker identification,' IEEE Signal Processing Mag., Vol.11, p.1832, 1994 DOI ScienceOn
3	Y. Linde, A. Buzo and Gray R. M., 'An algorithm for vector quantizer design,' IEEE Trans. On Communications, 28(1), pp.84-95, January, 1980 DOI
4	A. K. Jain, R. P. W. Duin and J. Mao, 'Statistical pattern recognition: A review,' IEEE Trans. Pattern Anal. Machine Intell., Vol.22, p.437, Jan., 2000 DOI ScienceOn
5	D. A Reynolds, DOI
6	T. Kinnunen, I. Krkkinen and P. Frnti, 'Is speech data clustered? - Statistical analysis of cepstral features,' Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, Vol.4, pp.2627-2630, 2001
7	D. A Reynolds and Robust, 'Text-independent speaker identification using gaussian mixture speaker models,' IEEE Transactions on speech and audio processing, Vol.3, No.1, January, 1995 DOI ScienceOn
8	S. Furui, 'Cepstral analysis technique for automatic speaker verification.,' IEEE Trans. on Acoustics, Speech and Signal Processing, 29(2), pp.254-272, 1981 DOI

KSCI

A Semi-Noniterative VQ Design Algorithm for Text Dependent Speaker Recognition 문맥종속 화자인식을 위한 준비반복 벡터 양자기 설계 알고리즘

A Semi-Noniterative VQ Design Algorithm for Text Dependent Speaker Recognition