http://dx.doi.org/10.3745/KIPSTB.2003.10B.6.673

A Classified Space VQ Design for Text-Independent Speaker Recognition  

Lim, Dong-Chul (Graduate School, Division of Electronics Engineering, Ajou University)
Lee, Hanig-Sei (Division of Electronics Engineering, Ajou University)
Abstract
In this paper, we study how to improve VQ (Vector Quantization) design for text-independent speaker recognition. Specifically, we present a method that builds the vector quantization codebook through non-iterative learning, so that the computational complexity is dramatically reduced. The proposed Classified Space VQ (CSVQ) design method for text-independent speaker recognition generalizes the semi-noniterative VQ design method for text-dependent speaker recognition. CSVQ contrasts with the existing design method, which runs an iterative learning algorithm for every training speaker. The characteristics of the CSVQ design are as follows. First, the proposed method performs non-iterative learning by using a Classified Space Codebook (CSC). Second, the quantization regions of each speaker are identical to the quantization regions of the Classified Space Codebook, and the quantization point of each speaker is the optimal point for that speaker's statistical distribution within each quantization region of the Classified Space Codebook. Third, the Classified Space Codebook is constructed through the Sample Vector Formation Method (CSVQ1, 2) and the Hyper-Lattice Formation Method (CSVQ3). In the numerical experiment, we use 12th-order mel-cepstrum feature vectors of 10 speakers and compare the proposed method with the existing method, varying the codebook size from 16 to 128 for each Classified Space Codebook. The recognition rate of the proposed method is 100% for CSVQ1 and CSVQ2, which equals the recognition rate of the existing method. Therefore, the proposed CSVQ design method is a new alternative that reduces computational complexity while maintaining the recognition rate, and CSVQ with a CSC can be applied to general-purpose recognition.
Keywords
Vector Quantization; Clustering; Speaker Recognition; Computational Complexity; Learning;
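
The non-iterative idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes squared Euclidean distortion, takes the optimal point of a region to be the centroid of the speaker's training vectors that fall in it, and forms the shared codebook by randomly sampling pooled training vectors as a stand-in for the Sample Vector Formation Method, whose details the abstract does not give. The function names, the synthetic 12-dimensional data, the fallback for empty regions, and the minimum-average-distortion decision rule are all hypothetical choices made for illustration.

```python
import numpy as np

def nearest_region(vectors, csc):
    """Index of the nearest shared (CSC) codeword for each vector."""
    d2 = ((vectors[:, None, :] - csc[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def build_speaker_codebook(train, csc):
    """One pass, no iteration: the regions are fixed by the shared codebook,
    and each speaker codeword is the centroid of that speaker's vectors
    inside the region."""
    regions = nearest_region(train, csc)
    codebook = csc.copy()  # assumption: empty regions keep the shared codeword
    for k in range(len(csc)):
        members = train[regions == k]
        if len(members) > 0:
            codebook[k] = members.mean(axis=0)
    return codebook

def average_distortion(test, csc, codebook):
    """Quantize with the shared regions, measure error to the speaker's points."""
    regions = nearest_region(test, csc)
    return float(((test - codebook[regions]) ** 2).sum(axis=1).mean())

def identify(test, csc, codebooks):
    """Assumed decision rule: pick the speaker with minimum average distortion."""
    scores = [average_distortion(test, csc, cb) for cb in codebooks]
    return int(np.argmin(scores))

# Synthetic stand-in for 12-dimensional feature vectors (not real speech data).
rng = np.random.default_rng(0)
dim, n_speakers, csc_size = 12, 3, 16
means = rng.normal(0.0, 3.0, size=(n_speakers, dim))
train_sets = [m + rng.normal(0.0, 1.0, size=(400, dim)) for m in means]

# Shared codebook: random sample vectors from the pooled training data (assumption).
pooled = np.vstack(train_sets)
csc = pooled[rng.choice(len(pooled), size=csc_size, replace=False)]

codebooks = [build_speaker_codebook(t, csc) for t in train_sets]
test_sets = [m + rng.normal(0.0, 1.0, size=(200, dim)) for m in means]
predicted = [identify(t, csc, codebooks) for t in test_sets]
print(predicted)
```

Because the quantization regions come from the shared codebook, each speaker's codebook needs only one nearest-codeword pass and one averaging pass over that speaker's data, in contrast to an iterative algorithm that repeats both steps until convergence for every speaker.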