A Study on Modified Clustering Algorithm for Text-Dependent Speaker Verification System

;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 23 Issue 7
/
Pages.548-553
/
2004
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

A Study on Modified Clustering Algorithm for Text-Dependent Speaker Verification System

문장종속 화자확인 시스템을 위한 개선된 군집화 알고리즘에 관한 연구

강철호 (광운대학교 전자통신학과) ;
정희석 (광운대학교 전자통신학과)

Published : 2004.10.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper we propose modified LBG algorithm to minimize quantization errors. When we apply conventional LBG algorithm for speaker verification system, problems that result from small amount of training data can be generated. That is, quantization error comes from fixed-sized codebook without any consideration for speaker characteristics and splitting vector in the wrong direction worsen performance of speaker verification system. So, we propose modified clustering method that has variable sized codebook according to speaker characteristics and makes right splitting direction by finding the farthest member away from mean and then find another member from the member. Simulation results show effectiveness of the proposed algorithm.

본 연구에서는 집단화 오차를 최소로 하기위해 개선된 LBG 알고리즘을 제안한다. 기존의 LBG 알고리즘은 화자확인 시스템에 적용시 소량의 학습 데이터의 분포가 가지는 특수성으로부터 기인하는 문제점들이 발생한다. 즉, 개인별 특성을 무시하고 항상 일정한 크기의 코드북을 생성해야 하는데서 기인하는 군집화 오류와 분할할 (Splitting) 방향을 잘못 선택하면서 발생하는 집단화의 오류가 전체 화자 인식율 저하의 원인이 된다. 따라서, 본 연구에서는 개인별로 최적의 크기를 가지는 가변길이 코드북 생성 기법과 중심값으로부터 최외곽의 멤버 벡터 인덱스를 찾고 다시 최외곽 멤버 벡터에서 가장 먼 멤버 벡터 인덱스를 찾음으로써 분할할 방향을 인위적으로 지정해 주는 개선된 군집화 알고리즘을 제안한다. 실험 결과, 제안된 방식을 적용한 화자확인 시스템이 기존의 LBG알고리즘을 사용한 시스템보다 오거부율(FR)은 3.165%, 오수락율 (FA)는 0.06%씩 각각 향상 되었다.

Keywords

References

F. K. Soong, A. E. Rosenberg, L. R. Rabiner, and B. H. Juang, 'A vector quantization approach to speaker recognition', Proc. ICASSP'85, pp.387-390, March 1985
Yoseph Linde, Andres Buzo, Robert M. Gray, 'An Algorithm for Vector Quantizer Design'. IEEE Trans. Communications, vol. COM-28, pp. 84 - 95, January 1980 https://doi.org/10.1109/TCOM.1980.1094577
Allen Gersho and Robert M. Gray, Vector Quantization and Signal Compression, Kluwer Academic Publishers, 1992
M. R. Anderberg, Cluster Analysis for Applications, Academic, New York, 1973
진세훈, 이재희, 강철호, '화자확인 시스템을 위한 적응적 모델 갱신과 사전 문턱치 결정에 관한 연구', 한국 음향학회지, 19(5), pp.20-26, 2000년 7월

The Journal of the Acoustical Society of Korea (한국음향학회지)

A Study on Modified Clustering Algorithm for Text-Dependent Speaker Verification System

문장종속 화자확인 시스템을 위한 개선된 군집화 알고리즘에 관한 연구

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)