
Performance Improvement of Fast Speaker Adaptation Based on Dimensional Eigenvoice and Adaptation Mode Selection  

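송화전 (Department of Electronics Engineering, Pusan National University)
이윤근 (Voiceware Co., Ltd.)
김형순 (Department of Electronics Engineering, Pusan National University)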
Abstract
The eigenvoice method is known to be well suited to fast speaker adaptation, but it shows little additional improvement as the amount of adaptation data increases. To address this problem, we propose a modified method that estimates the weights of the eigenvoices in each feature vector dimension. We also propose an adaptation mode selection scheme in which the best-performing method among several adaptation methods is selected according to the amount of adaptation data. We used the POW DB to construct the speaker-independent model and the eigenvoices. From the PBW 452 DB, 1 to 50 utterances were used for adaptation and the remaining 400 utterances for evaluation. As the amount of adaptation data increased, the proposed dimensional eigenvoice method outperformed both the conventional eigenvoice method and MLLR. Adaptation mode selection between the eigenvoice and dimensional eigenvoice methods reduced the word error rate by up to 26% compared with the conventional eigenvoice method.
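The difference between shared and per-dimension eigenvoice weights can be illustrated with a small sketch. It is not the authors' estimator: it replaces the maximum-likelihood weight estimation with a plain least-squares fit (which assumes unit variances and equal occupancy for every Gaussian), and the names `solve`, `fit_weights`, `adapt`, and the toy supervectors are all hypothetical. The `target` supervector stands in for statistics that a real system would accumulate from adaptation utterances.

```python
# Sketch only: least-squares stand-in for maximum-likelihood eigenvoice
# weight estimation (assumes unit variances and equal occupancy).

def solve(a, b):
    """Solve the small linear system a x = b by Gauss-Jordan elimination.

    Raises ZeroDivisionError if the system is singular.
    """
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_weights(delta, eigvecs):
    """Weights w minimizing ||delta - sum_k w[k] * eigvecs[k]||^2."""
    gram = [[sum(x * y for x, y in zip(u, v)) for v in eigvecs] for u in eigvecs]
    rhs = [sum(x * y for x, y in zip(u, delta)) for u in eigvecs]
    return solve(gram, rhs)


def adapt(si_mean, eigvecs, target, n_dims, per_dimension):
    """Return an adapted mean supervector.

    Supervectors are flat lists laid out Gaussian by Gaussian, so element i
    belongs to feature dimension i % n_dims.
    """
    delta = [t - s for t, s in zip(target, si_mean)]
    adapted = list(si_mean)
    if per_dimension:
        # One weight vector per feature dimension.
        groups = [list(range(d, len(delta), n_dims)) for d in range(n_dims)]
    else:
        # One weight vector shared by the whole supervector.
        groups = [list(range(len(delta)))]
    for idx in groups:
        sub_eig = [[e[i] for i in idx] for e in eigvecs]
        w = fit_weights([delta[i] for i in idx], sub_eig)
        for i in idx:
            adapted[i] += sum(wk * e[i] for wk, e in zip(w, eigvecs))
    return adapted


# Toy data: 3 Gaussians, 2 feature dimensions, 1 eigenvoice.
si_mean = [0.0] * 6
eigvecs = [[1.0] * 6]
target = [2.0, -1.0, 2.0, -1.0, 2.0, -1.0]  # dim 0 shifts by +2, dim 1 by -1

conventional = adapt(si_mean, eigvecs, target, n_dims=2, per_dimension=False)
dimensional = adapt(si_mean, eigvecs, target, n_dims=2, per_dimension=True)
```

In this toy case the shared weight can only move every element by the same amount, while the per-dimension weights match the two shifts exactly. The per-dimension variant has more free parameters (one weight vector per feature dimension), which is consistent with the abstract's observation that it benefits from larger amounts of adaptation data.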
Keywords
Speaker adaptation; Fast speaker adaptation; Eigenvoice; MLLR; Speech recognition