Performance Improvement of Rapid Speaker Adaptation Using Bias Compensation and Mean of Dimensional Eigenvoice Models

;;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 23 Issue 5
/
Pages.383-389
/
2004
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

Performance Improvement of Rapid Speaker Adaptation Using Bias Compensation and Mean of Dimensional Eigenvoice Models

바이어스 보상과 차원별 Eigenvoice 모델 평균을 이용한 고속화자적응의 성능향상

박종세 (부산대학교 전자공학과) ;
김형순 (부산대학교 전자공학) ;
송화전 (부산대학교 전자공학과)

Published : 2004.07.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper. we propose the bias compensation methods and the eigenvoice method using the mean of dimensional eigenvoice to improve the performance of rapid speaker adaptation based on eigenvoice under mismatch between training and test environment. Experimental results for vocabulary-independent word recognition task (using PBW 452 DB) show that the proposed methods yield improvements for small adaptation data. We obtained about 22∼30% relative improvement by the bias compensation methods as amount of adaptation data varied from 1 to 50, and obtained 41% relative improvement in error rate by the eigenvoice method using the mean of dimensional eigenvoice with only single adaptation word.

본 논문에서는 훈련 및 인식 환경이 다른 상황에서 eigenvoice 기반 고속화자적응의 성능향상을 위하여 바이어스 보상을 적용한 eigenvoice 적응방식과 차원별 eigenvoice 모델 평균 가중합 방식을 제안하였다. PBW 452 DB를 사용한 어휘독립 단어인식 실험 결과에서 적은 양의 적응데이터를 사용했을 때 제안된 방식이 기존의 eigenvoice 방식에 비하여 많은 성능향상을 얻을 수 있었다. 적응단어 수를 1개에서 50개로 변경시키면서 바이어스 보상을 적용한 eigenvoice 적응방식을 사용한 경우 기존 eigenvoice 방식보다 단어 오인식률이 약 22∼30% 감소하였다. 또한 차원별 eigenvoice 모델 평균을 이용한 eigenvoice 적응방식에서는 1개의 단어를 적응데이터로 사용했을 경우에 기존 eigenvoice 방식보다 단어 오인식률이 최고 41%까지 감소하였다.

Keywords

References

IEEE Trans. Signal Processing v.39 no.4 A study on speaker adaptation of the parameters of continuous density hidden Markov models C. H. Lee;C. H. Lin;B. H. Juang https://doi.org/10.1109/78.80902
Computer Speech and Language v.9 no.1 Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov moedls C. J. Leggetter;P. C. Woodland https://doi.org/10.1006/csla.1995.0010
Proc. ICSLP v.5 Eigenvoices for speaker adaptation R. Kuhn;P. Nguyen;J. C. Jungua;L. Goldwasser;N. Niedzielski;S. Finche;K. Field; M. Contolini
IEEE Trans. Speech and Audio Processing v.8 no.6 Rapid speaker adaptation in eigenvoice space R. Kuhn;J. C. Jungua;P. Nguyen;N. Niedzielski https://doi.org/10.1109/89.876308
Proc. Eurospeech v.2 Segmental eigenvoice for rapid speaker adaptation Y. Tsao;S. M. Lee;F. C. Chou;L. S. Lee
한국음향학회지 v.22 no.1 차원별 Eigenvoice와 화자적응 모드 선택에 기반한 고속화자적응 성능 향상 송화전;이윤근;김형순
IEEE Trans. Speech and Audio Processing v.4 no.3 A maximum-likelihood approach to stochastic matching for robust speech recognition A. Sanker https://doi.org/10.1109/89.496215
Proc. ICASSP v.1 Implementation of the POW (Phonetically Optimized Words) algorithm for speech database Y. Lim;Y. Lee
위탁과제 최종연구보고서 연속음성인식을 위한 음성 단위 발음사전 구성방법 연구 유재원
제 12회 음성통신 및 신호처리워크샵 논문집 음성 DB용 PBW에 관한 검토 이용주;김봉완;김종진;양옥렬;임선영

The Journal of the Acoustical Society of Korea (한국음향학회지)

Performance Improvement of Rapid Speaker Adaptation Using Bias Compensation and Mean of Dimensional Eigenvoice Models

바이어스 보상과 차원별 Eigenvoice 모델 평균을 이용한 고속화자적응의 성능향상

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)