Voice Conversion Using Linear Multivariate Regression Model and LP-PSOLA Synthesis Method

;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 20 Issue 3
/
Pages.15-23
/
2001
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

Voice Conversion Using Linear Multivariate Regression Model and LP-PSOLA Synthesis Method

선형다변회귀모델과 LP-PSOLA 합성방식을 이용한 음성변환

권홍석 (경북대학교 전자ㆍ전기공학부) ;
배건성 (경북대학교 전자ㆍ전기공학부)

Published : 2001.04.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This paper presents a voice conversion technique that modifies the utterance of a source speaker as if it were spoken by a target speaker. Feature parameter conversion methods to perform the transformation of vocal tract and prosodic characteristics between the source and target speakers are described. The transformation of vocal tract characteristics is achieved by modifying the LPC cepstral coefficients using Linear Multivariate Regression (LMR). Prosodic transformation is done by changing the average pitch period between speakers, and it is applied to the residual signal using the LP-PSOLA scheme. Experimental results show that transformed speech by LMR and LP-PSOLA synthesis method contains much characteristics of the target speaker.

본 논문에서는 임의의 사람이 발성한 음성을 마치 다른 사람이 발성한 것처럼 들리도록 하는 음성변환 기술에 대하여 설명하고, 화자간의 성도 특성과 여기신호 특성 파라미터 변환을 독립적으로 수행하기 위한 변환방법을 실험한다. 성도 특성 파라미터 변환은 입력되는 음성신호에서 LPC (Linear Predictive Cofficient)켑스트럼을 추출하여 선형다변회귀모델에 적용하여 수행하고, 여기신호 특성 파라미터 변환은 잔차신호를 추출하여 LP-PSOLA (Linear Predictive-Pitch Synchronous Overlap and Add) 합성방식을 이용한 화자간의 평균 피치주기 변환으로 수행된다. 실험결과는 선형다변회귀모델과 LP-PSOLA 합성방식을 이용하여 변환된 음성이 대상화자의 음성에 유사함을 보여준다

Keywords

References

Speech Communication v.16 no.2 Acoustic characteristics of speaker individuality: Control and conversion Hisao Kuwabara;Yoshinori Sagisaka
ICASSP'88 Voice conversion through vector quantization Masanobu Abe;Satoshi Nakamura;Kiyohiro Shikano;Hisao Kuwabara
Speech Communication v.11 no.2;3 Voice transformation using PSOLA technique H. Valbret;E. Moulines;J. P. Tubach
Applied Numerical methods Brice carnahan;H. A. Luther;James O. Wilkes
Speech coding and synthesis W. B. Kleijn;K. K. Paliwal
IEEE Trans. Commun. v.28 no.1 An algorithm for vector quantizer design Y. Linde;A. Buzo;R. M. Gray
Discrete-time signal processing Anal V. Oppenheim;Roland W. Schafer
한국음향학회 정기총회 및 학술발표회 논문집 v.16 no.2(s) 유성음의 잔류신호 변환을 이용한 음색 변환 최철민;전범기;성굉모

The Journal of the Acoustical Society of Korea (한국음향학회지)

Voice Conversion Using Linear Multivariate Regression Model and LP-PSOLA Synthesis Method

선형다변회귀모델과 LP-PSOLA 합성방식을 이용한 음성변환

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)