Voice conversion using low dimensional vector mapping

  • Lee, Kee-Seung (이기승) (Center for Signal Processing Research, Yonsei Univ.) ;
  • Doh, Won (도원) (Center for Signal Processing Research, Yonsei Univ.) ;
  • Youn, Dae-Hee (윤대희) (Center for Signal Processing Research, Yonsei Univ.)
  • Published : 1998.04.01

Abstract

In this paper, we propose a voice personality transformation method that makes one person's voice sound like another person's voice. To transform the voice personality, the vocal tract transfer function is used as the transformation parameter. Compared with previous methods, the proposed method obtains high-quality transformed speech with low computational complexity. Conversion between vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, the mean LPC cepstrum coefficients and the mean-removed LPC cepstrum, modeled by a low-dimensional vector, are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male speakers and one female speaker. These rules are then applied to 9 sentences uttered by the same speakers, and objective evaluation and subjective listening tests are performed on the transformed speech.
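
As a rough illustration of what a linear mapping based on soft clustering can look like, the sketch below blends several per-cluster linear maps according to soft cluster memberships. The abstract does not specify the weighting function, the training procedure, or the feature dimensions, so everything here is an assumption made for illustration: the softmax-over-distance weights, the `temperature` parameter, and the names `soft_weights`, `mat_vec`, and `convert` are hypothetical and are not taken from the paper.

```python
import math


def soft_weights(x, centroids, temperature=1.0):
    """Soft cluster memberships (assumed form): softmax over negative squared distances."""
    scores = [
        -sum((xi - ci) ** 2 for xi, ci in zip(x, c)) / temperature
        for c in centroids
    ]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mat_vec(A, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]


def convert(x, centroids, matrices, offsets, temperature=1.0):
    """Blend the per-cluster linear maps A_k x + b_k by soft membership."""
    weights = soft_weights(x, centroids, temperature)
    y = [0.0] * len(offsets[0])
    for w, A, b in zip(weights, matrices, offsets):
        mapped = mat_vec(A, x)
        for i in range(len(y)):
            y[i] += w * (mapped[i] + b[i])
    return y


# Toy usage: two clusters in a 2-dimensional feature space (made-up numbers).
identity = [[1.0, 0.0], [0.0, 1.0]]
centroids = [[1.0, 0.0], [-1.0, 0.0]]
matrices = [identity, identity]
offsets = [[1.0, 1.0], [3.0, 3.0]]
converted = convert([0.0, 0.0], centroids, matrices, offsets)
```

In this toy setup the input lies midway between the two centroids, so both maps contribute equally. In an actual system the input would be a low-dimensional representation of the source speaker's cepstral features, and the centroids, matrices, and offsets would be estimated from paired training utterances.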

Keywords