Control of Duration Model Parameters in HMM-based Korean Speech Synthesis

HMM 기반의 한국어 음성합성에서 지속시간 모델 파라미터 제어

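  • 김일환 (School of Electrical Engineering and Computer Science, Kyungpook National University) ;
  • 배건성 (School of Electrical Engineering and Computer Science, Kyungpook National University)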
  • Published : 2008.12.30


HMM-based text-to-speech systems (HTS) have been widely studied because, compared with corpus-based unit-concatenation text-to-speech systems, they require less memory and lower computational complexity, which makes them suitable for embedded systems. They also have the advantage that the voice characteristics and speaking rate of the synthetic speech can be changed easily by modifying the HMM parameters appropriately. We implemented an HMM-based Korean text-to-speech system using a small Korean speech database and propose a method to increase the naturalness of the synthetic speech by controlling the duration model parameters of that system. We performed a paired comparison test to verify that the proposed method is effective. The test yielded a preference score of 73.8%, indicating that controlling the duration model parameters improves the naturalness of the synthetic speech.
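The abstract does not spell out how the duration model parameters are controlled. As background only, the minimal Python sketch below shows the duration rule commonly described in the HMM-based synthesis literature, where each state duration is drawn from a Gaussian and set to d_k = m_k + rho * sigma_k^2, with rho chosen to hit a target total length. This is an assumption about the general framework, not the authors' method; the function names and the example means and variances are hypothetical.

```python
def rho_for_total(means, variances, total_frames):
    """Solve sum(m_k + rho * var_k) = total_frames for rho."""
    return (total_frames - sum(means)) / sum(variances)


def state_durations(means, variances, rho=0.0):
    """Per-state durations in frames: d_k = m_k + rho * var_k.

    rho = 0 gives the mean durations; rho > 0 slows speech down and
    rho < 0 speeds it up. States with larger variance absorb more of
    the change. Each duration is rounded and kept at one frame or more.
    """
    return [max(1, int(round(m + rho * v))) for m, v in zip(means, variances)]


# Hypothetical duration statistics for a three-state model (in frames).
means = [3.0, 5.0, 4.0]
variances = [2.0, 4.0, 2.0]

rho = rho_for_total(means, variances, total_frames=16)
durations = state_durations(means, variances, rho)
```

Because the stretch is weighted by variance, states whose durations varied most in the training data change the most, while stable states stay close to their mean.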