음성과학 (Speech Sciences), Vol. 8, No. 4, pp. 35-43, 2001, pISSN 1226-5276
혼합 위상 정보를 이용한 TTS 합성음 생성 알고리즘
Speech Synthesis Algorithm Using Mixed Phase Information for TTS Systems
Abstract
A high-quality TTS system requires speech synthesis algorithms that allow flexible prosody modification, especially of F0. TD-PSOLA, the most popular synthesis algorithm, produces very high quality speech when F0 modification is limited. However, the quality degradation caused by pitch epoch detection errors becomes severe as the F0 modification factor grows. The vocoder framework, by contrast, is very flexible in F0 manipulation, but its synthesized speech is far from natural human speech and suffers from buzziness. To remedy this buzzy quality and produce more natural synthetic speech, we propose a mixed phase vocoder.