Automatic Synthesis Method Using Prosody-Rich Database

Proceedings of the Acoustical Society of Korea Conference (한국음향학회:학술대회논문집)

1998.08a / Pages 87-92 / 1998

The Acoustical Society of Korea (한국음향학회)

대용량 운율 음성데이타를 이용한 자동합성방식

Kim Sang-hun (김상훈), Electronics and Telecommunications Research Institute (ETRI, 한국전자통신연구원)

Published : 1998.08.01

Abstract

Synthesis unit databases have conventionally been constructed by recording isolated words. In that case, each word boundary carries a typical prosodic pattern, such as falling intonation or pre-boundary lengthening. To obtain natural synthetic speech from such a database, the original speech must be artificially modified. However, this modification tends to produce unnatural, unintelligible synthetic speech because of the excessive prosodic changes imposed on the speech signal. To overcome these problems, we collected thousands of sentences for the synthesis database. To build phone-level synthesis units, we trained a speech recognizer on the recorded speech and then segmented phone boundaries automatically. In addition, we used a laryngograph for epoch detection. From the automatically generated synthesis database, we chose the best phone for each target and concatenated the chosen phones directly, without any prosody processing. To select the best phone among multiple candidates, we used prosodic information such as the break strength of word boundaries, phonetic context, cepstrum, pitch, energy, and phone duration. A pilot test yielded some positive results.
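
The selection step described in the abstract can be illustrated with a generic unit-selection search. The sketch below is not the paper's algorithm: the abstract names the features but gives no cost function, so the `Target` and `Unit` records, the two cost functions, and every weight are assumptions chosen only for illustration. Cepstral distance and duration are left out of the costs to keep the example short. A dynamic-programming pass picks one recorded phone per target position so that the summed context mismatch and the pitch and energy jumps at each join are as small as possible.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Target:
    """Desired phone with its context (hypothetical record layout)."""
    phone: str
    left: str    # phone expected before it
    right: str   # phone expected after it
    brk: int     # desired break strength at the nearest word boundary


@dataclass(frozen=True)
class Unit:
    """One automatically segmented phone from the recorded sentences."""
    phone: str
    left: str
    right: str
    pitch: float     # mean F0 in Hz
    energy: float    # mean energy in dB
    duration: float  # seconds (stored, but unused by the costs below)
    brk: int


def target_cost(t: Target, u: Unit) -> float:
    # Assumed weights: 1.0 per context mismatch, 0.5 per break-level step.
    return (t.left != u.left) + (t.right != u.right) + 0.5 * abs(t.brk - u.brk)


def join_cost(a: Unit, b: Unit) -> float:
    # Assumed scaling: penalize pitch and energy jumps at the join.
    return abs(a.pitch - b.pitch) / 50.0 + abs(a.energy - b.energy) / 10.0


def select_units(targets, database):
    """Return the lowest-cost sequence of units, one per target."""
    lattice = [[u for u in database if u.phone == t.phone] for t in targets]
    if any(not cands for cands in lattice):
        raise ValueError("no candidate for at least one target phone")

    # best[i][j] = (cumulative cost, index of predecessor in column i-1)
    best = [[(target_cost(targets[0], u), None) for u in lattice[0]]]
    for i in range(1, len(targets)):
        row = []
        for u in lattice[i]:
            cost, back = min(
                (best[i - 1][k][0] + join_cost(p, u), k)
                for k, p in enumerate(lattice[i - 1])
            )
            row.append((cost + target_cost(targets[i], u), back))
        best.append(row)

    # Trace back from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path = []
    for i in range(len(targets) - 1, -1, -1):
        path.append(lattice[i][j])
        j = best[i][j][1]
    path.reverse()
    return path
```

Because the chosen units are concatenated without prosody modification, the join cost is the only place where discontinuities are controlled, which is why a search over whole sequences is used here rather than picking the best candidate at each position independently.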
