Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

Oh, Seung-Shin;Kim, Sang-Hun;

ETRI Journal

Volume 28 Issue 6
/
Pages.807-810
/
2006
/
1225-6463(pISSN)
/
2233-7326(eISSN)

Electronics and Telecommunications Research Institute (한국전자통신연구원)

Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

Oh, Seung-Shin (Embedded Software Research Division, ETRI) ;
Kim, Sang-Hun (Embedded Software Research Division, ETRI)

Received : 2006.06.21
Published : 2006.12.31

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This letter presents a prediction model for sentence-final intonations for Korean conversational-style text-to-speech systems in which we introduce the linguistic feature of 'modality' as a new parameter. Based on their function and meaning, we classify tonal forms in speech data into tone types meaningful for speech synthesis and use the result of this classification to build our prediction model using a tree structured classification algorithm. In order to show that modality is more effective for the prediction model than features such as sentence type or speech act, an experiment is performed on a test set of 970 utterances with a training set of 3,883 utterances. The results show that modality makes a higher contribution to the determination of sentence-final intonation than sentence type or speech act, and that prediction accuracy improves up to 25% when the feature of modality is introduced.

ETRI Journal

Modality-Based Sentence-Final Intonation Prediction for Korean Conversational-Style Text-to-Speech Systems

Abstract

Keywords