POSTTS : Corpus Based Korean TTS based on Natural Language Analysis

POSTTS : 자연어 분석을 통한 코퍼스 기반 한국어 TTS

  • 하주홍 (포항공과대학교 컴퓨터공학과) ;
  • 정옥 (포항공과대학교 컴퓨터공학과) ;
  • 김병창 (위덕대학교 컴퓨터 멀티미디어공학부) ;
  • 이근배 (포항공과대학교 컴퓨터공학과)
  • Published : 2003.05.01

Abstract

In order to produce high quality synthesized speech, it is very important to get an accurate grapheme-to-phoneme conversion and prosody model from texts using natural language processing. Robust preprocessing for non-Korean characters should also be required. In this paper, we analyzed Korean texts using a morphological analyzer, part-of-speech tagger and syntactic chunker. We present a new grapheme-to-phoneme conversion method, i.e. a dictionary-based and rule-based hybrid method, for unlimited vocabulary Korean TTS. We constructed a prosody model using a probabilistic method and decision tree-based method.

Keywords