Construction of Korean Speech DB for Common Use and Implementation of Workbench for Spoken Language Data Acquisition

공동이용을 위한 음성DB의 구축 및 음성 자료 수집을 위한 Workbench의 구현

  • Published : 1998.12.01

Abstract

This study describes a Korean speech database designed and constructed for common use, focusing in particular on the design of word and sentence lists that cover a variety of phonological environments. As a result, PBW (Phonetically Balanced Words) and PBS (Phonetically Balanced Sentences) were selected from a balanced text corpus using a maximum entropy method. The paper also presents a workbench implemented for spoken language data acquisition. The workbench consists of a grapheme-to-phoneme converter, an utterance list selection module, a speech data editing module, a multi-layer labelling module, and a phoneme context search module.
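The abstract does not spell out how the maximum entropy selection works, so the following is only a minimal sketch of one common reading: greedily add the candidate whose phonetic units raise the entropy of the pooled unit distribution the most. The function names `entropy` and `select_balanced`, the candidate format (a name mapped to a list of unit symbols), and the greedy strategy itself are assumptions made for illustration, not the authors' procedure.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (bits) of a unit-count distribution."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in counts.values() if c > 0)


def select_balanced(candidates, k):
    """Greedily pick up to k candidates that maximize pooled entropy.

    candidates: dict mapping a word/sentence id to its list of phonetic
    units (hypothetical format; the units could be phonemes, diphones, etc.).
    Returns the selected ids and the entropy of their pooled units.
    """
    selected = []
    pooled = Counter()
    remaining = dict(candidates)
    for _ in range(min(k, len(remaining))):
        best_name, best_h = None, -1.0
        for name, units in remaining.items():
            # Entropy of the pool if this candidate were added.
            h = entropy(pooled + Counter(units))
            if h > best_h:
                best_name, best_h = name, h
        selected.append(best_name)
        pooled.update(remaining.pop(best_name))
    return selected, entropy(pooled)


# Toy example with romanized placeholder units (not real corpus data).
candidates = {
    "w1": ["k", "a", "k", "a"],
    "w2": ["m", "i"],
    "w3": ["k", "a"],
}
selected, h = select_balanced(candidates, 2)
print(selected, round(h, 3))
```

In this toy run the second pick favors the candidate that introduces unseen units over the one that repeats units already in the pool, which is the behavior a phonetically balanced list is after. A real selection would also need coverage constraints and a far larger candidate pool.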

Keywords