Design and Implementation of a Text-to Speech System using the Prosody and Duration Information

Yang, Jin-Seok;Kim, Jae-Beom;Lee, Jeong-Hyeon;

The Transactions of the Korea Information Processing Society (한국정보처리학회논문지)

Volume 3 Issue 5
/
Pages.1121-1129
/
1996
/
1226-9190(pISSN)

Korea Information Processing Society (한국정보처리학회)

Design and Implementation of a Text-to Speech System using the Prosody and Duration Information

운율 및 길이 정보를 이용한 무제한 음성 합성기의 설계 및 구현

Yang, Jin-Seok ;
Kim, Jae-Beom ;
Lee, Jeong-Hyeon (Dept.of Computer Science Engineering, Inha University)

양진석 (인하대학교 전자계산공학과) ;
김재범 (인하대학교 전자계산공학과) ;
이정현 (인하대학교 전자계산공학과)

Published : 1996.09.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

To produce more natural speech in a Text-to-Speech system, the processing of the prosody and duration must be processing in advance, and then extracted the prosody and duration information by means of trial-and-error experiments. In this paper, a method is proposed to improve the naturalness in a Text-to Speech system using this information. As the results, the Text-to-Speech system proposed and implemented in this paper showed more natural speech synthesis than the systems, which do not use this information, did.

Text-to-Speech 시스템에서 자연스럽게 음성을 합성하기 위해서는 운율과 길이 에 대한 처리가 선행되어야 한다. 이를 위해서, 자연어 처리에 의해 분석된 문장들에 대해 억양 규칙을 적용한 후, 반복적인 실험을 통해 운율 및 길이 정보를 추출하였다. 본 논문에서는 이러한 정보를 이용하여 Text-to-Speech 시스템에서 자연성을 향상 시 킬 수 있는 방법을 제안한다. 실험 결과, 본 논문에서 제안하고 구현한 무제한 Text- to-Speech 시스템이 이러한 정보들을 사용하지 않는 시스템과 비교해서 더 자연스럽게 문장들을 합성해 낸다는 것을 보였다.

The Transactions of the Korea Information Processing Society (한국정보처리학회논문지)

Design and Implementation of a Text-to Speech System using the Prosody and Duration Information

운율 및 길이 정보를 이용한 무제한 음성 합성기의 설계 및 구현

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)