Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech

Hong, Mun-Ki; Shin, Ji-Young; Kang, Sun-Mee

Speech Sciences (음성과학)

Vol. 10, No. 2 / pp. 7-25 / 2003 / pISSN 1226-5276

Korean Society of Speech Sciences (한국음성학회)

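Korean title: 한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리 (Separation of Voiced and Unvoiced Sounds in Synthesis Units to Improve the Performance of a Korean Speech Synthesizer)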

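Hong, Mun-Ki (Department of Computer Science, Seokyeong University);
Shin, Ji-Young (Department of Korean Language and Literature, Korea University);
Kang, Sun-Mee (Department of Computer Science, Seokyeong University)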

Published: 2003-06-01

Abstract

Accurate prediction of prosodic elements is a key factor in improving the quality of synthesized speech. These elements include breaks, pitch, duration, and loudness. Pitch, which is realized as the fundamental frequency (F0), is the most important of them for synthesis quality. The previous method for predicting F0, however, has several problems. When voiced and unvoiced sounds are not classified correctly, the result is incorrect pitch prediction, selection of the wrong triphone unit when synthesizing voiced and unvoiced sounds, and audible clicks or vibration. These errors typically occur at transitions from a voiced sound to an unvoiced sound or from an unvoiced sound to a voiced sound. The problem cannot be resolved by grammar-based methods, and it strongly affects the synthesized sound. To obtain correct pitch values reliably, this paper therefore proposes a new model that predicts and classifies voiced and unvoiced sounds using the CART tool.
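To make the idea of CART-based voiced/unvoiced classification concrete, the following is a minimal sketch of a classification tree grown with Gini impurity in plain Python. It is not the authors' model: the abstract does not state which features or which CART implementation were used, so the feature set (phone class of the previous, current, and next segment), the class names, and the toy training rows are all assumptions made for illustration only.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 for an empty or pure list)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Find the test `row[f] == v` with the lowest weighted Gini impurity.

    Returns (f, v), or None when no test improves on the unsplit node.
    """
    best, best_score = None, gini(labels)
    n = len(rows)
    for f in range(len(rows[0])):
        for v in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] == v]
            right = [y for r, y in zip(rows, labels) if r[f] != v]
            if not left or not right:
                continue  # the test does not separate anything
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score - 1e-12:
                best, best_score = (f, v), score
    return best

def build_tree(rows, labels):
    """Grow a tree recursively. Leaves are labels, inner nodes are tuples."""
    split = best_split(rows, labels)
    if split is None:
        return Counter(labels).most_common(1)[0][0]  # majority-label leaf
    f, v = split
    yes = [(r, y) for r, y in zip(rows, labels) if r[f] == v]
    no = [(r, y) for r, y in zip(rows, labels) if r[f] != v]
    return (f, v,
            build_tree([r for r, _ in yes], [y for _, y in yes]),
            build_tree([r for r, _ in no], [y for _, y in no]))

def classify(tree, row):
    """Walk from the root to a leaf and return the leaf label."""
    while isinstance(tree, tuple):
        f, v, yes, no = tree
        tree = yes if row[f] == v else no
    return tree

# Hypothetical toy data: (previous class, current class, next class) -> V/U.
# These rows are invented for illustration and are not taken from the paper.
ROWS = [
    ("vowel", "lenis", "vowel"),
    ("nasal", "lenis", "vowel"),
    ("pause", "lenis", "vowel"),
    ("vowel", "aspirated", "vowel"),
    ("nasal", "aspirated", "vowel"),
    ("pause", "aspirated", "vowel"),
    ("pause", "nasal", "vowel"),
    ("vowel", "nasal", "vowel"),
]
LABELS = ["V", "V", "U", "U", "U", "U", "V", "V"]

tree = build_tree(ROWS, LABELS)
print(classify(tree, ("vowel", "lenis", "vowel")))
print(classify(tree, ("pause", "lenis", "vowel")))
```

In this sketch the same segment class receives different labels depending on its neighbours, which is the kind of context-dependent decision a tree can learn from a corpus. A production system would more plausibly rely on an established CART toolkit and a far richer feature set than this hand-written tree.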
