[KSCI] Korea Science Citation Index Service

Statistical Approaches to Convert Pitch Contour Based on Korean Prosodic Phrases

Lee, Ki-Young (Dept. of Information Communication Engineering, Kwandong University)

Publication Information

The Journal of the Acoustical Society of Korea, v.23, no.1E, 2004, pp. 10-15

Abstract

In speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speaker's utterance be converted into that of the target speaker, because the pitch contour of an utterance plays an important role in expressing the speaker's individuality and the meaning of the utterance. This paper describes statistical algorithms for pitch contour conversion in Korean. Pitch contour conversion is investigated at two levels of prosodic phrase: the intonational phrase and the accentual phrase. The basic algorithm is Gaussian normalization [7] within the intonational phrase. The first proposed algorithm combines Gaussian normalization with a declination line of the pitch contour in an intonational phrase. The second applies Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms, which operate at the intonational phrase level.
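The abstract does not give the conversion formula, so the following is only a minimal sketch of Gaussian normalization as it is commonly formulated: each source pitch value is shifted and scaled so that the source mean and standard deviation map onto the target's. The function names, the per-phrase statistics layout, and the use of pitch in Hz (rather than log pitch) are assumptions made for illustration, not details taken from the paper. The second function shows how the same mapping might be applied phrase by phrase, as in the accentual-phrase variant.

```python
import numpy as np


def gaussian_normalize(f0_src, src_mean, src_std, tgt_mean, tgt_std):
    """Shift and scale source pitch values to match the target mean and spread.

    Assumes pitch is given in Hz for voiced frames only (illustrative choice).
    """
    f0_src = np.asarray(f0_src, dtype=float)
    return (f0_src - src_mean) / src_std * tgt_std + tgt_mean


def convert_by_phrase(f0_src, boundaries, src_stats, tgt_stats):
    """Apply Gaussian normalization separately inside each prosodic phrase.

    boundaries: list of (start, end) frame indices, end exclusive (hypothetical layout)
    src_stats, tgt_stats: one (mean, std) pair per phrase (hypothetical layout)
    """
    f0_src = np.asarray(f0_src, dtype=float)
    converted = f0_src.copy()
    for (start, end), (s_mu, s_sd), (t_mu, t_sd) in zip(boundaries, src_stats, tgt_stats):
        converted[start:end] = gaussian_normalize(
            f0_src[start:end], s_mu, s_sd, t_mu, t_sd
        )
    return converted


# Whole-phrase conversion with a single set of statistics.
whole = gaussian_normalize([100.0, 120.0, 140.0], 120.0, 20.0, 200.0, 40.0)

# Phrase-by-phrase conversion with separate statistics for two phrases.
local = convert_by_phrase(
    [100.0, 140.0, 200.0, 220.0],
    boundaries=[(0, 2), (2, 4)],
    src_stats=[(120.0, 20.0), (210.0, 10.0)],
    tgt_stats=[(200.0, 40.0), (300.0, 20.0)],
)
```

Using separate statistics per phrase lets the mapping follow local rises and falls that a single utterance-level mean and variance would average out, which is the motivation the abstract gives for the accentual-phrase variant.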

Keywords

Speech conversion; Pitch contour; Gaussian normalization; Intonation phrases; Accentual phrases

Citations & Related Records

Times Cited by KSCI: 1

References

1	H. Kuwabara, Y. Sagisaka, 'Acoustic Characteristics of Speaker Individuality: Control and Conversion', Speech Communication, 16, pp. 165-173, 1995
2	L. M. Arslan, D. Talkin, 'Speaker Transformation using Sentence HMM based Alignments and Detailed Prosody Modification', Proc. ICASSP'98, 1, pp. 289-292, 1998
3	M. Nespor, I. Vogel, Prosodic Phonology, Dordrecht: Foris Publications
4	Jun, Sun-Ah, The Phonetics and Phonology of Korean Prosody, Ph.D. Dissertation, The Ohio State University, 1993
5	A. Kain, M. W. Macon, 'Spectral Voice Conversion for Text-To-Speech Synthesis', Proc. ICASSP'98, 1, pp. 285-288, 1998
6	K. Y. Lee, M. S. Song, 'Automatic Detection of Korean Accentual Phrase Boundaries', The Journal of the Acoustical Society of Korea, 18(1E), pp. 27-31, 1999
7	E. Moulines, F. Charpentier, 'Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones', Speech Communication, 9(5,6), pp. 453-467, 1990
8	M. Akagi, T. Ienaga, 'Speaker Individualities in Fundamental Frequency Contours and Its Control', Proc. EuroSpeech'95, pp. 439-442, Sep. 1995
9	J. P. H. van Santen, 'Prosodic Modeling in Text-to-Speech Synthesis', Proc. EuroSpeech'97, KN 19-KN 28, 1997
10	Y. J. Kim, H. J. Byeon, Y. H. Oh, 'Prosodic Phrasing in Korean; Determine Governor, and then Split or Not', Proc. EuroSpeech'99, pp. 539-542, 1999
11	D. T. Chappell, J. H. L. Hansen, 'Speaker Specific Pitch Contour Modeling and Modification', Proc. ICASSP'98, 1, pp. 885-888, 1998


Korean title: 한국어 운율구 기반의 피치궤적 변환의 통계적 접근
