Browse > Article

Statistical Approaches to Convert Pitch Contour Based on Korean Prosodic Phrases  

Lee, Ki-Young (Dept. of Information Communication Engineering, Kwandong University)
Abstract
In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speakers utterance be converted into that of the target speaker, because pitch contour of a speech utterance plays an important role in expressing speaker's individuality and meaning of the utterance. This paper describes statistical algorithms of pitch contour conversion for Korean language. Pitch contour conversions are investigated at two 1 evels of prosodic phrases: intonational phrase and accentual phrase. The basic algorithm is a Gaussian normalization [7] in intonational phrase. The first presented algorithm is combined with a declination-line of pitch contour in an intonational phrase. The second one is Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms in intonational phrase.
Keywords
Speech conversion; Pitch contour; Goussian normalization; Intonation phrases; Accentual phrases;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 H. Kuwabara, Y. Sagisaka, 'Acoustic Characteristics of Speaker Individuality: Control and Conversion', Speech Communication, 16, pp.165-173, 1995   DOI   ScienceOn
2 L. M. Arslan, D. Talkin,'Speaker Transformation using Sentence HMM based Alignments and Detailed Prosody Modification', Proc, ICASSP'98, 1, pp. 289-292, 1998
3 M. Nespor, I. Vogel, Prosodic Phonology, Dordrecht : Faris Publication
4 Jun, Sun-Ah, The Phonetics and Phonology of Korean Prosody, Ph. D. Dissertation, The Ohio State University, 1993
5 A. Kain, M.w. Macon,'Spectral Voice Conversion for Text-To-Speech Synthesis', Proc, ICASSP'98, 1, pp. 285-288, 1998
6 K. Y. Lee, M. S. Song, 'Automatic Detection of Korean Accentual Phrase Boundaries', The Journal of Acoustic Society of Korea, 18(1E), pp.27-31, 1999
7 E. Moulines, F. Charpentier, 'Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones', Speech Communication 9(5,6) pp,453-467, 1990   DOI
8 M. Akagi, T. lenaga,'Speaker Individualities in Fundamental Frequency Contours and Its Control', Proc. EuroSpeech'95, pp. 439-442, Sep, 1995
9 J. P. H. van Santen, 'Prosodic Modeling in Text-to-Speech Synthesis', Proc. EuroSpeech'97, KN 19-KN 28, 1997
10 Y. J. Kim, H. J. Byeon, Y. H. Oh, 'Prosodic Phrasing in Korean; Determine Governor, and then Split or Not', Proc. EuroSpeech'99, pp.539-542, 1999
11 D. T. Chappel, J. H. L. Hansen,'Speaker Specific Pitch Contour Modeling and Modification', Proc, ICASSP'98, 1, pp. 885- 888, 1998