• Title/Summary/Keyword: English prosody

Search Result 64, Processing Time 0.022 seconds

'Hanmal' Korean Language Diphone Database for Speech Synthesis

  • Chung, Hyun-Song
    • Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.55-63
    • /
    • 2005
  • This paper introduces a 'Hanmal' Korean language diphone database for speech synthesis, which has been publicly available since 1999 in the MBROLA web site and never been properly published in a journal. The diphone database is compatible with the MBROLA programme of high-quality multilingual speech synthesis systems. The usefulness of the diphone database is introduced in the paper. The paper also describes the phonetic and phonological structure of the database, showing the process of creating a text corpus. A machine-readable Korean SAMPA convention for the control data input to the MBROLA application is also suggested. Diphone concatenation and prosody manipulation are performed using the MBR-PSOLA algorithm. A set of segment duration models can be applied to the diphone synthesis of Korean.

  • PDF

Topicality and Focality of Contrastive Topic (대조주제의 주제성과 초점성)

  • Wee, Hae-Kyung
    • Language and Information
    • /
    • v.14 no.2
    • /
    • pp.47-70
    • /
    • 2010
  • This study investigates the semantic and prosodic properties of the so-called contrastive topic. We posit two informational primitives, namely, topical feature [+-T] and focal feature [+-F], from which four different informational categories, i.e., [+T, +F], [+T, -F], [-T, +F], and [-T, -F], are yielded. It is proposed that the informational category of contrastive topic has focal property [+F] as well as topical property [+T]. Based on the semantic approach that regards the function of [+F] as identificational predication and that of [+T] as forming a semantic conditional clause, it is shown that the semantic function of contrastive topic, which is specified as [+T, +F], is the combination of these two functions, i.e., identificational predication in a semantic conditional clause. This is supported by a scrutinized exploration of the prosodic pattern of English contrastive topic.

  • PDF

SWAPPING NATIVE AND NON-NATIVE SPEAKERS' PROSODY USING THE PSOLA ALGORITHM

  • Yoon Kyu-Chul
    • Proceedings of the KSPS conference
    • /
    • 2006.05a
    • /
    • pp.77-81
    • /
    • 2006
  • This paper presents a technique of imposing the prosodic features of a native speaker's utterance onto the same sentence uttered by a non-native speaker. Three acoustic aspects of the prosodic features were considered: the fundamental frequency (F0) contour, segmental durations, and the intensity contour. The fundamental frequency contour and the segmental durations of the native speaker's utterance were imposed on the non-native speaker's utterance by using the PSOLA (pitch-synchronous overlap and add) algorithm [1] implemented in Praat[2]. The intensity contour transfer was also done in Praat. The technique of transferring one or more of these prosodic features was elaborated and its implications in the area of language education were discussed.

  • PDF

Prosodic Disambiguation of Low versus High Syntactic Attachment across Lexical Biases in English

  • Jeon, Yoon-Shil;Yoon, Kyu-Chul
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.55-65
    • /
    • 2012
  • In this study, the prosodic disambiguation of the syntactic attachment differences was investigated in relation to the effect of lexical bias. Speech materials were composed of N1-conj-N2-PP phrases such as "walkers and runners with dogs." The results show that the use of durational pattern is dominant over the pitch pattern to differentiate the attachment differences. The characteristic pitch contour was the rise and fall over N1 and N2 in the high attachment. The pitch contour in the low attachment was the rise and fall over N2 and N3 although the frequency of such patterns was lower for the low attachment case. For the durational pattern, the lengthening in the N2 region plays a significant role in the disambiguation of the syntactic attachments. The interaction between the lexical bias and the syntactic attachment was not statistically significant in the duration data.

An Acoustic Study of the Stress and Intonational System in Lakhota: A Preliminary Report

  • Cho, Tae-Hong
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.23-42
    • /
    • 2006
  • This paper reports a preliminary result of an acoustic study on the stress and intonational system in Lakhota, a native American language. It investigates how the stress and intonation in Lakhota are phonetically manifested; and how the stress interacts with other prosodic factors. The results preliminarily obtained from one native Lakhota speaker suggest that the primary cue of the stress is relatively high F0 which is often accompanied by higher intensity (for the vowel) and longer VOT (for aspirated stops). The results also indicate that stress is not reliably marked by duration. The stress system, however, interacts with the intonational pattern, such that, for example, intonational peak falls on the stressed syllable with a general pattern of L+H* and that it interacts with the boundary tone L%, resulting in mid tone utterance-finally. This paper can be viewed largely as a qualitative study on an understudied native American language, Lakhota and as forming a basis for further development of its stress and intonation system whose acoustic properties of its prosodic system have not been investigated before.

  • PDF

A cross-modal naming study: Effects of prosodic boundaries on the comprehension of relative clauses in Japanese

  • Kang, Soyoung;Kashiwagi, Akiko;Nakayama, Mineharu;Speer, Shari R.
    • Cross-Cultural Studies
    • /
    • v.24
    • /
    • pp.157-169
    • /
    • 2011
  • Compared to studies on prosodic effects on the comprehension of syntactic ambiguity in English, there are relatively few that investigated prosodic effects in East-Asian languages. This study examined the role of prosodic information in processing syntactically ambiguous sentences in Japanese. For syntactically ambiguous sentences containing relative clauses, this paper investigated whether prosodic information is immediately available during the process of these ambiguous sentences. Results from an auditory comprehension experiment with an on-line, cross-modal naming task seemingly suggest that contrary to the findings from the off-line study that examined the same constructions, prosodic information may not be immediately available to Japanese listeners. A possible account for failure to obtain effects of prosodic information is provided.

Prosodic Boundary Effects on the V-to-V Lingual Movement in Korean

  • Cho, Tae-Hong;Yoon, Yeo-Min;Kim, Sa-Hyang
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.101-113
    • /
    • 2010
  • The present study investigated how the kinematics of the /a/-to-/i/ tongue movement in Korean would be influenced by prosodic boundary. The /a/-to-/i/ sequence was used as 'transboundary' test materials which occurred across a prosodic boundary as in /ilnjəʃ$^h$a/ # / minsakwae/ ('일년차#민사과에' 'the first year worker' # 'dept. of civil affairs'). It also tested whether the V-to-V tongue movement would be further influenced by its syllable structure with /m/ which was placed either in the coda condition (/am#i/) or in the onset condition (/a#mi). Results of an EMA (Electromagnetic Articulagraphy) study showed that kinematical parameters such as the movement distance (displacement), the movement duration, and the movement velocity (speed) all varied as a function of the boundary strength, showing an articulatory strengthening pattern of a "larger, longer and faster" movement. Interestingly, however, the larger, longer and faster pattern associated with boundary marking in Korean has often been observed with stress (prominence) marking in English. It was proposed that language-specific prosodic systems induce different ways in which phonetics and prosody interact: Korean, as a language without lexical stress and pitch accent, has more degree of freedom to express prosodic strengthening, while languages such as English have constraints, so that some strengthening patterns are reserved for lexical stress. The V-to-V tongue movement was also found to be influenced by the intervening consonant /m/'s syllable affiliation, showing a more preboundary lengthening of the tongue movement when /m/ was part of the preboundary syllable (/am#i/). The results, together, show that the fine-grained phonetic details do not simply arise as low-level physical phenomena, but reflect higher-level linguistic structures, such as syllable and prosodic structures. It was also discussed how the boundary-induced kinematic patterns could be accounted for in terms of the task dynamic model and the theory of the prosodic gesture ($\pi$-gesture).

  • PDF

Shapes of Vowel F0 Contours Influenced by Preceding Obstruents of Different Types - Automatic Analyses Using Tilt Parameters-

  • Jang, Tae-Yeoub
    • Speech Sciences
    • /
    • v.11 no.1
    • /
    • pp.105-116
    • /
    • 2004
  • The fundamental frequency of a vowel is known to be affected by the identity of the preceding consonant. The general agreement is that strong consonants trigger higher F0 than weak consonants. However, there has been a disagreement on the shape of this segmentally affected F0 contours. Some studies report that shapes of contours are differentiated based on the consonant type, but others regard this observation as misleading. This research attempts to resolve this controversy by investigating shapes and slopes of F0 contours of Korean word level speech data produced by four male speakers. Instead of entirely relying on traditional human intuition and judgment, I employed an automatic F0 contour analysis technique known as tilt parameterisation (Taylor 2000). After necessary manipulation of an F0 contour of each data token, various parameters are collapsed into a single tilt value which directly indicates the shape of the contour. The result, in terms of statistical inference, shows that it is not viable to conclude that the type of consonant is significantly related to the shape of F0 contour. A supplementary measurement is also made to see if the slope of each contour bears meaningful information. Unlike shapes themselves, slopes are suspected to be practically more practical for consonantal differentiation, although confirmation is required through further refined experiments.

  • PDF

The Role of Post-lexical Intonational Patterns in Korean Word Segmentation

  • Kim, Sa-Hyang
    • Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.37-62
    • /
    • 2007
  • The current study examines the role of post-lexical tonal patterns of a prosodic phrase in word segmentation. In a word spotting experiment, native Korean listeners were asked to spot a disyllabic or trisyllabic word from twelve syllable speech stream that was composed of three Accentual Phrases (AP). Words occurred with various post-lexical intonation patterns. The results showed that listeners spotted more words in phrase-initial than in phrase-medial position, suggesting that the AP-final H tone from the preceding AP helped listeners to segment the phrase-initial word in the target AP. Results also showed that listeners' error rates were significantly lower when words occurred with initial rising tonal pattern, which is the most frequent intonational pattern imposed upon multisyllabic words in Korean, than with non-rising patterns. This result was observed both in AP-initial and in AP-medial positions, regardless of the frequency and legality of overall AP tonal patterns. Tonal cues other than initial rising tone did not positively influence the error rate. These results not only indicate that rising tone in AP-initial and AP_final position is a reliable cue for word boundary detection for Korean listeners, but further suggest that phrasal intonation contours serve as a possible word boundary cue in languages without lexical prominence.

  • PDF

The Role of H Tone of an AP in Korean: The Relation Between Prosody and Morphology

  • Kang, Hyun-Sook
    • Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.7-23
    • /
    • 2008
  • This paper investigates tonal patterns of the prosodic constituents of an AP and a PWD in Korean and their relation with the morphological/syntactic structure. Specifically, this paper asks the following questions: First, if there are more than one PWD in an AP, how is each PWD specified in terms of tones? Secondly, in case that there is only one PWD in an AP that consists of several morphemes, is there any preference of the association between tones and the morphemes that constitute that PWD? Thirdly, if an AP dominates a PWD and if a PWD contains at least one morpheme of the lexical category, it follows that an AP should contain at least one morpheme of the lexical category. Can this be verified with the experimental data? In order to answer these questions, Experiment I and II were conducted with the target material consisting of a stem and suffixes that varied in length. The results of this preliminary test show that as the number of syllables in the target material increases, the more number of an AP tonal pattern occurs in it and as a result, in some cases, an AP consisting of suffixes only may occur.

  • PDF