• Title/Summary/Keyword: Prosodic perception

Search Result 37, Processing Time 0.018 seconds

Acoustic Realization of Metrical Structure in Orally Produced Korean Modern Poetry (한국 현대시 운율의 음향 발현)

  • Kim, Hyun-Gi;Hong, Ki-Hwan;Kim, Sun-Sook
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.181-192
    • /
    • 2004
  • The metrical structures in orally produced the poetry were generally analyzed by accent, metre and syllable. The purpose of this study is to investigate of metrical structures of Korean modem poetry using computer implemented speech analysis system. Two famous poet's poems confidential talk, Miloe and 'A buddhist dance, Sungmu' were selected for prosodic analysis. The informant is 60 years old professor in major of Korean and French poetry. The syllable structures of poems were analyzed primarily by vowel timbers, which can classified compact and diffuse vowels according to the distance of F2-F1. The perception cues of consonants were analyzed by VOT and tensity features of articulation. Rhythm is classified by dactyl, anapest, trochee, spondee and iambic. As a result, syllable structures of Korean modem poetry were mainly CV and CVC and the reading times of each lines were 3-4sec for 12 and 15 syllables. Main metre of Korean modem poems constructed the Imbic and Anapest. The break of each lines were demarcated by grammatical structure or meaning rather than phonetic structures.

  • PDF

Production and Perception from Perspective of Focus

  • Noh, Bo-Kyung
    • Language and Information
    • /
    • v.6 no.1
    • /
    • pp.105-121
    • /
    • 2002
  • This paper investigates the effect of semantic argument structure on the comprehension and production of sentences by observing the prosodic realizations of English secondary predications. Specifically, the goal of this study is to show how the theory of predication, argument structure, and focus semantically interact to account for similarities and differences between English resultative and depictive predications. To address this issue, production and comprehension tests were performed. In the fried focus domain (verb phrase), subjects were asked to utter and to comprehend ambiguous sentences in the context monologues. The experimental results were generally consistent with general linguistic analyses: In the resultative constructions, secondary subject NPs tend to be accented, as in other argument-head constructions, while in the depictive constructions, secondary predicates tend to have accents, as in other adjunct-head constructions.

  • PDF

Confusion in the Perception of English Anterior Coronal Consonants by Korean EFL Students (한국 EFL 학생들의 영어 전방 설정 자음 혼동)

  • Cho, Mi-Hui
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.5
    • /
    • pp.460-466
    • /
    • 2010
  • It is well-known that Korean EFL learners have difficulties in producing English fricatives which are not in the inventory of Korean and consequently tend to replace English fricatives with stops. The purpose of this paper is to investigate whether Korean students also have difficulties perceiving English anterior coronal consonants including fricatives. To this end, forty Korean college students participated in an identification test which consisted of 24 nonce words with English anterior coronal consonants in 4 different prosodic locations (CV, VC, VCVV,VVCV). It was shown that the mean accuracy rates were higher in strong position like CV and VCVV than in weak position like VC and VVCV, providing confusion matrices for each target consonant. It was also found that Korean participants had a great difficulty identifying English[$\theta$] and [$\eth$], which are novel in Korean. Importantly, the confusion patterns found in the perception test tended not to be identical with those found in the previous production studies in that both stops and fricatives were misperceived as fricatives while fricatives were misproduced as stops. Further, perceptual devoicing and intervocalic voicing were attested inVC and intervocalic position, respectively. Based on the findings of this study, pedagogical implications were drawn.

Duration of the Japanese 'sokuon' and 'haneruon' in Korean and Japanese speakers' production (일본어의 촉음과 발음의 지속시간 연구 - 한국인과 일본인을 중심으로 -)

  • Lee Jae Kang
    • MALSORI
    • /
    • no.38
    • /
    • pp.99-112
    • /
    • 1999
  • The aim of this paper is to measure the duration of Japanese 'sokuon' [t/k] and 'haneruon' [m/n] produced by Korean and Japanese native speakers. It was shown that in the case of Korean speakers, the duration of geminate of 'sokuon' was 1.5 times longer than that of a single consonant, whereas in the case of Japanese speakers, it was 2 times longer. The difference between Korean and Japanese prosodic structures appears to affect the perception and acquisition of a foreign rhythmic patternm non-existent in the speaker's native tongue. The duration of geminate of [s] was 2 times as long as a single consonant in both Korean and Japanese speakers' production. On the average, the duration of Japanese 'sokuon' [t/k/s] was 1.7 times longer than that of a single consonant in Korean speakers' pronunciation, whereas 2 times longer in Japanese speakers' pronunciation. The production of 'haneruon' by either Korean or Japanese speakers yielded a similar result to 'sokoun': 1) geminates lasted longer than a single consonant; 2) single [m] is longer than single [n]: 3) geminate of [n] is 3 times as long as single [n], whereas geminate of [m] is 2 times as long as single [m].

  • PDF

A 3D Audio-Visual Animated Agent for Expressive Conversational Question Answering

  • Martin, J.C.;Jacquemin, C.;Pointal, L.;Katz, B.
    • 한국정보컨버전스학회:학술대회논문집
    • /
    • 2008.06a
    • /
    • pp.53-56
    • /
    • 2008
  • This paper reports on the ACQA(Animated agent for Conversational Question Answering) project conducted at LIMSI. The aim is to design an expressive animated conversational agent(ACA) for conducting research along two main lines: 1/ perceptual experiments(eg perception of expressivity and 3D movements in both audio and visual channels): 2/ design of human-computer interfaces requiring head models at different resolutions and the integration of the talking head in virtual scenes. The target application of this expressive ACA is a real-time question and answer speech based system developed at LIMSI(RITEL). The architecture of the system is based on distributed modules exchanging messages through a network protocol. The main components of the system are: RITEL a question and answer system searching raw text, which is able to produce a text(the answer) and attitudinal information; this attitudinal information is then processed for delivering expressive tags; the text is converted into phoneme, viseme, and prosodic descriptions. Audio speech is generated by the LIMSI selection-concatenation text-to-speech engine. Visual speech is using MPEG4 keypoint-based animation, and is rendered in real-time by Virtual Choreographer (VirChor), a GPU-based 3D engine. Finally, visual and audio speech is played in a 3D audio and visual scene. The project also puts a lot of effort for realistic visual and audio 3D rendering. A new model of phoneme-dependant human radiation patterns is included in the speech synthesis system, so that the ACA can move in the virtual scene with realistic 3D visual and audio rendering.

  • PDF

The Relationship Between Perception of Prosody, Pitch Discrimination, and Melodic Contour Identification in Cochlear Implants Recipients (인공와우이식 난청인의 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인 간 관련성)

  • Kim, Eun Yeon;Moon, Il Joon;Cho, Yang-sun;Chung, Won-ho;Hong, Sung Hwa
    • Journal of Music and Human Behavior
    • /
    • v.14 no.2
    • /
    • pp.1-18
    • /
    • 2017
  • The relationships between the ability to understand changes in meaning depending on the prosody of spoken words and the ability to perceive pitch and melodic contour in cochlear implants (CI) recipients were examined. Fifteen postlingual CI recipients were measured in terms of speech prosody perception, speech perception, pitch discrimination (PD), and melody contour identification (MCI). The speech prosody perception test consists of words with positive (PW) and neutral meaning (NW). Participants were asked to identify the meaning of words depending on the conditions of positive and negative prosody. The MCI consists of subtests 1 and 2 with different chance levels to choose. Then, the relationships between speech prosody perception, speech perception, PD, and MCI performance were analyzed. There was a significant difference in identifying the meaning of words expressed in a different prosody between the PW and NW conditions. Speech prosody perception showed a significant correlation with MCI 1 while there was no significant relationship with speech perception. Although speech perception may be possible after CI, limited spoken word comprehension due to decreased sensitivity for prosodic changes may persist in CI recipients. In addition, there was a limitation in perception of melodic contour change compared to pitch discrimination, which is related to speech prosody perception.

A perceptual study on the correlation between the meaning of Korean polysemic ending and its boundary tone (동형다의 종결어미의 의미와 경계성조의 상관성에 대한 지각연구)

  • Youngsook Yune
    • Phonetics and Speech Sciences
    • /
    • v.14 no.4
    • /
    • pp.1-10
    • /
    • 2022
  • The Korean polysemic ending '-(eu)lgeol' can has two different meanings, 'guess' and 'regret'. These are expressed by different boundary-tone types: a rising tone for guess, a falling one for regret. Therefore the sentence-final boundary-tone type is the most salient prosodic feature. However, besides tone type, the pitch difference between the final and penultimate syllables of '-(eu)lgeol' can also affect semantic discrimination. To investigate this aspect, we conducted a perception test using two sentences that were morphologically and syntactically identical. These two sentences were spoken using different boundary-tone types by a Korean native speaker. From these two sentences, the experimental stimuli were generated by artificially raising or lowering the pitch of the boundary syllable by 1Qt while fixing the pitch of the penultimate syllable and boundary-tone type. Thirty Korean native speakers participated in three levels of perceptual test, in which they were asked to mark whether the experimental sentences they listened to were perceived as guess or regret. The results revealed that regardless of boundary-tone types, the larger the pitch difference between the final and penultimate syllable in the positive direction, the more likely it is perceived as guess, and the smaller the pitch difference in the negative direction, the more likely it is perceived as regret.