• Title/Summary/Keyword: VOT shift


The Production of Stops by Seoul and Yanbian Korean Speakers

  • Oh, Mira;Yang, Hui
    • Phonetics and Speech Sciences / v.5 no.4 / pp.185-193 / 2013
  • This study investigates dialectal differences in the acoustic properties of Korean lenis, aspirated, and tense stops in Seoul Korean (standard Korean) and Yanbian Korean (spoken in the largest Korean Autonomous Prefecture in China). This production study examines the main acoustic cues that each dialect uses to mark the laryngeal distinction between the three types of Korean stops. Measurements included VOT, $H1^*-H2^*$, and the initial F0 of the following vowel. Data were collected from 10 young Seoul Korean speakers, 10 young Yanbian Korean speakers, and 6 older Yanbian speakers. There are two key findings. First, aspirated and lenis stops are mainly differentiated by F0 in Seoul Korean, and by $H1^*-H2^*$ in Yanbian Korean. Second, there is no VOT merger between lenis and aspirated stops in Yanbian Korean, whereas there is in Seoul Korean. These results are discussed in terms of the phenomenon of VOT shift and the function of F0. It is argued that the function of F0 to substitute for VOT difference as a primary cue for the coding of laryngeal contrast can be predicted by the pitch accent system of the language involved.

Reinterpretation of Stop Production in Korean Elderly Speakers (노년층 파열음 발음의 재해석)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences / v.7 no.2 / pp.139-145 / 2015
  • Researchers have claimed that younger Korean speakers tend to differentiate aspirated and lax stops less clearly by VOT, while older speakers clearly differentiate these two stops by VOT. To explain this phenomenon, the current study considers both an aging effect and a general sound shift. For this study, VOT values and F0 of Korean stops produced by eight male speakers (born between 1942 and 1952) were analyzed using Praat. Their productions were compared with the values of the participants born between 1943 and 1952 in Silva (2006). Silva's research was conducted in 2004 using the same methods. The results show a larger VOT gap and a smaller F0 gap between aspirated and lax stops in 2014 than in 2004. Given that F0 is related to the physical condition of the larynx, the results can be interpreted as follows: to distinguish the three-way phonation contrast clearly, older speakers rely more on VOT than on F0, which they have difficulty controlling.

A study of L1 phonetic drift in the voice onset times of Korean learners of English with long L2 exposure

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences / v.11 no.4 / pp.35-43 / 2019
  • This study examines the voice onset times (VOTs) of Korean stops produced by Korean learners of English with high language proficiency and long L2 exposure (i.e., Korean-English bilinguals) to assess whether the VOTs of their lax and aspirated stops are merging and, if so, which types of stops are changing. Thirteen Korean speakers (six female and seven male) who had studied in the USA for more than three to ten years participated. The results show that the speakers in this study with long L2 exposure are participating in the VOT merger, in which VOTs for aspirated stops are reduced while those for lax stops are increased. In other words, change in VOT affects not only aspirated stops but also lax stops. The results indicate that L1 phonetic drift may not be primarily affected by the amount of L2 exposure, and language contact may not be the primary factor triggering a sound change in the Korean stop system. Further study focusing on the phonetic shift of the "lax" category is necessary because it may play a pivotal role in a tonogenetic-like sound change in present-day Korean.

Speech processing strategy and executive function: Korean children's stop perception

  • Kong, Eun Jong;Yoo, Jeewon
    • Phonetics and Speech Sciences / v.9 no.3 / pp.57-65 / 2017
  • The current study explored how Korean-speaking children processed the multiple acoustic cues (VOT and f0) for the stop laryngeal contrast (/t'/, /t/, and /$t^h$/) and examined whether individual perceptual strategies could be related to general cognitive ability in performing executive functions (EF). Fifteen children (aged 7 to 8) participated in a speech perception task in which they identified the three Korean laryngeal stops (3AFC) after listening to auditory stimuli of C-/a/ with synthetically varied VOT and f0. They also completed a series of EF tasks measuring working memory, inhibition, and cognitive shifting ability. The findings showed that children used the two cues in a highly correlated manner. While children utilized VOT consistently for the three laryngeal categories, their use of f0 was either reduced or enhanced depending on the phonetic category. Importantly, the children's strategy of suppressing f0 for the tense-aspirated contrast was meaningfully associated with better cognitive abilities such as working memory, inhibition, and attentional shifting. As a preliminary experimental investigation, the current research demonstrated that listeners with inefficient processing strategies had poorer EF skills, suggesting that cognitive skills might be responsible for developmental variation in the processing of sub-phonemic information for the linguistic contrast.

Visual Object Tracking Fusing CNN and Color Histogram based Tracker and Depth Estimation for Automatic Immersive Audio Mixing

  • Park, Sung-Jun;Islam, Md. Mahbubul;Baek, Joong-Hwan
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.3 / pp.1121-1141 / 2020
  • We propose a robust visual object tracking algorithm that fuses a convolutional neural network (CNN) tracker, trained offline on a large number of video repositories, with a color histogram based tracker in order to track objects for mixing immersive audio. Our algorithm addresses the problems that the CNN based GOTURN generic object tracker has with occlusion and large movements. The key idea is to train a binary classifier offline on the color histogram similarity values estimated via both trackers, use it to select the appropriate tracker for target tracking, and update both trackers with the predicted bounding box position of the target to continue tracking. Furthermore, a histogram similarity constraint is applied before updating the trackers to maximize the tracking accuracy. Finally, we compute the depth (z) of the target object with one of the prominent unsupervised monocular depth estimation algorithms to obtain the 3D position of the tracked object that is needed to mix the immersive audio into that object. Our proposed algorithm demonstrates about 2% higher accuracy than the high-performing GOTURN algorithm on the existing VOT2014 tracking benchmark. Additionally, our tracker also works well for tracking multiple objects using the single object tracker concept, although this is not demonstrated on any MOT benchmark.
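  The tracker-selection idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not name the similarity measure, so histogram intersection is assumed here. The paper's offline-trained binary classifier is replaced by a simple comparison rule. The names `color_histogram`, `histogram_similarity`, `choose_tracker` and the threshold `min_sim` are all hypothetical.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each in 0..255


def color_histogram(pixels: Sequence[Pixel], bins: int = 4) -> List[float]:
    """Normalized joint RGB histogram with `bins` levels per channel."""
    hist = [0.0] * (bins ** 3)
    step = 256 // bins
    for r, g, b in pixels:
        # Clamp so the top intensity always lands in the last bin.
        ri = min(r // step, bins - 1)
        gi = min(g // step, bins - 1)
        bi = min(b // step, bins - 1)
        hist[ri * bins * bins + gi * bins + bi] += 1.0
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def histogram_similarity(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Histogram intersection (assumed measure): 1.0 identical, 0.0 disjoint."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def choose_tracker(sim_cnn: float, sim_hist: float, min_sim: float = 0.5) -> str:
    """Hypothetical stand-in for the trained binary classifier.

    Picks the tracker whose predicted box looks more like the target.
    Returns "none" when neither box passes the similarity constraint,
    meaning the trackers should not be updated on this frame.
    """
    if max(sim_cnn, sim_hist) < min_sim:
        return "none"
    return "cnn" if sim_cnn >= sim_hist else "histogram"


# Toy patches: a mostly red target, one box that matches it, one that does not.
target = color_histogram([(250, 10, 10)] * 8 + [(10, 10, 250)] * 2)
cnn_box = color_histogram([(240, 20, 20)] * 8 + [(20, 20, 240)] * 2)
hist_box = color_histogram([(10, 250, 10)] * 10)

sim_cnn = histogram_similarity(target, cnn_box)
sim_hist = histogram_similarity(target, hist_box)
selected = choose_tracker(sim_cnn, sim_hist)
```

  In this toy case the CNN box falls into the same color bins as the target, so `selected` is `"cnn"`. A real system would compute the histograms from image crops and learn the decision rule from data, as the abstract describes.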

Voice quality distinctions of the three-way stop contrast under prosodic strengthening in Korean

  • Jiyoung Jang;Sahyang Kim;Taehong Cho
    • Phonetics and Speech Sciences / v.16 no.1 / pp.17-24 / 2024
  • The Korean three-way stop contrast (lenis, aspirated, fortis) is currently undergoing a sound change, such that the primary cue distinguishing lenis and aspirated stops is shifting from voice onset time (VOT) to F0. Despite recent discussions of this shift, research on voice quality, traditionally considered an additional cue signaling the contrast, remains sparse. This study investigated the extent to which the associated voice quality [as reflected in the acoustic measurements of H1*-H2*, H1*-A1*, and cepstral peak prominence (CPP)] contributes to the three-way stop contrast, and how the realization is conditioned by prominence- vs. boundary-induced prosodic strengthening amid the ongoing sound change. Results for 12 native Korean speakers indicate that there was a substantial distinction in voice quality among the three stop categories, with the breathiness of the vowel being the greatest after the lenis, intermediate after the aspirated, and least after the fortis stops, indicating the role of voice quality in the maintenance of the three-way stop contrast. Furthermore, prosodic strengthening has different effects on the contrast and contributes to the enhancement of the phonological contrast contingent on whether it is induced by prominence or boundary.