• Title/Summary/Keyword: speech cues

Search Result 117, Processing Time 0.023 seconds

Perceptual cues for /o/ and /u/ in Seoul Korean (서울말 /?/와 /?/의 지각특성)

  • Byun, Hi-Gyung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.1-14
    • /
    • 2020
  • Previous studies have confirmed that /o/ and /u/ in Seoul Korean are undergoing a merger in the F1/F2 space, especially for female speakers. As a substitute parameter for formants, it is reported that female speakers use phonation (H1-H2) differences to distinguish /o/ from /u/. This study aimed to explore whether H1-H2 values are being used as perceptual cues for /o/-/u/. A perception test was conducted with 35 college students using /o/ and /u/ spoken by 41 females, which overlap considerably in the vowel space. An acoustic analysis of 182 stimuli was also conducted to see if there is any correspondence between production and perception. The identification rate was 89% on average, 86% for /o/, and 91% for /u/. The results confirmed that when /o/ and /u/ cannot be distinguished in the F1/F2 space because they are too close, H1-H2 differences contribute significantly to the separation of the two vowels. However, in perception, this was not the case. H1-H2 values were not significantly involved in the identification process, and the formants (especially F2) were still dominant cues. The study also showed that even though H1-H2 differences are apparent in females' production, males do not use H1-H2 in their production, and both females and males do not use H1-H2 in their perception. It is presumed that H1-H2 has not yet been developed as a perceptual cue for /o/ and /u/.

Learning acoustic cue weights for Korean stops through L2 perception training (지각 훈련을 통한 한국어 폐쇄음 음향 신호 가중치의 L2 학습)

  • Oh, Eunjin
    • Phonetics and Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.9-21
    • /
    • 2021
  • This study investigated whether Korean learners improve acoustic cue weights to identify Korean lenis and aspirated stops in the direction of native values through perception training that focused on contrasting the stops in various phonetic contexts. Nineteen native Chinese learners of Korean and two native Korean instructors for the perception training participated in the experiment. A training group and a non-training group were divided according to pretest results, and only the training group participated in the training for 5 days. To estimate the perceptual weights of the stop cues, a pretest and a posttest were conducted with stimuli whose stop cues (F0 and VOT) were systematically manipulated. Binary logistic regression analyses were performed on each learner's test results to calculate perceptual β coefficients, which estimate the perceptual weights of the acoustic cues used in identifying the stop contrast. The training group showed a statistically significant increase of 0.451 on average in the posttest for the coefficient values of the F0, which is the primary cue for the stop contrast, whereas the non-training group showed an insignificant increase of 0.246. The patterns of change in the F0 use after training varied considerably among individual learners.

Detecting lies through suspect's nonverbal behaviors in the investigation scene (군 수사현장에서 용의자의 비언어적 행동을 이용한 거짓말 탐지)

  • Si Up Kim;Woo Byoung Jhon;Chung Hyun Jeon
    • Korean Journal of Culture and Social Issue
    • /
    • v.12 no.2
    • /
    • pp.101-114
    • /
    • 2006
  • This study was examined the effective nonverbal behavior cues of detecting suspects' lies in the investigation scene. In order to search the suspects who drank the alcohol liquor without a permission, 18 soldiers were interviewed. 8 solders had drunken alcohol and had lied when was asked(lie group). The other 10 soldiers hadn't drunken alcohol and had told the truth(truth group). The mean frequencies of nonverbal behaviors were compared lie group with truth group. The following behaviors were measured by frequency: vocal characteristics (high pitch of voice, speech hesitations, speech error, frequency of pauses, period of pauses, latency period), facial characteristics (gaze, smile, touching face, blinking, facial micro-expression), body movement (illustrators, hand and finger movement, leg and foot movement, head movement, trunk movement, shifting position). As results, this study found that deception cues were periods and frequencies of pause, micro-expression, head movements. The lie group had less periods and frequencies of pause, and more micro-expression, head movements than truth group. But, this study didn't found Othello's error cues.

Intelligibility Enhancement of Multimedia Contents Using Spectral Shaping (스펙트럼 성형기법을 이용한 멀티미디어 콘텐츠의 명료도 향상)

  • Ji, Youna;Park, Young-cheol;Hwang, Young-su
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.11
    • /
    • pp.82-88
    • /
    • 2016
  • In this paper, we propose an intelligibility enhancement algorithm for multimedia contents using spectral shaping. The dialogue signals is essential to understand the plot of audio-visual media contents such as movie and TV. However, the non-dialogue components as like sound effects and background music often degrade the dialogue clarity. To overcome this problem, this paper tries to improves the dialogue clarity of audio soundtracks which contain important cues for the visual scenes. In the proposed method, the dialogue components are first detected by soft masker based on speech presence probability (SPP) which is widely used in speech enhancement field. Then, extracted dialogue signals are applied to the spectral shaping method. It reallocate the spectral-temporal energy of speech to enhanced the intelligibility. The total energy is maintained as unchanged via a loudness normalization process to prevent saturation. The algorithm was evaluated using the modeled and real movie soundtracks and it was shown that the proposed algorithm enhances the dialogue clarity while preserving the total audio power.

Gaps-In-Noise Test Performance in Children with Speech Sound Disorder and Cognitive Difficulty

  • Jung, Yu Kyung;Lee, Jae Hee
    • Journal of Audiology & Otology
    • /
    • v.24 no.3
    • /
    • pp.133-139
    • /
    • 2020
  • Background and Objectives: The Gaps-In-Noise (GIN) test is a clinically effective measure of the integrity of the central auditory nervous system. The GIN procedure can be applied to a pediatric population above 7 years of age. The present study conducted the GIN test to compare the abilities of auditory temporal resolution among typically developing children, children with speech sound disorder (SSD), and children with cognitive difficulty (CD). Subjects and Methods: Children aged 8 to 11 years-(total n=30) participated in this study. There were 10 children in each of the following three groups: typically developing children, children with SSD, and children with CD. The Urimal Test of Articulation and Phonology was conducted as a clinical assessment of the children's articulation and phonology. The Korean version of the Wechsler Intelligence Scale for Children-III (K-WISC-III) was administered as a screening test for general cognitive function. According to the procedure of Musiek, the pre-recorded stimuli of the GIN test were presented at 50 dB SL. The results were scored by the approximated threshold and the overall percent correct score (%). Results: All the typically developing children had normal auditory temporal resolution based on the clinical cutoff criteria of the GIN test. The children with SSD or CD had significantly reduced gap detection performance compared to age-matched typically developing children. The children's intelligence score measured by the K-WISC-III test explained 37% of the variance in the percent-correct score. Conclusions: Children with SSD or CD exhibited poorer ability to resolve rapid temporal acoustic cues over time compared to the age-matched typically developing children. The ability to detect a brief temporal gap embedded in a stimulus may be related to the general cognitive ability or phonological processing.

Gaps-In-Noise Test Performance in Children with Speech Sound Disorder and Cognitive Difficulty

  • Jung, Yu Kyung;Lee, Jae Hee
    • Korean Journal of Audiology
    • /
    • v.24 no.3
    • /
    • pp.133-139
    • /
    • 2020
  • Background and Objectives: The Gaps-In-Noise (GIN) test is a clinically effective measure of the integrity of the central auditory nervous system. The GIN procedure can be applied to a pediatric population above 7 years of age. The present study conducted the GIN test to compare the abilities of auditory temporal resolution among typically developing children, children with speech sound disorder (SSD), and children with cognitive difficulty (CD). Subjects and Methods: Children aged 8 to 11 years-(total n=30) participated in this study. There were 10 children in each of the following three groups: typically developing children, children with SSD, and children with CD. The Urimal Test of Articulation and Phonology was conducted as a clinical assessment of the children's articulation and phonology. The Korean version of the Wechsler Intelligence Scale for Children-III (K-WISC-III) was administered as a screening test for general cognitive function. According to the procedure of Musiek, the pre-recorded stimuli of the GIN test were presented at 50 dB SL. The results were scored by the approximated threshold and the overall percent correct score (%). Results: All the typically developing children had normal auditory temporal resolution based on the clinical cutoff criteria of the GIN test. The children with SSD or CD had significantly reduced gap detection performance compared to age-matched typically developing children. The children's intelligence score measured by the K-WISC-III test explained 37% of the variance in the percent-correct score. Conclusions: Children with SSD or CD exhibited poorer ability to resolve rapid temporal acoustic cues over time compared to the age-matched typically developing children. The ability to detect a brief temporal gap embedded in a stimulus may be related to the general cognitive ability or phonological processing.

Intonational Characteristics of Korean Focus Realization by American Learners of Korean

  • Oh, Mi-Ra;Kang, Sun-Mi;Kim, Kee-Ho
    • Speech Sciences
    • /
    • v.11 no.1
    • /
    • pp.131-145
    • /
    • 2004
  • The informative or important entities in utterances are focused and the focused items are usually accompanied by changes in phonetic manifestation. Phonetic realizations triggered by focus include changes of tonal contours as well as segmental strengthening. Focus in Korean is characterized by new phrase initiation, dephrasing, and initial tone contour with an enlarged pitch range in addition to segmentally lengthened initial segment. Focusing on the prosodic cues which play an important role in delivering the speakers' intention, this study aims to find out what intonational characteristics of Korean focus are realized by English learners of Korean. The English learners are divided into two groups according to their fluency in Korean, and the differences in focus realization between each group are discussed. Furthermore, the phonological and phonetic realizations of focus by English learners of Korean are compared to those by Korean native speakers. The results of this study yields two suggestions for Korean intonation education of L2 learners. First, the comparison between the two speaker groups can give better understanding in how and why the Korean intonation of English speakers is different from that of Koreans. Second, each phonological and phonetic characteristic of focus realization can weigh differently and its realization provides a criterion for evaluation of L2 Korean proficiency.

  • PDF

Perception of the English Epenthetic Stops by Korean Listeners

  • Han, Jeong-Im
    • Speech Sciences
    • /
    • v.11 no.1
    • /
    • pp.87-103
    • /
    • 2004
  • This study investigates Korean listeners' perception of the English stop epenthesis between the sonorant and fricative segments. Specifically this study investigates 1) how often English epenthetic stops are perceived by native Korean listeners, given the fact that Korean does not allow consonant clusters in codas; and 2) whether perception of the epenthetic stops, which are optional phonetic variations, not phonemes, could be improved without any explicit training. 120 English non-words with a mono-syllable structure of CVC1C2, where C1=/m, n, $\eta$, 1/, and C2=/s, $\theta$, $\int$/, were given to two groups of native Korean listeners, and they were asked to detect the target stops such as [p], [t], and [k]. The number of their responses were computed to determine how often listeners succeed in recovering the string of segments produced by the native English speaker. The results of the present study show that English epenthetic stops are poorly identified by native Korean listeners with low English proficiency, even in the case where stimuli with strong acoustic cues are provided with, but perception of epenthetic stops is closely related with listeners' English proficiency, showing the possibility of the improvement of perception. It further shows that perception of epenthetic stops shows asymmetry between coronal and non-coronal consonants.

  • PDF

L2 Proficiency Effect on the Acoustic Cue-Weighting Pattern by Korean L2 Learners of English: Production and Perception of English Stops

  • Kong, Eun Jong;Yoon, In Hee
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.81-90
    • /
    • 2013
  • This study explored how Korean L2 learners of English utilize multiple acoustic cues (VOT and F0) in perceiving and producing the English alveolar stop with a voicing contrast. Thirty-four 18-year-old high-school students participated in the study. Their English proficiency level was classified as either 'high' (HEP) or 'low' (LEP) according to high-school English level standardization. Thirty different synthesized syllables were presented in audio stimuli by combining a 6-step VOTs and a 5-step F0s. The listeners judged how close the audio stimulus was to /t/ or /d/ in L2 using a visual analogue scale. The L2 /d/ and /t/ productions collected from the 22 learners (12 HEP, 10 LEP) were acoustically analyzed by measuring VOT and F0 at the vowel onset. Results showed that LEP listeners attended to the F0 in the stimuli more sensitively than HEP listeners, suggesting that HEP listeners could inhibit less important acoustic dimensions better than LEP listeners in their L2 perception. The L2 production patterns also exhibited a group-difference between HEP and LEP in that HEP speakers utilized their VOT dimension (primary cue in L2) more effectively than LEP speakers. Taken together, the study showed that the relative cue-weighting strategies in L2 perception and production are closely related to the learner's L2 proficiency level in that more proficient learners had a better control of inhibiting and enhancing the relevant acoustic parameters.

자음의 단어내 음운환경별로 본 음가변화

  • 김종미
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.5
    • /
    • pp.69-76
    • /
    • 1994
  • Acoustic cues of some consonantal phonology were tested in Korean words. All Korean consonants were recorded and acoustically analyzed in controlled phonological environments :ⅰ) word-initial, ⅱ) inter-vocalic, and ⅲ) word-final positions. The observed acoustic regulations are : ⅰ) The lengths of obstruents are longer word-initially than word-finally, ⅱ) The lengths of sonorants are longer word-finally than in word-initial or inter-vocalic positions, ⅲ) The formants of the lateral sound /l/ are higher word-finally than intervocalically. The phonological explanations of these acoustic regulations can be found in the rules of ⅰ) inter-vocalic voicing of plain stops, ⅱ) syllable-final unreleasing of obstruents, ⅲ) word-initial aspiration of stops, and ⅳ) liquid alternation between [r] and [l]. Numerical data of all these acoustic regulations are reported in order to facilitate their application toward improving naturalness for speech synthesis and accurateness for speech recognition.

  • PDF