• Title/Summary/Keyword: speech cues

Search Result 117, Processing Time 0.02 seconds

Correlation of Acoustic Cues in Stop Productions of Korean and English Adults and Children

  • Kong, Eun-Jong;Weismer, Gary
    • Phonetics and Speech Sciences
    • /
    • v.2 no.4
    • /
    • pp.29-37
    • /
    • 2010
  • Previous studies have investigated a between-category relationship of multiple acoustic cues for a laryngeal contrast by examining the distributions of VOT, f0 and H1-H2. The current study examined within-category correlations between cues comprising stops by Korean- and English-speaking adults and children to understand how children master the internal structure of stop phonation types in two languages. Word-initial stops were collected from about 70 children and 15 adults speaking English and Korean, and were analyzed in terms of VOT, f0 and H1-H2 to compute correlation coefficients. Findings in adults' productions included a gender-differentiated cue-correlation pattern associated with H1-H2 in Korean tense stops and a trading relationship between f0 and VOT in Korean lax and aspirated stops and English voiced and voiceless stops. Children did not necessarily have adult-like cue-correlation patterns even in early-acquired categories, suggesting that the mastery of intra-category structure of phonation type might occur later than inter-category structure.

  • PDF

Attentional modulation on multiple acoustic cues in phonological processing of L2 sounds

  • Hyunjung Lee;Eun Jong Kong
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.11-16
    • /
    • 2023
  • The present study examines how a cognitive attention affects Korean learners of English (L2) in perceiving the English stop voicing distinction (/d/-/t/). This study tested the effect of attentional distractor on primary and non-primary acoustic cues, focusing on the role of Voice Onset Time (VOT) and fundamental frequency (F0). Using the dual-task paradigm, 28 Korean adult learners of English participated in the stop identification task carried with (distractor) and without (no-distractor) arithmetic calculation. Results showed that when distracted, Korean learners' sensitivity to VOT decreased as priorly reported with native English speakers. Furthermore, as F0 is a primary cue for a L1 Korean stop laryngeal contrast, its role in L2 English voicing distinction was also affected by a distractor, without compensating for the reduced VOT sensitivity. These findings suggest that flexible use of multiple cues in L1 is not necessarily beneficial for L2 phonological processing when coping with a adverse listening condition.

The Study on Asymmetry between Acoustics and Perception of the Temporal Cues of English Plosives (영어파열음 시구간신호의 음향과 지각 비대칭성 연구)

  • Kang Seok-Han
    • MALSORI
    • /
    • v.55
    • /
    • pp.15-31
    • /
    • 2005
  • This study tests the hypothesis that the voiced-voiceless distinction is influenced by the relationship between acoustics and perception. Production and perception tests are conducted with temporal cues in different environments(CV, VCV, VC). The result showed that acoustic cues indicating significant difference between voiceless/voiced plosives do not behave just as do in perception. The result also showed that there existed an asymmetry between acoustics and perception.

  • PDF

Executive function and Korean children's stop production

  • Eun Jong Kong;Hyunjung Lee;Jeffrey J. Holliday
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.45-52
    • /
    • 2023
  • Previous studies have established a role for cognitive differences in explaining variability in speech processing across individuals. In the case of perceptual cue weighting in the context of a sound change, studies have produced conflicting results regarding the relationship between executive function and the use of redundant cues. The current study aimed to explore this relationship in acoustic cue weighting during speech production. Forty-one Korean-speaking children read a list of stop-initial words and completed two tests that assess executive function, i.e., Dimensional Change Card Sorting (DCCS) and digit n-back. Voice onset time (VOT) and fundamental frequency (F0) were measured in each word, and analyses were carried out to determine the extent to which children's executive function predicted their use of both informative and less informative cues to the three pairs comprising the Korean three-way stop laryngeal contrast. No evidence was found for a relationship between cognitive ability and acoustic cue weighting in production, which is at odds with previous, albeit conflicting, results for speech perception. While this result may be due to the lack of task demands in the production task used here, it nevertheless expands the empirical ground upon which future work in this area may proceed.

A study on the release burst spectra of the voiceless plosives from the English and Korean spontaneous speech corpus (영어와 한국어 자연발화 코퍼스에서의 무성 폐쇄음 개방 파열 스펙트럼 연구)

  • Hwang, Sunmi;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.27-34
    • /
    • 2017
  • The purpose of this work is to examine the English and Korean voiceless plosives from the Buckeye[15] and Seoul[16] corpus in terms of their static spectral characteristics. The plosives were automatically extracted by a Praat script. In order to estimate the percent correctness in the classification of the plosives, discriminant analyses were performed whose trainings were based on four spectral moments, i.e. the center of gravity, variance, skewness and kurtosis as suggested in [6]. Another set of discriminant analyses were performed based on the spectral tilts. In the last set of analyeses, the spectral moments and tilts were both used in the training. Results showed that the correct classification rate did not exceed around 65% in the best case, which suggested that phonetic cues other than the release burst would be necessary including the dynamic spectral aspects and vowel-onset cues.

Phonetic Functionalism in Coronal/Non-coronal Asymmetry

  • Kim, Sung-A.
    • Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.41-58
    • /
    • 2003
  • Coronal/non-coronal asymmetry refers to the typological trend wherein coronals rather than non-coronals are more likely targets in place assimilation. Although the phenomenon has been accounted for by resorting to the notion of unmarkedness in formalistic approaches to sound patterns, the examination of rules and representations cannot answer why there should be such a process in the first place. Furthermore, the motivation of coronal/non-coronal asymmetry has remained controversial to date even in the field of phonetics. The present study investigated the listeners' perception of coronal and non-coronal stops in the context of $VC_{1}C_{2}V$ after critically reviewing the three types of phonetic accounts for coronal/non-coronal asymmetry, i.e., articulatory, perceptual, and gestural overlap accounts. An experiment was conducted to test whether the phenomenon in question may occur, given the listeners' lack of perceptual ability to identify weaker place cues in VC transitions as argued by Ohala (1990), i.e., coronals have weak place cues that cause listeners' misperception. 5pliced nonsense $VC_{1}C_{2}V$ utterances were given to 20 native speakers of English and Korean. Data analysis showed that majority of the subjects reported $C_{2}\;as\;C_{1}$. More importantly, the place of articulation of C1 did not affect the listeners' identification. Compared to non-coronals, coronals did not show a significantly lower rate of correct identifications. This study challenges the view that coronal/non-coronal asymmetry is attributable to the weak place cues of coronals, providing evidence that CV cues are more perceptually salient than VC cues. While perceptual saliency account may explain the frequent occurrence of regressive assimilation across languages, it cannot be extended to coronal/non-coronal asymmetry.

  • PDF

Speech processing strategy and executive function: Korean children's stop perception

  • Kong, Eun Jong;Yoo, Jeewon
    • Phonetics and Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.57-65
    • /
    • 2017
  • The current study explored how Korean-speaking children processed the multiple acoustic cues (VOT and f0) for the stop laryngeal contrast (/t'/, /t/, and /$t^h$/) and examined whether individual perceptual strategies could be related to a general cognitive ability performing executive functions (EF). 15 children (aged from 7 to 8) participated in the speech perception task identifying the three Korean laryngeal stops (3AFC) on listening to the auditory stimuli of C-/a/ with synthetically varying VOT and f0. They completed a series of EF tasks to measure working memory, inhibition, and cognitive shifting ability. The findings showed that children used the two cues in a highly correlated manner. While children utilized VOT consistently for the three laryngeal categories, their use of f0 was either reduced or enhanced depending on the phonetic categories. Importantly, the children's processing strategies of a f0 suppression for a tense-aspirated contrast were meaningfully associated with children's better cognitive abilities such as working memory, inhibition, and attentional shifting. As a preliminary experimental investigation, the current research demonstrated that listeners with inefficient processing strategies were poor at the EF skills, suggesting that cognitive skills might be responsible for developmental variations of processing sub-phonemic information for the linguistic contrast.

Development of a Reading Training Software offering Visual-Auditory Cue for Patients with Motor Speech Disorder (말운동장애인을 위한 시-청각 단서 제공 읽기 훈련 프로그램 개발)

  • Bang, D.H.;Jeon, Y.Y.;Yang, D.G.;Kil, S.K.;Kwon, M.S.;Lee, S.M.
    • Journal of Biomedical Engineering Research
    • /
    • v.29 no.4
    • /
    • pp.307-315
    • /
    • 2008
  • In this paper, we developed a visual-auditory cue software for reading training of motor speech disorder patients. Motor speech disorder patients can use the visual and/or auditory cues for reading training and improving their symptom. The software provides some sentences with visual-auditory cues. Our sentences used for reading training are adequately comprised on modulation training according to a professional advice in speech therapy field. To ameliorate reading skills we developed two algorithms, first one is automatically searching the starting time of speech spoken by patients and the other one is removing auditory-cue from the recorded speech that recorded at the same time. The searching of speech starting time was experimented by 10 sentences per 6 subjects in four kinds of noisy environments thus the results is that $7.042{\pm}8.99[ms]$ error was detected. The experiment of the cancellation algorithm of auditory-cue was executed from 6 subjects with 1 syllable speech. The result takes improved the speech recognition rate $25{\pm}9.547[%]$ between before and after cancellation of auditory-cue in speech. User satisfaction index of the developed program was estimated as good.

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding (멀티채널 Spatial Audio Coding에서의 효율적인 Spatial Cues 사용과 그에 따른 Spectrum 보상방법)

  • Lee, Byong-Hwa;Beack, Seung-Kwon;Seo, Jeong-Gil;Han, Min-Soo
    • MALSORI
    • /
    • no.53
    • /
    • pp.157-169
    • /
    • 2005
  • This paper proposes an efficiently representing method of spatial cues on multichannel spatial audio coding. The Binaural Cue Coding (BCC) method introduced recently represents multichannel audio signals by means of Inter Channel Level Difference (ICLD) or Source Index (SI). We tried to express more efficiently ICLD and SI information based on Inter Channel Correlation in this paper. We adopt different spatial cues according to ICC and propose a compensation method of empty spectrums created by using SI. We performed a MOS test and measuring spectral distortion. The results show that the proposed method can reduce the bitrate of side information without large degradation of the audio quality.

  • PDF

The role of prosodic phrasing in Korean word segmentation (음운 구조가 한국어 단어 분절에 미치는 영향)

  • Kim, Sa-Hyang
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.114-118
    • /
    • 2007
  • The current study investigates the degree to which various prosodic cues at the boundaries of a prosodic phrase in Korean (Accentual Phrase) contributed to word segmentation. Since most phonological words in Korean are produced as one AP, it was hypothesized that the detection of acoustic cues at AP boundaries would facilitate word segmentation. The prosodic characteristics of Korean APs include initial strengthening at the beginning of the phrase and pitch rise and final lengthening at the end. A perception experiment revealed that the cues that conform to the above-mentioned prosodic characteristics of Korean facilitated listeners' word segmentation. Results also showed that duration and amplitude cues were more helpful in segmentation than pitch. Further, the results showed that a pitch cue that did not conform to the Korean AP interfered with segmentation.

  • PDF