• Title/Summary/Keyword: spoken word production

The influence of task demands on the preparation of spoken word production: Evidence from Korean

  • Choi, Tae-Hwan;Oh, Sujin;Han, Jeong-Im
    • Phonetics and Speech Sciences / v.9 no.4 / pp.1-7 / 2017
  • Speech production studies have shown that the preparation unit of spoken word production is language-specific: onset phonemes for English and Dutch, syllables for Mandarin Chinese, and morae for Japanese. However, results have been inconsistent on whether the onset phoneme is a planning unit of spoken word production in Korean. In this study, two sets of experiments, implicit priming and word naming with the form preparation paradigm, investigated possible influences of task demands on phonological preparation in native Korean adults. Only the word naming task, and not the implicit priming task, showed a significant onset priming effect, even though there were significant syllable priming effects in both tasks. Following the attentional theory (O'Séaghdha & Frazer, 2014), these results suggest that task demands might play a role in the absence or presence of onset priming effects in Korean. Native Korean speakers could maintain their attention on the shared onset phonemes in word naming, which is not very demanding, whereas they have difficulty allocating attention to such units in the more cognitively demanding implicit priming task, even though both tasks involve accessing phonological codes. These findings demonstrate that there are cross-linguistic differences in the first selectable unit in the preparation of spoken word production, but that within a single language the preparation unit might not be immutable.

Three-Stage Framework for Unsupervised Acoustic Modeling Using Untranscribed Spoken Content

  • Zgank, Andrej
    • ETRI Journal / v.32 no.5 / pp.810-818 / 2010
  • This paper presents a new framework for integrating untranscribed spoken content into the acoustic training of an automatic speech recognition system. Untranscribed spoken content plays a very important role for under-resourced languages because the production of manually transcribed speech databases still represents a very expensive and time-consuming task. We proposed two new methods as part of the training framework. The first method focuses on combining initial acoustic models using a data-driven metric. The second method proposes an improved acoustic training procedure based on unsupervised transcriptions, in which word endings were modified by broad phonetic classes. The training framework was applied to baseline acoustic models using untranscribed spoken content from parliamentary debates. We include three types of acoustic models in the evaluation: baseline, reference content, and framework content models. The best overall result of 18.02% word error rate was achieved with the third type. This result demonstrates statistically significant improvement over the baseline and reference acoustic models.

Prosodic Strengthening in Speech Production and Perception: The Current Issues

  • Cho, Tae-Hong
    • Speech Sciences / v.14 no.4 / pp.7-24 / 2007
  • This paper discusses some current issues regarding how prosodic structure is manifested in fine-grained phonetic details, how prosodically-conditioned articulatory variation is explained in terms of speech dynamics, and how such phonetic manifestation of prosodic structure may be exploited in spoken word recognition. Prosodic structure is phonetically manifested in prosodically important landmark locations such as prosodic domain-final position, domain-initial position and stressed/accented syllables. It will be discussed how each of the prosodic landmarks engenders particular phonetic patterns, how articulatory variation in such locations is dynamically accounted for, and how prosodically-driven fine-grained phonetic detail is exploited by listeners in speech comprehension.

An acoustic and perceptual investigation of the vowel length contrast in Korean

  • Lee, Goun;Shin, Dong-Jin
    • Phonetics and Speech Sciences / v.8 no.1 / pp.37-44 / 2016
  • The goal of the current study is to investigate how sound change is reflected in production and in perception, and what effect lexical frequency has on the loss of sound contrasts. Specifically, the current study examined whether vowel length contrasts are retained in Korean speakers' productions, and whether Korean listeners can distinguish vowel length minimal pairs in perception. Two production experiments and two perception experiments investigated this. For the production tests, twelve native Korean speakers in their 20s and 40s completed a read-aloud task as well as a map task. The results showed that, regardless of age group, all Korean speakers produced vowel length contrasts with small but significant differences in the read-aloud task. Interestingly, the difference between long and short vowels disappeared in the map task, indicating that speech mode affects the production of vowel length contrasts. For the perception tests, thirty-three Korean listeners completed a discrimination test and a forced-choice identification test. The results showed that Korean listeners still have the perceptual sensitivity to distinguish the lexical meanings of vowel length minimal pairs. We also found that identification accuracy was affected by word frequency, with higher identification accuracy for high- and mid-frequency words than for low-frequency words. Taken together, the current study demonstrated that speech mode (read-aloud vs. spontaneous) affects the production of a sound undergoing language change, and that word frequency affects the sound change in speech perception.

Recent update on reading disability (dyslexia) focused on neurobiology

  • Kim, Sung Koo
    • Clinical and Experimental Pediatrics / v.64 no.10 / pp.497-503 / 2021
  • Reading disability (dyslexia) refers to an unexpected difficulty with reading for an individual who has the intelligence to be a much better reader. Dyslexia is most commonly caused by a difficulty in phonological processing (the appreciation of the individual sounds of spoken language), which affects the ability of an individual to speak, read, and spell. In this paper, I describe reading disabilities by focusing on their underlying neurobiological mechanisms. Neurobiological studies using functional brain imaging have uncovered the reading pathways, brain regions involved in reading, and neurobiological abnormalities of dyslexia. The reading pathway is in the order of visual analysis, letter recognition, word recognition, meaning (semantics), phonological processing, and speech production. According to functional neuroimaging studies, the important areas of the brain related to reading include the inferior frontal cortex (Broca's area), the midtemporal lobe region, the inferior parieto-temporal area, and the left occipitotemporal region (visual word form area). Interventions for dyslexia can affect reading ability by causing changes in brain function and structure. An accurate diagnosis and timely specialized intervention are important in children with dyslexia. In cases in which national infant development screening tests have been conducted, as in Korea, if language developmental delay and early predictors of dyslexia are detected, careful observation of the progression to dyslexia and early intervention should be made.

Phonetic investigation of epenthetic vowels produced by Korean learners of English

  • Shin, Dong-Jin;Iverson, Paul
    • Phonetics and Speech Sciences / v.6 no.4 / pp.17-26 / 2014
  • The present study examined epenthetic vowels produced by Korean learners of English in read sentences, in terms of acoustic measures and extra-phonological factors. The results demonstrated three main findings. First, epenthetic vowels had relatively high F1 values and a wide range of F2 values. Most of the epenthetic vowels were inserted near Korean high central vowels, but some vowels were inserted near front vowels due to co-articulation with surrounding vowels. Second, vowel epenthesis was affected by the context. The results showed that the epenthesis was frequently seen with word junctions between obstruents (e.g., stops-fricatives). Third, Korean learners were not affected by English background and were very weakly affected by orthography. English experience, which is one of the extra-phonological factors, was not related to epenthesis production. However, orthography, the other extra-phonological factor, very weakly affected the amount of epenthesis production. Nine percent of all epenthesis production was affected by the English past-tense suffix '-ed'; approximately 70% of the participants were affected by this suffix. The findings of the present study contributed to understanding vowel epenthesis. First, the study revealed that the epenthetic vowels produced by Korean learners of English were close to the high central vowel, supporting previous studies that the epenthetic vowel is quite close to the shortest vowel. Second, the study examined the various phonetic environments of epenthetic vowels, revealing that vowel epenthesis occurred more frequently in a certain phonetic circumstance.

Formulaic Language Development in Asian Learners of English: A Comparative Study of Phrase-frames in Written and Oral Production

  • Yoon Namkung;Ute Romer
    • Asia Pacific Journal of Corpus Research / v.4 no.2 / pp.1-39 / 2023
  • Recent research in usage-based Second Language Acquisition has provided new insights into second language (L2) learners' development of formulaic language (Wulff, 2019). The current study examines the use of phrase-frames, which are recurring sequences of words including one or more variable slots (e.g., it is * that), in written and oral production data from Asian learners of English across four proficiency levels (beginner, low-intermediate, high-intermediate, advanced) and native English speakers. The variability, predictability, and discourse functions of the most frequent 4-word phrase-frames from the written essay and spoken dialogue sub-corpora of the International Corpus Network of Asian Learners of English (ICNALE) were analyzed and then compared across groups and modes. The results revealed that while learners' phrase-frames in writing became more variable and unpredictable as proficiency increased, no clear developmental patterns were found in speaking, although all groups used more fixed and predictable phrase-frames than the reference group. Further, no developmental trajectories in the functions of the most frequent phrase-frames were found in either mode. Additionally, lower-level learners and the reference group used more variable phrase-frames in speaking, whereas advanced-level learners showed more variability in writing. This study contributes to a better understanding of the development of L2 phraseological competence.

A Preliminary Report on Perceptual Resolutions of Korean Consonant Cluster Simplification and Their Possible Change over Time

  • Cho, Tae-Hong
    • Phonetics and Speech Sciences / v.2 no.4 / pp.83-92 / 2010
  • The present study examined how listeners of Seoul Korean would recover deleted phonemes in consonant cluster simplification. In a phoneme monitoring experiment, listeners had to monitor for C2 (/k/ or /p/) in C1C2C3 when C2 was deleted (C1 was preserved) or preserved (C1 was deleted). The target consonant (C2) was either /k/ or /p/ (e.g., ilk-təlato vs. palp-təlato), and there were two listener groups, one tested in 2002 and the other in 2009. Several points emerged from the results. First, listeners were able to detect deleted phonemes as accurately and rapidly as preserved phonemes, showing that the physical presence of the acoustic information did not improve the listeners' performance. This suggests that listeners must have relied on language-specific phonological knowledge about consonant cluster simplification rather than on low-level acoustic-phonetic information. Second, the listener groups (participants in 2002 vs. 2009) differed in processing /p/ versus /k/: listeners in 2009 failed to detect /p/ more frequently than those in 2002, suggesting that the way the consonant cluster sequence is produced and perceived has changed over time. This result was interpreted as coming from statistical patterns of speech production in contemporary Seoul Korean as reported in a recent study by Cho & Kim (2009): /p/ is deleted far more often than /p/ is preserved, which is likely reflected in the way listeners process simplified variants. Finally, listeners processed /k/ more efficiently than /p/, especially when the target was physically present (in C-preserved condition), indicating that listeners benefited more from the presence of /k/ than of /p/. This was interpreted as supporting the view that velars are perceptually more robust than labials, which constrains the shaping of phonological patterns of the language. These results were then discussed in terms of their implications for theories of spoken word recognition.

Literature Analysis on PROMPT Treatment (1984-2020) (프롬프트(PROMPT) 치료기법에 관한 문헌 분석(1984-2020년))

  • Kim, Wha-soo;Lee, Rio;Lee, Ji-woo
    • Journal of Digital Convergence / v.19 no.2 / pp.447-456 / 2021
  • This study analyzed 28 domestic and foreign studies on the Prompts for Restructuring Oral Muscular Phonetic Targets (PROMPT) treatment technique, published from 1984 to 2020, to prepare basic data for the development of PROMPT intervention programs and examination tools. According to the analysis, research has been conducted continuously since 1984, when the first PROMPT study appeared. Intervention studies were the most common research method (16 studies), speech disorders were the most frequent target condition, and the most frequent target age was 3 to 5 years. The most frequent treatment length was 16 sessions, and the activities were based on the Motor Speech Hierarchy (MSH), except for subjects with non-verbal autism spectrum disorder. In the analysis of dependent variables, 'speech production' was the most common, followed by 'speech motor control', 'articulation', and 'speech intelligibility'. Taken together, these studies suggest that PROMPT, which is directly useful for practicing spoken word production, is being used effectively outside Korea, and that a PROMPT program applicable domestically, in Korea, needs to be developed.