• Title/Summary/Keyword: Speech Perception

Search Result 398, Processing Time 0.027 seconds

Predictability effects on speech perception in noise (SPIN) in Korean (한국어 소음속말인지에 나타나는 예측성 효과)

  • Lee, Sun-Young
    • Korean Journal of Cognitive Science
    • /
    • v.27 no.1
    • /
    • pp.129-157
    • /
    • 2016
  • This study investigates speech perception in noise (SPIN) in Korean. A new type of Korean SPIN test was developed by adopting a similar format to the English SPIN test. The predictability effects, noise effects and their interactions were examined in order to verify the previous findings based on English. The data from 14 Korean adults collected with this new type of Korean SPIN test confirmed the previous findings: first, the participants' overall performance was better in low noise conditions than in high noise conditions. Secondly, there was a tendency for highly predictable words to be more accurately perceived than less predictable words especially in high noise conditions. The results were interpreted in such a way that the listeners actively used both types of information: acoustic information and contextual information in speech perception. When the acoustic property of the speech sound was degraded with noise, the listeners took advantage of the linguistic contextual information in their processing of the speech sound. The findings of this study conform to those of the previous studies based on the English SPIN test. In addition, a possible effect of the frequency of target word was also found, calling for further investigation in this field of research in Korean. Implications of the results were also discussed. (Cyber Hankuk University of Foreign Studies)

  • PDF

Cross-sectional perception studies of children's monosyllabic word by naive listeners (일반 청자의 아동 발화 단음절에 대한 교차 지각 분석)

  • Ha, Seunghee;So, Jungmin;Yoon, Tae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.21-28
    • /
    • 2022
  • Previous studies have provided important findings on children's speech production development. They have revealed that essentially all aspects of children's speech shift toward adult-like characteristics over time. Nevertheless, few studies have examined the perceptual aspects of children's speech tokens, as perceived by naive adult listeners. To fill the gap between children's production and adults' perception, we conducted cross-sectional perceptual studies of monosyllabic words produced by children aged two to six years. Monosyllabic words in the consonant-vowel-consonant form were extracted from children's speech samples and presented aurally to five listener groups (20 listeners in total). Generally, the agreement rate between children's production of target words and adult listeners' responses increases with age. The perceptual responses to tokens produced by two-year old children induced the largest discrepancies and the responses to words produced by six years olds agreed the most. Further analyses were conducted to identify the sources of disagreement, including the types of segments and syllable structure. This study makes an important contribution to our understanding of the development and perception of children's speech across age groups.

Perception of the English Epenthetic Stops by Korean Listeners

  • Han, Jeong-Im
    • Speech Sciences
    • /
    • v.11 no.1
    • /
    • pp.87-103
    • /
    • 2004
  • This study investigates Korean listeners' perception of the English stop epenthesis between the sonorant and fricative segments. Specifically this study investigates 1) how often English epenthetic stops are perceived by native Korean listeners, given the fact that Korean does not allow consonant clusters in codas; and 2) whether perception of the epenthetic stops, which are optional phonetic variations, not phonemes, could be improved without any explicit training. 120 English non-words with a mono-syllable structure of CVC1C2, where C1=/m, n, $\eta$, 1/, and C2=/s, $\theta$, $\int$/, were given to two groups of native Korean listeners, and they were asked to detect the target stops such as [p], [t], and [k]. The number of their responses were computed to determine how often listeners succeed in recovering the string of segments produced by the native English speaker. The results of the present study show that English epenthetic stops are poorly identified by native Korean listeners with low English proficiency, even in the case where stimuli with strong acoustic cues are provided with, but perception of epenthetic stops is closely related with listeners' English proficiency, showing the possibility of the improvement of perception. It further shows that perception of epenthetic stops shows asymmetry between coronal and non-coronal consonants.

  • PDF

Comparison of Speech Rate and Long-Term Average Speech Spectrum between Korean Clear Speech and Conversational Speech

  • Yoo, Jeeun;Oh, Hongyeop;Jeong, Seungyeop;Jin, In-Ki
    • Journal of Audiology & Otology
    • /
    • v.23 no.4
    • /
    • pp.187-192
    • /
    • 2019
  • Background and Objectives: Clear speech is an effective communication strategy used in difficult listening situations that draws on techniques such as accurate articulation, a slow speech rate, and the inclusion of pauses. Although too slow speech and improperly amplified spectral information can deteriorate overall speech intelligibility, certain amplitude of increments of the mid-frequency bands (1 to 3 dB) and around 50% slower speech rates of clear speech, when compared to those in conversational speech, were reported as factors that can improve speech intelligibility positively. The purpose of this study was to identify whether amplitude increments of mid-frequency areas and slower speech rates were evident in Korean clear speech as they were in English clear speech. Subjects and Methods: To compare the acoustic characteristics of the two methods of speech production, the voices of 60 participants were recorded during conversational speech and then again during clear speech using a standardized sentence material. Results: The speech rate and longterm average speech spectrum (LTASS) were analyzed and compared. Speech rates for clear speech were slower than those for conversational speech. Increased amplitudes in the mid-frequency bands were evident for the LTASS of clear speech. Conclusions:The observed differences in the acoustic characteristics between the two types of speech production suggest that Korean clear speech can be an effective communication strategy to improve speech intelligibility.

Application of sinusoidal model to perception of electrical hearing in cochlear implants (인공와우 전기 청각 인지에 대한 정현파 모델 적용에 관한 연구)

  • Lee, Sungmin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.1
    • /
    • pp.52-57
    • /
    • 2022
  • Speech consists of the sum of complex sine-waves. This study investigated the perception of electrical hearing by applying the sinusoidal model to cochlear implant simulation. Fourteen adults with normal hearing participated in this study. The sentence recognition tests were implemented using the sentence lists processed by the sinusoidal model which extracts 2, 4, 6, 8 sine-wave components and sentence lists processed by the same sinusoidal model along with cochlear implant simulation (8 channel vocoders). The results showed lower speech recognition for the sentence lists processed by the sinusoidal model and cochlear implant simulation compared to those by the sinusoidal model alone. Notably, the lower the number of sine-wave components (2), the larger the difference was. This study provides the perceptual pattern of sine-wave speech for electrical hearing by cochlear implant listeners, and basic data for development of speech processing algorithms in cochlear implants.

A Production and Perception Experiment of Korean Alveolar Fricatives

  • Yoon, Kyu-Chul
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.169-184
    • /
    • 2002
  • Korean has two types of voiceless alveolar fricatives: a non-tense fricative /$S^{h}$ and a tense fricative /s'/. Twenty native speakers of Korean produced five pairs of isolated words containing word initial $S^{h}V$ and /s'V/ sequences where V was any one of five (/a, e, i, o, u/) of Korean vowels. Acoustic measures such as duration, fricative noise prominent frequency, energy change of following vowel, and fundamental frequency at vowel onset were examined. Results showed that among the parameters, aspiration noise duration of /s'/ in mid and low vowel contexts was less than 21 ms. In a perception experiment, where only the aspiration noise interval of the /$S^{h}$/ tokens was incrementally reduced, some listeners shifted perception from /$S^{h}$/ to /s'/.

  • PDF

A Study on Perceptual Sensitivity to Prosodic Cues in Disambiguation (중의성 해소에 기여하는 억양단서의 인지적 민감도 연구)

  • Kim, Mi-Hye;Kang, Sun-Mi;Kim, Kee-Ho
    • Phonetics and Speech Sciences
    • /
    • v.3 no.4
    • /
    • pp.3-11
    • /
    • 2011
  • This experimental study has a goal to explore the perceptual sensitivity to phonetic evidence such as duration, phrase accent, or pause in disambiguation. We argue that the realization of the intonational phrasal boundary at the meaningful grammatical boundary in structurally ambiguous sentences facilitates English native listeners to distinguish the meanings of the ambiguous sentences. Moreover, the duration of the phrase-final syllable, pitch range reset, or phrasal tones also provides listeners with important phonetic evidence in disambiguation. In our perception experiment, however, Korean English learners largely depend on the realization of pause. In the results from the perception experiment, all of the groups showed an increase in the response time from the perception of no pause to pause realization. This means that pause at the phonological phrasal boundary plays a role of facilitator to English native speakers with other prosodic cues such as duration, pitch accent, or phrasal tones, while an absolutely important cue to Korean English learners.

  • PDF

Investigation about Japanese perception of Korean Tense Consonants (일본어 모국어 화자의 한국어 경음 지각)

  • Kwon, Yeonjoo
    • Phonetics and Speech Sciences
    • /
    • v.7 no.2
    • /
    • pp.77-83
    • /
    • 2015
  • The aim of this paper is to investigate Japanese speakers' perception of Korean tense consonants. In a range of perceptual experiments Japanese participants were directed to label Korean stimuli using Japanese katakana characters. The analysis of the results showed a strong influence of Japanese phonology in the responses. Japanese perception of sokuon was increased, (1) when the tense consonants were in word medial position, (2) when tense consonants were other than /s/, (3) when the tense consonant followed voiceless consonants, (4) when the consonants were part of a cluster sharing their point of articulation, (5) when preceding vowel were other than /u/, (6) when following vowel were /u/. This result, showing preference for phonology, is in harmony with previous research on the Japanese sokuon perception using Japanese (Takeyasu 2009, Matsui 2011), and Italian (Tanaka & Kubozono 2008) stimuli.

Korean Native Speakers' Perception of English Sounds According to the Groupings of Phonetic Contrasts

  • Kim, Gi-Na;Kim, Soo-Jin
    • Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.59-67
    • /
    • 2003
  • The purpose of this study was to investigate Korean native speakers' perception of English sounds according to groupings of phonetic contrasts. The four groupings looked at were vowels, voicing (voiced-unvoiced), fricatives with difference in place of articulation, and other clusters of specific sound contrasts, such as stop-fricatives and liquids. The position of a sound in syllable was also examined. According to the results of ANOVA and a post-hoc analysis, the perception of vowels, in the medial position was different from that of consonants in the initial and final position. Vowels proved to be the most difficult group to perceive correctly. With the consonants, there was not a big difference whether the contrasts came initially or finally. The order of difficulty was liquids, fricatives, stop-fricatives, and finally voicing.

  • PDF

Korean and Japanese EFL Learners' AV Benefit for the Perception of the Liquid Contrast in English (한국인 및 일본인 영어학습자의 유음 차이 지각에 미치는 시각/청각 효과)

  • Chung, Hyun-Song
    • MALSORI
    • /
    • no.60
    • /
    • pp.1-11
    • /
    • 2006
  • This paper investigated AV benefit of Korean and Japanese EFL learners' perception of the liquid contrast in English. In a perception experiment, the two English consonants /l/ and /r/ were embedded in initial and medial position in nonsense words in the context of the vowels /i, a, u/. Singletons and clusters were included in the speech material. Audio and video recordings were made using a total of 108 items. The items were presented to Korean and Japanese learners of English in three conditions: audio-alone (A), visual-alone (V) and audio-visual presentation (AV). The results showed that there was no evidence of AV benefit for the perception of the /l/-/r/ contrast for either Korean or Japanese learners of English. The results suggest that increasing auditory proficiency in identifying a non-native contrast is linked with an increasing proficiency in using visual cues to the contrast.

  • PDF