• Title/Summary/Keyword: Speech Perception

Search Result 397, Processing Time 0.023 seconds

Perception of Tamil Mono-Syllabic and Bi-Syllabic Words in Multi-Talker Speech Babble by Young Adults with Normal Hearing

  • Gnanasekar, Sasirekha;Vaidyanath, Ramya
    • Journal of Audiology & Otology
    • /
    • v.23 no.4
    • /
    • pp.181-186
    • /
    • 2019
  • Background and Objectives: This study compared the perception of mono-syllabic and bisyllabic words in Tamil by young normal hearing adults in the presence of multi-talker speech babble at two signal-to-noise ratios (SNRs). Further for this comparison, a speech perception in noise test was constructed using existing mono-syllabic and bi-syllabic word lists in Tamil. Subjects and Methods: A total of 30 participants with normal hearing in the age range of 18 to 25 years participated in the study. Speech-in-noise test in Tamil (SPIN-T) constructed using mono-syllabic and bi-syllabic words in Tamil was used as stimuli. The stimuli were presented in the background of multi-talker speech babble at two SNRs (0 dB and +10 dB SNR). Results: The effect of noise on SPIN-T varied with SNR. All the participants performed better at +10 dB SNR, the higher of the two SNRs considered. Additionally, at +10 dB SNR performance did not vary significantly for neither mono-syllabic or bi-syllabic words. However, a significant difference existed at 0 dB SNR. Conclusions: The current study indicated that higher SNR leads to better performance. In addition, bi-syllabic words were identified with minimal errors compared to mono-syllabic words. Spectral cues were the most affected in the presence of noise leading to more of place of articulation errors for both mono-syllabic and bi-syllabic words.

Perception of Tamil Mono-Syllabic and Bi-Syllabic Words in Multi-Talker Speech Babble by Young Adults with Normal Hearing

  • Gnanasekar, Sasirekha;Vaidyanath, Ramya
    • Korean Journal of Audiology
    • /
    • v.23 no.4
    • /
    • pp.181-186
    • /
    • 2019
  • Background and Objectives: This study compared the perception of mono-syllabic and bisyllabic words in Tamil by young normal hearing adults in the presence of multi-talker speech babble at two signal-to-noise ratios (SNRs). Further for this comparison, a speech perception in noise test was constructed using existing mono-syllabic and bi-syllabic word lists in Tamil. Subjects and Methods: A total of 30 participants with normal hearing in the age range of 18 to 25 years participated in the study. Speech-in-noise test in Tamil (SPIN-T) constructed using mono-syllabic and bi-syllabic words in Tamil was used as stimuli. The stimuli were presented in the background of multi-talker speech babble at two SNRs (0 dB and +10 dB SNR). Results: The effect of noise on SPIN-T varied with SNR. All the participants performed better at +10 dB SNR, the higher of the two SNRs considered. Additionally, at +10 dB SNR performance did not vary significantly for neither mono-syllabic or bi-syllabic words. However, a significant difference existed at 0 dB SNR. Conclusions: The current study indicated that higher SNR leads to better performance. In addition, bi-syllabic words were identified with minimal errors compared to mono-syllabic words. Spectral cues were the most affected in the presence of noise leading to more of place of articulation errors for both mono-syllabic and bi-syllabic words.

Perceptual Characteristics of Korean Vowels Distorted by the Frequency Band Limitation (주파수 대역 제한에 의한 한국어 모음의 지각 특성 분석)

  • Kim, YeonWhoa;Choi, DaeLim;Lee, Sook-Hyang;Lee, YongJu
    • Phonetics and Speech Sciences
    • /
    • v.6 no.1
    • /
    • pp.85-93
    • /
    • 2014
  • This paper investigated the effects of frequency band limitation on perceptual characteristics of Korean vowels. Monosyllabic speech (144 syllables of CV type, 56 syllables of VC type, 8 syllables of V type) produced by two announcers were low- and high-pass filtered with cutoff frequencies ranging from 300 to 5000 Hz. Six listeners with normal hearing performed perception tests by types of filter and cutoff frequencies. We reported phoneme recognition rates and types of perception error of band-limited Korean vowels to examine how frequency distortion in the process of speech transmission affect listener's perception.

Speech Production and Perception of Word-medial Singleton and Geminate Sonorants in Korean (한국어 어중 공명 중첩자음과 단자음의 조음 및 지각)

  • Kim, Taekyung
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.145-155
    • /
    • 2013
  • This study investigated the articulatory characteristics of Korean singleton and geminate sonorants in the word-medial position, effects of the duration of the sonorant consonant and the preceding vowel on perception, and the difference between native Korean speakers and foreign learners of Korean in perceiving the singleton and geminate consonant contrast. The Korean sonorant consonants(/m, n, l/) are examined from the VCCV, VCV sequences through speech production and perception experiments. The results suggest that the duration of the sonorant consonant is the most important factor for native Korean speakers to recognize whether sonorants are overlapped, and the duration of preceding vowel and other factors affect the recognition of singleton/geminate consonant contrast if the duration is not obvious. A perception experiment showed Chinese Korean language learners did not clearly distinguish singleton consonants from geminate consonants. The results of this study provide basic data for recognition of singleton/geminate consonant contrast in word-medial of Korean language, and can be utilized for teaching Korean pronunciation as a foreign language.

Relationship between executive function and cue weighting in Korean stop perception across different dialects and ages

  • Kong, Eun Jong;Lee, Hyunjung
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.21-29
    • /
    • 2021
  • The present study investigated how one's cognitive resources are related to speech perception by examining Korean speakers' executive function (EF) capacity and its association with voice onset time (VOT) and f0 sensitivity in identifying Korean stop laryngeal categories (/t'/ vs. /t/ vs. /th/). Previously, Kong et al. (under revision) reported that Korean listeners (N = 154) in Seoul and Changwon (Gyeongsang) showed differential group patterns in dialect-specific cue weightings across educational institutions (college, high school, and elementary school). We follow up this study by further relating their EF control (working memory, mental flexibility, and inhibition) to their speech perception patterns to examine whether better cognitive ability would control attention to multiple acoustic dimensions. Partial correlation analyses revealed that better EFs in Korean listeners were associated with greater sensitivity to available acoustic details and with greater suppression of irrelevant acoustic information across subgroups, although only a small set of EF components turned out to be relevant. Unlike Seoul participants, Gyeongsang listeners' f0 use was not correlated with any EF task scores, reflecting dialect-specific cue primacy using f0 as a secondary cue. The findings confirm the link between speech perception and general cognitive ability, providing experimental evidence from Korean listeners.

Perceptual Dimensions of Korean Vowel: A Link between Perception and Production (한국어 모음의 지각적 차원 -지각과 산출간의 연동-)

  • Choi, Yang-Gyu
    • Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.181-191
    • /
    • 2001
  • The acoustic quality of a vowel is known to be mostly determined by the frequencies of the first formant(Fl) and the second formant(F2). The perceptual(or psychological) dimensions of vowel perception were examined in this study. Also the relationships among perceptual dimensions, acoustical dimensions(Fl & F2), and articulatory gestures of vowel were discussed. Using multi-dimensional scaling(MDS) technique, the experiment was performed in order to identify the perceptual dimensions of the perception of Korean vowel. In the experiment 8 Seoul standard speakers performed the similarity rating task of 10 synthesized Korean vowels. Two-dimensional MDS solution based. on the similarity rating scores was obtained. The results showed that two perceptual dimensions, D1 and D2 were correlated strongly with F2 and F1(r = -.895 and .878 respectively), and were so interpreted as 'vowel advancement' and 'vowel height' respectively. The relationship between the perceptual dimensions of vowel and the articulatory positions of tongue suggested that perception may be directly linked to production. Further research problems were discussed in the .final section.

  • PDF

Perceptual Structure of Korean Consonants in High Vowel Contexts (고설 모음 환경에서 한국어 자음의 지각적 구조)

  • Bae, Moon-Jung
    • Phonetics and Speech Sciences
    • /
    • v.1 no.2
    • /
    • pp.95-103
    • /
    • 2009
  • We investigated the perceptual structure of Korean consonants by analyzing the confusion among consonants in various vowel contexts. The 36 CV syllable types combined by 18 consonants and 2 vowels (/i/ and /u/) were presented with masking noises or in degraded intensity. The confusion data were analyzed by the INDSCAL (Individual Difference Scaling), ADCLUS (Additive Clustering) and the probability of the transmitted information. The results were compared with those of a previous study with /a/ vowel context (Bae and Kim, 2002). The overall results showed that the laryngeal features-aspiration, lax and tense-are the most salient features in the perception of Korean consonant regardless of vowel contexts, but the perceptual saliency of place features varies across vowel conditions. In high vowel (front and back vowel) contexts, sibilant consonants were perceptually salient compared to in low vowel contexts. In back vowel contexts, grave (labial and velar) consonants were perceptually salient. These findings imply that place features and vowel features strongly interact in speech perception as well as in speech production. All statistical measures from our confusion data ensured that the perceptual structure of Korean consonants correspond to the hierarchical structure suggested in the feature geometry (Clements, 1991). We discuss the link between speech perception and production as the basis of phonology.

  • PDF

The Role of L1 Phonological Feature in the L2 Perception and Production of Vowel Length Contrast in English

  • Chang, Woo-Hyeok
    • Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.37-51
    • /
    • 2008
  • The main goal of this study is to examine if there is a difference in the utilization of a vowel length cue between Korean and Japanese L2 learners of English in their perception and production of postvocalic coda contrast in English. Given that Japanese subjects' performances on the identification and production tasks were much better than Korean subjects' performance, we may support the prediction based on the Feature Hypothesis which maintains that L1 phonological features can facilitate the perception of L2 acoustic cue. Since vowel length contrast is a phonological feature in Japanese but not in Korean, the tasks, which assess L2 leaners' ability to discriminate vowel length contrast in English, are much easier for the Japanese group than for the Korean group. Although the Japanese subjects demonstrated a better performance than the Korean subjects, the performance of the Japanese group was worse than that of the English control group. This finding implies that L2 learners, even Japanese learners, should be taught that the durational difference of the preceding vowels is the most important cue to differentiate postvocalic contrastive codas in English.

  • PDF

The Relationship Between Perception of Prosody, Pitch Discrimination, and Melodic Contour Identification in Cochlear Implants Recipients (인공와우이식 난청인의 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인 간 관련성)

  • Kim, Eun Yeon;Moon, Il Joon;Cho, Yang-sun;Chung, Won-ho;Hong, Sung Hwa
    • Journal of Music and Human Behavior
    • /
    • v.14 no.2
    • /
    • pp.1-18
    • /
    • 2017
  • The relationships between the ability to understand changes in meaning depending on the prosody of spoken words and the ability to perceive pitch and melodic contour in cochlear implants (CI) recipients were examined. Fifteen postlingual CI recipients were measured in terms of speech prosody perception, speech perception, pitch discrimination (PD), and melody contour identification (MCI). The speech prosody perception test consists of words with positive (PW) and neutral meaning (NW). Participants were asked to identify the meaning of words depending on the conditions of positive and negative prosody. The MCI consists of subtests 1 and 2 with different chance levels to choose. Then, the relationships between speech prosody perception, speech perception, PD, and MCI performance were analyzed. There was a significant difference in identifying the meaning of words expressed in a different prosody between the PW and NW conditions. Speech prosody perception showed a significant correlation with MCI 1 while there was no significant relationship with speech perception. Although speech perception may be possible after CI, limited spoken word comprehension due to decreased sensitivity for prosodic changes may persist in CI recipients. In addition, there was a limitation in perception of melodic contour change compared to pitch discrimination, which is related to speech prosody perception.

Perceptual Characteristics of Korean Consonants Distorted by the Frequency Band Limitation (주파수 대역 제한에 의한 한국어 자음의 지각 특성 분석)

  • Kim, YeonWhoa;Choi, DaeLim;Lee, Sook-Hyang;Lee, YongJu
    • Phonetics and Speech Sciences
    • /
    • v.6 no.1
    • /
    • pp.95-101
    • /
    • 2014
  • This paper investigated the effects of frequency band limitation on perceptual characteristics of Korean consonants. Monosyllabic speech (144 syllables of CV type, 56 syllables of VC type, 8 syllables of V type) produced by two announcers were low- and high-pass filtered with cutoff frequencies ranging from 300 to 5000 Hz. Six listeners with normal hearing performed perception test by types of filter and cutoff frequencies. We reported phoneme recognition rates and types of perception error of band-limited Korean consonants to examine how frequency distortion in the process of speech transmission affect listener's perception. The results showed that recognition rates varied with the following factors: position in a syllable, manner of articulation, place of articulation, and phonation types. Consonants in the final position were stronger to the frequency band limitation than those in the initial position. Fricatives and Affricates are stronger than stops. Fortis consonants were less stronger than their lenis or aspirated counterparts. Types of perception error also varied depending on such factors as consonant's place of articulation: In case of bilabial stops, they were perceived as alveolar stops with while in cases of alveolar and velar stops, there were changes in phonation types without any change in the place of articulation.