• Title/Abstract/Keywords: Speech Perception

A Study on Korean Students' Production and Perception of English Word-final Stop Voicing

  • Kang, Seok-Han
    • Speech Sciences (음성과학) / Vol. 14, No. 1 / pp.105-119 / 2007
  • The purpose of this study is to examine Korean students' production and perception of word-final stop voicing in light of their overseas experience. The subjects were native English speakers, Korean university students with residence experience in America, Korean university students without residence experience in America, and Korean elementary school students. All of them took part in both production and perception tests. The results showed that the production and perception of the students with residence experience in America were quite similar to those of the native English speakers. In the production tests, temporal and frequency features showed somewhat different results: one year of residence in America had some influence on the frequency features of word-final stop production, but not on the temporal features. A similar difference appeared in the perception tests. In the identification test in the final release environment, we found no difference between the Korean university students who had studied abroad and those who had not. Instead, the difference appeared in the cue influence test, in both the final release and non-release environments.

모음의 포먼트 변형에 따른 인공와우 이식 아동의 청각적 인지변화 (Perception Ability of Synthetic Vowels in Cochlear Implanted Children)

  • 허명진
    • Malsori (대한음성학회지:말소리) / No. 64 / pp.1-14 / 2007
  • The purpose of this study was to examine how auditory perception differs with changes in formants in children with profound hearing impairment who use cochlear implants. The subjects were 10 children after 15 months of experience with the implant; their mean chronological age was 8.4 years, with a standard deviation of 2.9 years. Auditory perception was assessed using acoustic-synthetic vowels. Each acoustic-synthetic vowel was produced by combining F1, F2, and F3 into a vowel, yielding 42 synthetic sounds, using the Speech GUI (Graphic User Interface) program. The data were analyzed with clustering analysis and on-line analytical processing to assess perception of the acoustic-synthetic vowels. The results showed that the auditory perception scores of the children with cochlear implants were higher for the F2 synthetic vowels than for the F1 synthetic vowels. The children also perceived differences between vowels in terms of the distance ratio between F1 and F2 in specific vowels.

아동이 산출한 치조마찰음 /ㅅ/에 대한 청지각적·음향학적 연구 (A perceptual and acoustical study of /ㅅ/ in children's speech)

  • 김지연;성철재
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 10, No. 3 / pp.41-48 / 2018
  • This study examined the acoustic characteristics of Korean alveolar fricatives produced by typically developing children. Children aged 3 and 7 produced two types of nonsense syllables containing the alveolar fricative, /sV/ and /VsV/ sequences, where V was one of the three corner vowels (/i/, /a/, and /u/). Stimuli drawn from the speech materials of the production experiment were presented in random order to 12 speech language pathologists (SLPs) for a perception test. The SLPs responded by selecting one of seven alternative sounds. The acoustic measures examined were duration of frication noise, normalized intensity, skewness, and center of gravity. The acoustic measures differed significantly across vowels. A comparison of syllable structures showed statistically significant differences in duration of frication noise and normalized intensity. The acoustic parameters could account for the perceptual data. Relating the acoustic and perception data by means of logistic regression suggests that duration of frication noise and normalized intensity are the primary cues to perceiving Korean fricatives.
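
Two of the acoustic measures named in this abstract, center of gravity and skewness, are commonly computed as spectral moments of the frication noise. The abstract does not give the formulas the authors used, so the following is only a minimal sketch under the usual definition, in which the power spectrum is treated as a probability distribution over frequency. The function name `spectral_moments` and the toy spectrum are hypothetical and are not data from the study.

```python
import math

def spectral_moments(freqs_hz, power):
    """Return (center_of_gravity_hz, sd_hz, skewness) of a power spectrum.

    Assumption: the spectrum is normalized to sum to 1 and treated as a
    probability distribution over frequency (a common convention, not
    necessarily the one used in the paper).
    """
    total = sum(power)
    if total <= 0:
        raise ValueError("power spectrum must have positive total energy")
    weights = [p / total for p in power]
    # First moment: the weighted mean frequency (center of gravity).
    cog = sum(f * w for f, w in zip(freqs_hz, weights))
    # Second central moment: spread of energy around the center of gravity.
    var = sum((f - cog) ** 2 * w for f, w in zip(freqs_hz, weights))
    sd = math.sqrt(var)
    # Third central moment, scaled by sd^3, gives the skewness.
    third = sum((f - cog) ** 3 * w for f, w in zip(freqs_hz, weights))
    skew = third / sd ** 3 if sd > 0 else 0.0
    return cog, sd, skew

# Hypothetical toy spectrum with most energy near 6 kHz (illustration only).
freqs = [2000.0, 4000.0, 6000.0, 8000.0]
power = [1.0, 2.0, 4.0, 1.0]
cog, sd, skew = spectral_moments(freqs, power)
```

With this toy input the energy tail extends toward lower frequencies, so the skewness comes out negative; a real analysis would take `freqs_hz` and `power` from an FFT of the frication interval.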

Speech Recognition by Neural Net Pattern Recognition Equations with Self-organization

  • Kim, Sung-Ill;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea
    • /
    • Vol. 22, No. 2E
    • /
    • pp.49-55
    • /
    • 2003
  • This study applies modified neural net pattern recognition equations to speech recognition. The proposed method has a dynamic process of self-organization that has proved successful in recognizing depth in stereoscopic vision. This study shows that the process is also useful for recognizing human speech. Input vocal signals are first compared with standard models to measure similarities, which are then passed to a process of self-organization in the neural net equations. Competitive and cooperative processes are carried out among neighboring input similarities, so that only one winner neuron is finally detected. In a comparative study, the proposed neural networks outperformed a conventional HMM speech recognizer under the same conditions.
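
The competitive and cooperative step described in this abstract can be illustrated with a generic winner-take-all iteration. This is not the authors' set of equations, which the abstract does not give. It is a minimal sketch that assumes the units are ordered along one dimension, that each unit receives a small amount of support from its immediate neighbors (cooperation), and that activations are then squared and renormalized (competition). The function name `winner_take_all` and the parameters `coop` and `steps` are hypothetical.

```python
def winner_take_all(similarities, coop=0.2, steps=30):
    """Iterate a simple cooperate-then-compete update on similarity scores.

    Assumptions (not from the paper): non-negative similarities, units laid
    out on a line, and missing neighbors at the edges treated as 0.
    Returns (index_of_winner, final_activations).
    """
    acts = list(similarities)
    n = len(acts)
    for _ in range(steps):
        supported = []
        for i in range(n):
            left = acts[i - 1] if i > 0 else 0.0
            right = acts[i + 1] if i < n - 1 else 0.0
            # Cooperation: each unit is reinforced by its neighbors' mean.
            supported.append(acts[i] + coop * (left + right) / 2.0)
        total = sum(s * s for s in supported)
        if total == 0:
            raise ValueError("at least one similarity must be non-zero")
        # Competition: squaring and renormalizing favors the strongest unit.
        acts = [s * s / total for s in supported]
    winner = max(range(n), key=acts.__getitem__)
    return winner, acts

# Hypothetical similarity scores between one input and five standard models.
winner, acts = winner_take_all([0.2, 0.5, 0.9, 0.6, 0.1])
```

Because of the cooperation term, the final activations are not exactly one-hot: the winner's neighbors keep a small residual activation, but a single unit clearly dominates.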

Perception and production of English fricatives by Chinese learners of English: Error patterns and perception-production relationship

  • Zhang, Buyi;Zhang, Jiaqi;Lee, Sook-hyang
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 13, No. 1 / pp.25-36 / 2021
  • This study examined the perception and production of eight English fricatives, /f/, /v/, /θ/, /ð/, /s/, /z/, /ʃ/, and /ʒ/, by thirty Chinese English majors and thirty Chinese middle school students, using a fricative identification test, an intelligibility test, and a goodness rating test. It focused on error patterns and the perception-production relationship. The results showed that substitution errors occurred frequently in the perception and production of English fricatives by both the English majors and the middle school students. The error patterns were attributed to several factors: negative transfer from the Chinese consonant inventory, hypercorrection or overcompensation, deficiencies in L2 teaching, and acoustic similarity. Significant overall correlations were found between fricative perception and production in the two subject groups, but they did not hold for all eight fricatives, indicating that Chinese learners' perceptual competence with the target fricatives was not necessarily tied to their production of those sounds in all cases. Furthermore, perception preceded production for only some of the eight fricatives, which suggests that perception might not always be a necessary prerequisite for production. Differences by subject group and vowel context were also observed. The English majors performed better than the middle school students in both perception and production, and the subjects' performance in perception and production varied with vowel context.

언어 사용력(Speech Register)원리를 활용한 유아의 교육용 로봇 인식 (Applying the Speech Register Principle to Young Children's Perception of the Intelligent Service Robot)

  • 현은자;이하원;연혜민
    • The Journal of the Korea Contents Association (한국콘텐츠학회논문지) / Vol. 12, No. 10 / pp.532-540 / 2012
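  • The purpose of this study was to investigate how five-year-old children who had experienced robots at early childhood education institutions perceive robots. The theoretical background was speech register theory, and speech tone was compared across a robot, a friend, and a doll. Fifty five-year-olds from three kindergartens, all with two to three years of experience with robots, were read a picture book in a natural tone suited to humans and in a monotonous tone, and were then asked which tone suited a person. They were then asked to choose which of the robot, the friend, or the doll the human-like tone suited, and their reasons were analyzed. As a result, 86% of the children judged that the human-like tone suited the robot better than the doll, 74% that it suited the friend better than the robot, and 68% that it suited the friend better than the doll. That is, the children perceived the robot as an intermediate hybrid being that is closer to a human than to an artifact. This perception mechanism reflected cognitive characteristics associated with human uniqueness. We therefore suggest that the dichotomous ontological classification of living and non-living things needs to be replaced with a classification that includes hybrid beings.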

A Comparison Between the Korean Digits-in-Noise Test and the Korean Speech Perception-in-Noise Test in Normal-Hearing and Hearing-Impaired Listeners

  • Kim, Subin;You, Sungwha;Sohn, Myoung Eun;Han, Woojae;Seo, Jae-Hyun;Oh, Yonghee
    • Journal of Audiology & Otology / Vol. 25, No. 4 / pp.171-177 / 2021
  • Background and Objectives: The purpose of the present study was to validate the performance and diagnostic efficacy of the Korean digits-in-noise (K-DIN) test in comparison to the Korean speech perception-in-noise (K-SPIN) test, which is the representative speech-in-noise test in clinical practice. Subjects and Methods: Twenty-seven subjects (15 normal-hearing and 12 hearing-impaired listeners) participated. Recorded Korean digits from 0 to 9 were used to form quasirandom digit triplets; 50 target digit triplets were presented at each subject's most comfortable level while speech-shaped background noise was presented at various signal-to-noise ratios (-12.5, -10, -5, or +5 dB). Subjects were instructed to listen to both the target and the noise masker unilaterally and bilaterally through headphones. The K-SPIN test was conducted using the same procedure as the K-DIN test. After percent correct responses were calculated, the K-DIN and K-SPIN results were compared using a Pearson correlation test. Results: There was a statistically significant correlation between K-DIN and K-SPIN in all hearing conditions (left: r=0.814, p<0.001; right: r=0.788, p<0.001; bilateral: r=0.727, p<0.001). Moreover, compared with the K-SPIN test, the K-DIN test achieved better testing efficacy, a shorter average listening time (5 min vs. 30 min), and easier task performance according to participants' qualitative reports. Conclusions: In this study, the Korean version of the digit triplet test was validated in both normal-hearing and hearing-impaired listeners. The findings suggest that the K-DIN test can be used as a simple and time-efficient hearing-in-noise test in audiology clinics in Korea.
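
The scoring pipeline outlined in this abstract (digit triplets, percent correct, Pearson correlation) can be sketched as follows. Several details are assumptions, because the abstract does not specify them: "quasirandom" is taken to mean three distinct digits per triplet, and a triplet counts as correct only when all three digits match. The function names are hypothetical, and no data from the study is reproduced.

```python
import math
import random

def make_digit_triplets(n_triplets, seed=0):
    """Draw triplets of digits 0-9 with no repeated digit inside a triplet.

    Assumption: "quasirandom" is modeled as sampling without replacement.
    """
    rng = random.Random(seed)
    return [tuple(rng.sample(range(10), 3)) for _ in range(n_triplets)]

def percent_correct(targets, responses):
    """Percentage of triplets reproduced exactly (assumed scoring rule)."""
    if not targets or len(targets) != len(responses):
        raise ValueError("targets and responses must be non-empty and equal in length")
    hits = sum(1 for t, r in zip(targets, responses) if tuple(t) == tuple(r))
    return 100.0 * hits / len(targets)

def pearson_r(xs, ys):
    """Pearson product-moment correlation coefficient of two score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)

# 50 target triplets, matching the count reported in the abstract.
triplets = make_digit_triplets(50)
```

In use, `percent_correct` would be computed per listener for each test, and `pearson_r` would then be applied to the paired K-DIN and K-SPIN scores within each hearing condition.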

이집트인 학습자의 한국어 모음 지각과 산출 (The perception and production of Korean vowels by Egyptian learners)

  • 사라 벤자민;이호영
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 13, No. 4 / pp.23-34 / 2021
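  • This study aims to determine how Egyptian learners of Korean perceive and categorize Korean vowels and how Koreans perceive the Korean vowels these learners produce, and on that basis to determine how the learners' categorization of Korean vowels affects their perception and production of those vowels. In Experiment 1, to examine how Egyptian learners perceive Korean vowels, 53 Egyptian learners listened to Korean stimulus words produced by Koreans and completed a multiple-choice task identifying which word they had heard. In Experiment 2, to determine how Koreans perceive the Korean vowels produced by Egyptian learners, 117 stimulus words (13 words × 9 speakers) produced by 9 Egyptian learners were played to Korean listeners, who chose which word they had heard in a multiple-choice task and then rated on a 5-point scale how close the vowel pronunciation was to native level. The results showed that "new" Korean vowels, which do not exist in the learners' native Egyptian language, readily formed separate categories and were perceived well, whereas in production some new vowels were produced well and others caused difficulty. In contrast, Korean phonemes "similar" to Egyptian phonemes were produced relatively well but were very difficult to perceive. Based on these results, we argue that the existing speech learning model and perceptual assimilation model explain second-language learners' perception of second-language speech well but fall short in explaining speech production, and therefore need to be supplemented.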