• 제목/요약/키워드: 말소리지각

Search Result 73, Processing Time 0.024 seconds

A review of speech perception: The first step for convergence on speech engineering (말소리지각에 대한 종설: 음성공학과의 융복합을 위한 첫 단계)

  • Lee, Young-lim
    • Journal of Digital Convergence
    • /
    • v.15 no.12
    • /
    • pp.509-516
    • /
    • 2017
  • People observe a lot of events in our environment and we do not have any difficulty to perceive events including speech perception. Like perception of biological motion, two main theorists have debated on speech perception. The purpose of this review article is to briefly describe speech perception and compare these two theories of speech perception. Motor theorists claim that speech perception is special to human because we both produce and perceive articulatory events that are processed by innate neuromotor commands. However, direct perception theorists claim that speech perception is not different from nonspeech perception because we only need to detect information directly like all other kinds of event. It is important to grasp the fundamental idea of how human perceive articulatory events for the convergence on speech engineering. Thus, this basic review of speech perception is expected to be able to used for AI, voice recognition technology, speech recognition system, etc.

The influence of visual information on place of articulation in Korean speech perception: The McGurk effect in Korean subjects (한국어 말소리 지각에 미치는 조음에 관한 시각정보의 영향: 한국인의 McGurk 효과)

  • Choi Yang-Gyu;Nam Kichun
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.293-296
    • /
    • 2002
  • 운동이론(motor theory)에 따르면 조음에 관한 정보는 말소리 지각에 중요한 역할을 한다고 한다. 조음에 관한 시각정보가 자음지각에 중요함을 시사하는 것이 바로 McGurk 효과이다. McGurk 효과는 말소리 지각에서 청각정보와 시각정보가 상충될 때 지각의 결과는 청각에 의한 조음정보와 시각에 의한 조음정보가 통합(integration)되어서 나타나는 것을 말한다. 예컨대, 시각적으로는 /ga/를 발음하는 모습을 보여주면서 동시에 청각적으로는 /ba/를 들려주면 그 결과로 /da/로 지각된다. 마찬가지로 시각적으로는 /ka/를, 청각적으로는 /ma/를 제시하면 /na/로 지각된다. 따라서 McGurk 효과는 시각적인 조음 정보가 자동적으로, 무의식적으로 말소리 지각과정에 통합됨은 보여준다. 한편 이러한 McGurk 효과는 문화마다 그 강도가 다르게 나타난다는 보고가 있다(Sekiyama, 1997). 예컨대, 일본가 중국 원어민의 경우 미국 원어민보다 McGurk 효과가 약하게 나타났다. 본 연구는 한국인에게는 McGurk 효과가 어떠한 양상으로 나타날지를 규명해 보고 아울러 기존의 미국, 일본 그리고 중국 원어민에 대한 연구결과와 비교 분석해 보았다.

  • PDF

Cross-sectional perception studies of children's monosyllabic word by naive listeners (일반 청자의 아동 발화 단음절에 대한 교차 지각 분석)

  • Ha, Seunghee;So, Jungmin;Yoon, Tae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.14 no.1
    • /
    • pp.21-28
    • /
    • 2022
  • Previous studies have provided important findings on children's speech production development. They have revealed that essentially all aspects of children's speech shift toward adult-like characteristics over time. Nevertheless, few studies have examined the perceptual aspects of children's speech tokens, as perceived by naive adult listeners. To fill the gap between children's production and adults' perception, we conducted cross-sectional perceptual studies of monosyllabic words produced by children aged two to six years. Monosyllabic words in the consonant-vowel-consonant form were extracted from children's speech samples and presented aurally to five listener groups (20 listeners in total). Generally, the agreement rate between children's production of target words and adult listeners' responses increases with age. The perceptual responses to tokens produced by two-year old children induced the largest discrepancies and the responses to words produced by six years olds agreed the most. Further analyses were conducted to identify the sources of disagreement, including the types of segments and syllable structure. This study makes an important contribution to our understanding of the development and perception of children's speech across age groups.

The Relationship Between Perception of Prosody, Pitch Discrimination, and Melodic Contour Identification in Cochlear Implants Recipients (인공와우이식 난청인의 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인 간 관련성)

  • Kim, Eun Yeon;Moon, Il Joon;Cho, Yang-sun;Chung, Won-ho;Hong, Sung Hwa
    • Journal of Music and Human Behavior
    • /
    • v.14 no.2
    • /
    • pp.1-18
    • /
    • 2017
  • The relationships between the ability to understand changes in meaning depending on the prosody of spoken words and the ability to perceive pitch and melodic contour in cochlear implants (CI) recipients were examined. Fifteen postlingual CI recipients were measured in terms of speech prosody perception, speech perception, pitch discrimination (PD), and melody contour identification (MCI). The speech prosody perception test consists of words with positive (PW) and neutral meaning (NW). Participants were asked to identify the meaning of words depending on the conditions of positive and negative prosody. The MCI consists of subtests 1 and 2 with different chance levels to choose. Then, the relationships between speech prosody perception, speech perception, PD, and MCI performance were analyzed. There was a significant difference in identifying the meaning of words expressed in a different prosody between the PW and NW conditions. Speech prosody perception showed a significant correlation with MCI 1 while there was no significant relationship with speech perception. Although speech perception may be possible after CI, limited spoken word comprehension due to decreased sensitivity for prosodic changes may persist in CI recipients. In addition, there was a limitation in perception of melodic contour change compared to pitch discrimination, which is related to speech prosody perception.

A review of event perception: The first step for convergence on robotics (사건지각에 대한 종설: 로봇공학과의 융복합을 위한 첫단계)

  • Lee, Young-Lim
    • Journal of Digital Convergence
    • /
    • v.13 no.4
    • /
    • pp.357-368
    • /
    • 2015
  • People observe lots of events around the environment and we can easily recognize the nature of an event from the resulting optic flow. The questions are how do people recognize events and what is the information in the optic flow that enables observers to recognize events. Motor theorists claim that human observers exhibit special sensitivity when perceiving events like speech or biological motion, because we both produce and perceive those events. However, direct perception theorists suggested that speech or biological motion is not special from the perception of all other kinds of event. The purpose of this review article is to address this controversy to critique the motor theory and to describe a direct realist approach to event perception. It is important to understand the fundamental information of how human perceive event perception for the convergence on robotics.

청각장애 아동과 건청아동의 이중모음 산출에 대한 음향음성학적 특징 비교

  • 배남주;고도흥
    • Proceedings of the KSLP Conference
    • /
    • 2003.11a
    • /
    • pp.244-244
    • /
    • 2003
  • 말소리의 생성 및 전달에서 화자의 청각적 피드백은 말소리 발달에 중요한 부분을 차지한다(고도흥 외, 2000). 그러나 청각장애 아동의 경우, 청각적인 피드백이 부족하여 말소리 발달과 언어발달에서 지체를 보이게 된다. 특히 이러한 말소리 발달은 아동의 말명료도에 큰 영향을 미치게 되고, 국내외 여러 학자들은 청각장애 아동의 말 산출에 대한 연구를 활발하게 하고 있다. 그러나 현재 국내의 연구 중 이중모음에 대한 연구는 거의 없는 실정이다. 국내의 청각장애 성인이나 아동을 대상으로 한 연구들은 대부분 연구자의 지각적이고 주관적인 입장에서 이루어지고 있다. 좀더 객관적인 연구 자료는 임상적인 목적뿐만 아니라 말소리 발달의 연구에서 필요하다. 따라서 이 연구는 청각장애 아동의 이중모음의 특징을 음향음성학적인 방법으로 객관적으로 분석하여 그 자료를 제시하고, 건청 아동과의 비교를 통해 임상적인 자료를 제시하고자 한다. (중략)

  • PDF

한국어 자음의 지각적 구조

  • 배문정
    • Proceedings of the KSLP Conference
    • /
    • 2003.11a
    • /
    • pp.226-229
    • /
    • 2003
  • 본 논문에서는 1) 말소리의 심적 표상구조를 조사하기 위해 본 연구에서 사용된 실험 방법론을 간략하게 소개하고, 2) 한국어 초성 자음들의 지각적 구조를 조사한 본 연구의 결과를 보고한다. 더불어 3) 본 연구에서 얻어진 결과가 음성학 또는 음운론 연구에 어떤 함의를 제공하는지를 논의한다.

  • PDF

Japanese Speakers' Perception and Production of Korean Lenis, Aspirated, and Fortis Consonants (일본어 화자의 한국어 평음/기음/경음의 지각과 산출)

  • Hwang Yu Mi;Cho Hye Suk;Kim Soo Jin
    • MALSORI
    • /
    • no.44
    • /
    • pp.61-72
    • /
    • 2002
  • The purpose of this research is to investigate how Japanese speakers perceive and produce Lenis, Aspirated and Fortis consonants in Korean. Identification tasks and production tasks were performed. The error analysis of both task showed that the participants had a significant difficulty in discriminating between Lenis and Aspirated sounds. And it was observed that there was a positive correlation between identification scores and production scores.

  • PDF

Effects of the Orthographic Representation on Speech Sound Segmentation in Children Aged 5-6 Years (5~6세 아동의 철자표상이 말소리분절 과제 수행에 미치는 영향)

  • Maeng, Hyeon-Su;Ha, Ji-Wan
    • Journal of Digital Convergence
    • /
    • v.14 no.6
    • /
    • pp.499-511
    • /
    • 2016
  • The aim of this study was to find out effect of the orthographic representation on speech sound segmentation performance. Children's performances of the orthographic representation task and the speech sound segmentation task had positive correlation in words of phoneme-grapheme correspondence and negative correlation in words of phoneme-grapheme non-correspondence. In the case of words of phoneme-grapheme correspondence, there was no difference in performance ability between orthographic representation high level group and low level group, while in the case of words of phoneme-grapheme non-correspondence, the low level group's performance was significantly better than the high level group's. The most frequent errors of both groups were orthographic conversion errors and such errors were significantly more noticeable in the high level group. This study suggests that from the time of learning orthographic knowledge, children utilize orthographic knowledge for the performance of phonological awareness tasks.

Learning acoustic cue weights for Korean stops through L2 perception training (지각 훈련을 통한 한국어 폐쇄음 음향 신호 가중치의 L2 학습)

  • Oh, Eunjin
    • Phonetics and Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.9-21
    • /
    • 2021
  • This study investigated whether Korean learners improve acoustic cue weights to identify Korean lenis and aspirated stops in the direction of native values through perception training that focused on contrasting the stops in various phonetic contexts. Nineteen native Chinese learners of Korean and two native Korean instructors for the perception training participated in the experiment. A training group and a non-training group were divided according to pretest results, and only the training group participated in the training for 5 days. To estimate the perceptual weights of the stop cues, a pretest and a posttest were conducted with stimuli whose stop cues (F0 and VOT) were systematically manipulated. Binary logistic regression analyses were performed on each learner's test results to calculate perceptual β coefficients, which estimate the perceptual weights of the acoustic cues used in identifying the stop contrast. The training group showed a statistically significant increase of 0.451 on average in the posttest for the coefficient values of the F0, which is the primary cue for the stop contrast, whereas the non-training group showed an insignificant increase of 0.246. The patterns of change in the F0 use after training varied considerably among individual learners.