• Title/Summary/Keyword: speech production

Search Result 383, Processing Time 0.033 seconds

Discourse-level Prosody Produced by Korean Learners of English

  • Kim, Boram
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.67-77
    • /
    • 2014
  • This study investigated (1) whether Korean learners of English use discourse-level prosody in L2 production as native speakers of English do, and (2) whether discourse-level prosody is also found in the Korean language, as is evident in the prosody of native speakers of English. The study compared the production of the same 15 sentences in two types of reading materials, sentence-level and discourse-level. This study analyzed the onset pitch, sentence mean pitch and pause length to examine the paratone (intonational paragraph) realization in discourse-level speech. The results showed that in L2 discourse-level prosody, the Korean speakers were limited in displaying paratone and did not made significant difference between sentence-level and discourse-level prosody. On the other hand, in L1 discourse-level text, both English and Korean participants demonstrated paratone using pitch. However, there were differences in using prosodic cues between two groups. In using pauses, the ES group paused longer before both the orthographically marked and not marked topic sentences. The KS group paused longer only before the orthographically marked topic sentence in both L1 and L2 text reading. In the comparison of sentence-level and discourse-level prosody, the topic sentences were marked by different prosodic cues. English participants used higher sentence mean pitch, and the Korean participants used higher onset pitch.

Characteristics of Korean Stop Consonants by Using Electroglottography and Its Clinical Application (Electroglottography를 사용한 한국어 폐쇄자음의 특성 및 임상적 적용)

  • Chae, Y.J.;Kim, H.G.;Hong, K.H.
    • Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.157-177
    • /
    • 1998
  • An electroglottography (EGG) was used to investigate the function of the vocal folds during their vibration. In this study, four Korean native speakers and 10 vocal polyp patients were selected. To investigate the dynamic change of EGG waveforms for the three-way distinction of Korean stops, a DSP-Sona graph model 5500, a Rino- Laryngeal stroboscope, a CSL model 4300B and a Laryngograph were used. An EGG Model 4338 was used to exam the vocal polyp of patients' voices during high, low, comfortable pitch production. The purpose of this study is to investigate the characteristics of Korean stop consonants in relation to pitch and to observe laryngeal movement during vocal fold vibration and speech production. The basic data accumulated during this research can be applied in clinical treatment. The results are as follows: on the Korean stop consonants, the aspirated stop is the highest in the GOT and PC1. On the angle of vowel contour, the angle of lenis is smaller than the angle of heavily aspirated and glottalized stops. The fundamental frequency is lowest at the lenis stop, In vocal polyp patients', the low pitch range is smaller than in normal speakers'. The pitch break and the vocal fry were observed. The jitter and OQ value are higher in vocal polyp patients than in those of normal speakers'.

  • PDF

The effects of length of residence (LOR) on voice onset time (VOT)

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.9-17
    • /
    • 2020
  • Changes in the first language (L1) sound system as a result of acquiring a second language (L2) (i.e., phonetic drift) have received considerable attention from a variety of speakers, settings, and environments. Less attention has been given to phonetic drift in adult speakers' L2 learning as their length of residence in America (LOR) increases. This study examines the effects of LOR on voice onset time (VOT) in L1 Korean stops. Three different groups of Korean adult learners of L2 English were compared to assess how malleable their L1 representations are in terms of LOR and whether there is any relationship between L1 change and L2 acquisition. The results showed that the effect of LOR was linguistically unimportant in the production of Korean stops. However, VOT merger as evidence of sound change in Korean stops were robust in the speech production of most of the female speakers across the groups. The results suggest that L2 English may not be the primary cause of L1 sound change. For generalizability, further study is necessary to see whether other acoustic cues show a similar pattern.

Regional differences in Korean children's development of speech production (우리나라 아동의 지역별 말소리 발달 차이)

  • Shin, Moonja;Ha, Ji-Wan;Kim, Young Tae;Kim, Soo-Jin
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.57-67
    • /
    • 2019
  • This study aimed to investigate regional differences in the development of speech production in Korean children. A total of 619 children aged 2 to 7 years from the Jeolla, Seoul/Gyeonggi, Chungcheong, and Gyeongsang areas were included in this study. The subjects were assessed with the UTAP2 word-level test. In PWC, PMLU, and PWP, the performance was significantly lower in Gyeongsang at 2 years 11 months and in Jeolla and Chungcheong at 3 years 5 months than in Seoul/Gyeonggi. The total PCC of Gyeongsang and Chungcheong and UTAP PCC of Chungcheong were significantly lower at 2 years 11 months compared with those of Seoul/Gyeonggi, while Jeolla and Chungcheong showed significantly lower total PCC and UTAP PCC than Seoul/Gyeonggi at 3 years 5 months. However, no regional difference was observed in any indicators after the age of 3 years 6 months. These results suggest that there are regional differences in the ability to produce speech sounds at a very young age, and that the differences can be explained by the differences between Seoul/Gyeonggi and the other provinces rather than by the individual characteristics of specific regions.

Characteristics of speech rate and pause in children with spastic cerebral palsy and their relationships with speech intelligibility (경직형 뇌성마비 아동의 하위그룹별 말속도와 쉼의 특성 및 말명료도와의 관계)

  • Jeong, Pil Yeon;Sim, Hyun Sub
    • Phonetics and Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.95-103
    • /
    • 2020
  • The current study aimed to identify the characteristics of speech rate and pause in children with spastic cerebral palsy (CP) and their relationships with speech intelligibility. In all, 26 children with CP, 4 with no speech motor involvement and age-appropriate language ability (NSMI-LCT), 6 with no speech motor involvement and impaired language ability (NSMI-LCI), 6 with speech motor involvement and age-appropriate language ability (SMI- LCT), and 10 with speech motor involvement and impaired language ability (SMI-LCI) participated in the study. Speech samples for the speech rate and pause analysis were extracted using a sentence repetition task. Acoustic analysis were made in Praat. First, it was found that regardless of the presence of language impairment, significant group differences between the NSMI and SMI groups were found in speech rate and articulation rate. Second, the SMI groups showed a higher ratio of pause time to sentence production time, more frequent pauses, and longer durations of pauses than the NSMI groups. Lastly, there were significant correlations among speech rate, articulation rate, and intelligibility. These findings suggest that slow speech rate is the main feature in SMI groups, and that both speech rate and articulation rate play important roles in the intelligibility of children with spastic CP.

The Pitch Beginning Point Extraction Using Property of G-peak (G-Peak의 특성에 의한 피치시점검출)

  • 이해군
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1993.06a
    • /
    • pp.259-262
    • /
    • 1993
  • In this paper, a new pitch beginning point detection method by extracting the G-peak, is proposed. By the speech production model, the area of the first peak on a pitch interval of speech signals is emphasized. By using the above characteristics, this method have more advantages than the others for pitch beginning point detection. The defective decision caused by an impulsive noise is minimized and the pre-filtering is not necessary for this method, because the integration of signals takes place in the process.

  • PDF

SPEECH PRODUCTION MECHANISM (발음생성기전)

  • Kim, Byoung-W.
    • The Journal of the Korean dental association
    • /
    • v.15 no.4
    • /
    • pp.301-304
    • /
    • 1977
  • 우리인체의 언어기관은 중추신경에서부터 근육에 이르기까지 광번위한 부분이다. 언어9발성)기관을 근육의 단계에서 볼 때는 다음의 세 가지 model(모형)으로써 언어생성(발성) 과정을 설명할 수 있다. 첫째 모형은 해부학적인 모형으로써 언어생성에 어떠한 근육들이 작용하는가를 볼 수 있고 두째번 모형은 생리학적인 모형으로 언어생성과정의 기능적인 면을 볼 수 있고 셋째번 모형은 물리학적 모형으로 말소리의 물리학적인 성질을 이해하는데에 도움을 준다. 이 세 가지 모형을 이용해서 언어생성 과정을 간결하게 설명했다.

  • PDF

말더듬 연구의 최근 동향

  • 심현섭
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.7 no.1
    • /
    • pp.123-127
    • /
    • 1996
  • 말더듬 연구의 최근 동향을 알아보기 위해 첫째로, 말더듬의 기저(underlying)를 이루는 neuromotor mechanism을 밝히기 위해 말더듬 연구에 도입된 새로운 접근 방법인 speech motor control에 대한 기초적인 이해와 어떻게 말더음이의 이론 및 치료에 응용되고 있는가를 개괄적으로 살펴보며, 둘째로 오랫동안 연구대상이 주로 성인 말더듬이였으나, 아동 말더듬이, 특히 취학 전 아동 말더듬이에 대한 관심이 고조되고 있는 이유 및 현재의 아동 말더듬에 관한 speech production 연구를 간략히 살펴보았다.

  • PDF

The Contribution of Prosody to the Foreign Accent of Chinese Talkers' English Speech

  • Liu, Xing;Lee, Joo-Kyeong
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.59-73
    • /
    • 2012
  • This study attempts to investigate the contribution of prosody to the foreign accent in Chinese speakers' English production by examining the synthesized speech of crossing native and non-native talkers' prosody and segments. For the stimuli of the foreign accent ratings, we transplanted gender-matched native speakers' prosody onto non-native talkers' segments and vice versa, utilizing the TD-PSOLA algorithm. Eight English native listeners participated in judging foreign accent and comprehensibility of the transplanted stimuli. Results showed that the synthesized stimuli were perceived as stronger foreign accent regardless of speakers' proficiency when English speakers' prosody was crossed with Chinese speakers' segments. This suggests that segments contribute more than prosody to native listeners' evaluation of foreign accent. When transplanted with English speakers' segments, Chinese speakers' prosody showed a difference in duration rather than pitch between high and low proficiency such that stronger foreign accent was detected when low proficient Chinese speakers' duration was crossed with English speakers' segments. This indicated that prosody, more specifically duration, plays a role though the prosodic role is not overall as significant as segments. According to the post acoustic analysis, the temporal features contributing to making the duration parameter prominent as opposed to pitch were found out to be speaking rate, pause duration and pause frequency. Finally, foreign accent and comprehensibility showed no significant correlation such that native listeners had no difficulty listening to highly foreign accented speech.

Analysis and synthesis of pseudo-periodicity on voice using source model approach (음성의 준주기적 현상 분석 및 구현에 관한 연구)

  • Jo, Cheolwoo
    • Phonetics and Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.89-95
    • /
    • 2016
  • The purpose of this work is to analyze and synthesize the pseudo-periodicity of voice using a source model. A speech signal has periodic characteristics; however, it is not completely periodic. While periodicity contributes significantly to the production of prosody, emotional status, etc., pseudo-periodicity contributes to the distinctions between normal and abnormal status, the naturalness of normal speech, etc. Measurement of pseudo-periodicity is typically performed through parameters such as jitter and shimmer. For studying the pseudo-periodic nature of voice in a controlled environment, through collected natural voice, we can only observe the distributions of the parameters, which are limited by the size of collected data. If we can generate voice samples in a controlled manner, experiments that are more diverse can be conducted. In this study, the probability distributions of vowel pitch variation are obtained from the speech signal. Based on the probability distribution of vocal folds, pulses with a designated jitter value are synthesized. Then, the target and re-analyzed jitter values are compared to check the validity of the method. It was found that the jitter synthesis method is useful for normal voice synthesis.