• Title/Abstract/Keywords: speech sound prolongation (말소리 연장)

Search Result 16, Processing Time 0.021 seconds

Korean listeners' mode of perceiving the durational variations of /s/ as prolongations (한국어 평마찰음 /s/ 연장음에 대한 비유창성 양상 연구)

  • Park, Jin;Go, Boksun;Park, Sohyun
    • Phonetics and Speech Sciences / v.14 no.2 / pp.67-76 / 2022
  • This study examined whether Korean listeners perceive sound duration as prolongation dichotomously or continuously. Thirty-five Korean participants (17 men and 18 women) listened to the Korean segment /s/, which was lengthened by 0-980 ms in 20-ms increments. The participants then rated each version of the sound on a scale of 1 to 100 (the closer to 100, the more disfluent). To determine whether listeners perceived durational variations of the fricative dichotomously or continuously, a curve was estimated by selecting the regression model with the highest adjusted R-squared value for the observed data. The mode of perceiving durational variations of the segment was continuous (or gradient) rather than discontinuous (or dichotomous). No gender difference was found in the mode of perceiving prolongation. However, there was a significant gender difference in that men rated the most disfluent sounds higher than women did. The findings are discussed in relation to the existing literature, and clinical implications for the assessment of stuttering are presented.
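
The model-selection step described in this abstract (fit several candidate curves, keep the one with the highest adjusted R-squared) can be sketched as follows. This is only an illustration under stated assumptions: the candidate models are restricted to polynomials of degree 1 to 3, the ratings are synthetic placeholder values rather than the study's data, and the helper names `fit_polynomial` and `adjusted_r_squared` are hypothetical.

```python
import math


def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (pure Python)."""
    n = degree + 1
    # Normal equations A c = b, where A[i][j] = sum(x^(i+j)) and b[i] = sum(y * x^i).
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution; coeffs[i] multiplies x**i.
    coeffs = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(a[row][k] * coeffs[k] for k in range(row + 1, n))
        coeffs[row] = (b[row] - tail) / a[row][row]
    return coeffs


def adjusted_r_squared(xs, ys, coeffs):
    """Adjusted R^2 for a polynomial with len(coeffs) - 1 predictors."""
    n, p = len(ys), len(coeffs) - 1
    mean_y = sum(ys) / n
    predict = lambda x: sum(c * x ** i for i, c in enumerate(coeffs))
    ss_res = sum((y - predict(x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    r2 = 1 - ss_res / ss_tot
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)


# Added durations in seconds (0 to 0.98 s in 20-ms steps, as in the abstract).
durations_s = [0.02 * k for k in range(50)]
# ASSUMPTION: synthetic, smoothly saturating ratings used only for illustration.
ratings = [100 * (1 - math.exp(-3 * x)) for x in durations_s]

scores = {
    degree: adjusted_r_squared(durations_s, ratings,
                               fit_polynomial(durations_s, ratings, degree))
    for degree in (1, 2, 3)
}
best_degree = max(scores, key=scores.get)
```

Durations are expressed in seconds rather than milliseconds to keep the normal equations well conditioned; a production analysis would more likely rely on a statistics package than on hand-rolled elimination.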

The perceptual judgment of sound prolongation: Equal-appearing interval and direct magnitude estimation (연장음 길이에 따른 비유창성 정도 평가: 등간척도와 직접크기평정 비교 연구)

  • Jin Park;Hwajung Cha;Sejin Bae
    • Phonetics and Speech Sciences / v.15 no.3 / pp.59-67 / 2023
  • This study aimed to propose an appropriate evaluation method for the perceived level of speech disfluency based on sound prolongation (i.e., increased duration of segments). To this end, 34 Korean-speaking adults (9 males, 25 females; mean age: 32.9 years) participated as raters. The participants listened to sentences containing a total of 25 stimuli in which the Korean voiceless fricative /s/ was extended in 80-ms increments up to 2,000 ms (i.e., 285 ms, 365 ms, ..., 2,125 ms, 2,205 ms), and evaluated them using an equal-appearing interval scale (EAI, 1-7 points, where 1 represents "normal" and 7 represents "severe"). Subsequently, based on the interval-scale results, the sentence stimuli with the prolonged voiceless fricative corresponding to the mild-to-moderate level (rated as 4 points) were selected as the reference modulus for direct magnitude estimation (DME). After scatter plots were created for the two sets of evaluation results, the relationship between the two mean ratings was analyzed using curve estimation, selecting the model with the highest $R^2$ value to determine whether a linear or curvilinear approximation fit the data better. A curvilinear relationship between the two sets of results was found, suggesting that DME is a more appropriate evaluation method than the EAI scale for assessing the perceived level of disfluency based on sound prolongation.
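
The stimulus series quoted in this abstract can be checked with a line of arithmetic. The sketch below assumes (this is not stated in the abstract) that 285 ms is the shortest stimulus and that each subsequent stimulus adds one further 80-ms step; the constant names are hypothetical.

```python
BASE_MS = 285      # ASSUMPTION: shortest stimulus listed in the abstract
STEP_MS = 80       # increment given in the abstract
N_STIMULI = 25     # number of stimuli given in the abstract

# Duration of the k-th stimulus: base plus k increments.
durations_ms = [BASE_MS + STEP_MS * k for k in range(N_STIMULI)]
largest_extension_ms = durations_ms[-1] - durations_ms[0]
```

Under this reading the series ends at 2,205 ms, matching the values listed in the abstract, and the largest extension is 1,920 ms.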

A Survey of the Korean Learner's Problems in Mastering English Pronunciation (한국인의 영어 발음 학습상 문제점 개관)

  • Youe Hansa MahnGunn
    • MALSORI / no.42 / pp.47-56 / 2001
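  • This paper is a lightly revised version of a keynote lecture given at the 2nd Seoul International Conference on Phonetic Sciences (SICOPS 2000). It examines, for vowels and consonants in turn, the pronunciation errors that Korean learners of English are prone to make and discusses remedies. For vowels, the main problem is confusion between /i:/-/ɪ/, /u:/-/ʊ/, and (equation omitted); identifying the schwa, which is spelled in more than 90 different ways, and pronouncing it accurately is also a major difficulty. For consonants, many errors arise because learners extend phenomena specific to Korean, such as consonant assimilation triggered by the way phonemes are joined, to English. The paper also points out that in English sp-, st-, and sk-, /p t k/ are lenis sounds, [(equation omitted)], yet learners often mistake them for tense consonants. The author stresses that learners of English should not guess a pronunciation from spelling alone but should check each word in a dictionary, and that this must be accompanied by phonetic training. The paper concludes that knowing the accurate pronunciation is essential not only for listening to and speaking English in practice but also for laying the foundation of linguistic research.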

Acoustic characteristics of the sustained vowel phonation according to age groups (모음 연장 발성이 보이는 연령대별 음향음성학적 특성 연구)

  • Seo, Yoon-Jeong;Shin, Jiyoung
    • Phonetics and Speech Sciences / v.10 no.4 / pp.67-76 / 2018
  • This study investigated the acoustic characteristics of sustained vowels produced by Seoul Korean speakers. Three hundred nine healthy adults were selected as participants from the Korean Standard Speech Database. The subjects were divided into five chronological age groups (20s, 30s, 40s, 50s, 60-70s) and two gender groups (male and female). Fundamental frequency (f0), jitter, shimmer, and NHR (noise-to-harmonics ratio) were measured for 8 Korean vowels (/ɑ/, /æ/, /ʌ/, /e/, /o/, /u/, /ɯ/, /i/) using Praat. The results showed that vowel type significantly affected all acoustic parameters. Gender significantly affected f0, jitter, and NHR: the female speakers' mean f0 was higher than the males', and the male speakers' mean jitter and NHR were greater than the females'. Moreover, age significantly affected shimmer and NHR; in particular, the shimmer and NHR of elderly speakers were greater than those of young speakers.
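
For readers unfamiliar with the perturbation measures named in this abstract, the sketch below shows the commonly used "local" definitions: jitter as the mean absolute difference between consecutive glottal periods divided by the mean period, and shimmer as the same ratio computed on cycle peak amplitudes. It is not Praat's implementation, the period and amplitude values are hypothetical, and `local_perturbation` is an illustrative helper name.

```python
def local_perturbation(values):
    """Mean absolute difference of consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


# ASSUMPTION: hypothetical cycle-by-cycle measurements from a sustained vowel.
periods_s = [0.0050, 0.0051, 0.0049, 0.0050, 0.0052]   # glottal periods in seconds
amplitudes = [0.80, 0.78, 0.82, 0.79, 0.81]            # peak amplitude per cycle

jitter_local = local_perturbation(periods_s)     # often reported as a percentage
shimmer_local = local_perturbation(amplitudes)
mean_f0_hz = len(periods_s) / sum(periods_s)     # reciprocal of the mean period
```

In practice the hard part is extracting reliable cycle boundaries from the waveform, which is what a tool such as Praat handles before these ratios are computed.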

Speaker age estimation and acoustic characteristics: According to pitch and speech rate (화자 연령 지각과 음성적 특성: 음높이와 발화 속도를 중심으로)

  • Seo, YoonJeong;Shin, Jiyoung
    • Phonetics and Speech Sciences / v.11 no.4 / pp.9-18 / 2019
  • This study investigated the correlation between speakers' chronological age (CA) and perceived age (PA) and examined the effect of pitch and speech rate as acoustic cues for judging age, using perceptual testing and acoustic analysis. Three perception tasks were conducted to measure how accurately 80 Korean listeners estimated age when presented with different types of speech. In all the tasks, participants listened to speech samples and gave a numerical estimate of the speaker's age. Korean listeners were able to gauge the age of a speaker fairly precisely, and CA and mean PA were positively correlated in all three tasks. The amount and type of information included in the voice samples affected the accuracy of listeners' judgments. Moreover, the results revealed that listeners make use of acoustic information such as pitch and speech rate to estimate a speaker's age.

AI-based stuttering automatic classification method: Using a convolutional neural network (인공지능 기반의 말더듬 자동분류 방법: 합성곱신경망(CNN) 활용)

  • Jin Park;Chang Gyun Lee
    • Phonetics and Speech Sciences / v.15 no.4 / pp.71-80 / 2023
  • This study primarily aimed to develop an automated stuttering identification and classification method using artificial intelligence technology. In particular, it aimed to develop a deep learning-based identification model using a convolutional neural network (CNN) for Korean speakers who stutter. To this end, speech data were collected from 9 adults who stutter and 9 normally fluent speakers. The data were automatically segmented at the phrasal level using Google Cloud speech-to-text (STT), and labels such as 'fluent', 'blockage', 'prolongation', and 'repetition' were assigned to the segments. Mel frequency cepstral coefficients (MFCCs) and the CNN-based classifier were used to detect and classify each type of stuttered disfluency. However, only five instances of prolongation were found, so this type was excluded from the classifier model. Results showed that the accuracy of the CNN classifier was 0.96, and the F1-scores for classification performance were as follows: 'fluent' 1.00, 'blockage' 0.67, and 'repetition' 0.74. Although the effectiveness of the CNN-based automatic classifier in detecting stuttered disfluencies was validated, its performance was found to be inadequate, especially for the blockage and prolongation types. Consequently, the establishment of a large speech database organized by type of stuttered disfluency was identified as a necessary foundation for improving classification performance.
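
The gap between the overall accuracy and the per-class F1-scores reported in this abstract is typical of imbalanced label sets. The sketch below computes both metrics from scratch to show how they can diverge. The label lists are hypothetical toy data, not the study's output, and `accuracy` and `per_class_f1` are illustrative helper names.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of items whose predicted label matches the true label."""
    hits = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return hits / len(true_labels)


def per_class_f1(true_labels, predicted_labels):
    """F1 = 2*TP / (2*TP + FP + FN), computed separately for each class."""
    pairs = list(zip(true_labels, predicted_labels))
    scores = {}
    for c in sorted(set(true_labels) | set(predicted_labels)):
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        denom = 2 * tp + fp + fn
        scores[c] = 2 * tp / denom if denom else 0.0
    return scores


# ASSUMPTION: toy labels in which 'fluent' dominates, as in most speech samples.
y_true = ["fluent"] * 16 + ["blockage"] * 2 + ["repetition"] * 2
y_pred = ["fluent"] * 16 + ["blockage", "repetition"] + ["repetition"] * 2

overall_accuracy = accuracy(y_true, y_pred)
f1_by_class = per_class_f1(y_true, y_pred)
```

Because the majority class dominates the accuracy figure, a single misclassified minority item barely moves accuracy while lowering that class's F1 noticeably, which is why per-class scores are the more informative number here.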

A comparative study of the acoustic characteristics of the vowel /a/ between children with spastic and dyskinetic cerebral palsy (경직형과 불수의운동형 뇌성마비아동의 /아/ 모음 음향학적 비교)

  • Jeong, Pil Yeon;Sim, Hyun Sub
    • Phonetics and Speech Sciences / v.12 no.1 / pp.65-74 / 2020
  • This study aims to compare the acoustic characteristics of vowel phonation in children with spastic and dyskinetic cerebral palsy (CP). Thirty-four children aged 4-12 years with CP participated in the study (spastic 26, dyskinetic 8). Voice samples for the acoustic analysis were extracted from a sustained vowel /a/. All acoustic measures were made using Praat. Group differences were compared using an independent t-test, or the Welch-Aspin test if the equal-variance assumption was not met. The results are as follows. First, maximum phonation time (MPT) was significantly shorter in the dyskinetic CP group than in the spastic CP group. Second, shimmer percent was significantly higher in the dyskinetic CP group than in the spastic CP group. Lastly, there were no significant group differences in either the first or the second formant. These findings indicate that children with dyskinetic CP have poorer respiratory capacity and poorer laryngeal function than children with spastic CP. On the other hand, both groups have a comparable ability to articulate the vowel /a/. The results help speech-language pathologists identify the speech motor control ability of children with the two types of CP (spastic and dyskinetic) and plan interventions suited to a specific type of CP.
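
The fallback test mentioned in this abstract (Welch's unequal-variance t-test) differs from the ordinary independent t-test in its standard error and degrees of freedom. The sketch below computes the Welch statistic and the Welch-Satterthwaite degrees of freedom; the sample values are hypothetical, `welch_t` is an illustrative helper name, and the p-value lookup is omitted because the standard library has no t distribution.

```python
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t, df) for Welch's unequal-variance two-sample t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each group mean (sample variance / n).
    se2_a = variance(sample_a) / n_a
    se2_b = variance(sample_b) / n_b
    t = (mean(sample_a) - mean(sample_b)) / (se2_a + se2_b) ** 0.5
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1))
    return t, df


# ASSUMPTION: hypothetical maximum phonation times in seconds for two groups
# of unequal size, chosen only to exercise the function.
group_one = [6.1, 7.4, 5.8, 8.0, 6.9, 7.2, 6.5, 7.8]
group_two = [3.2, 4.9, 2.7, 5.5]

t_stat, dof = welch_t(group_one, group_two)
```

When group sizes are as unequal as the 26 versus 8 reported here, the Welch degrees of freedom can fall well below the pooled value of n1 + n2 - 2, making the test more conservative.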

Classification of muscle tension dysphonia (MTD) female speech and normal speech using cepstrum variables and random forest algorithm (켑스트럼 변수와 랜덤포레스트 알고리듬을 이용한 MTD(근긴장성 발성장애) 여성화자 음성과 정상음성 분류)

  • Yun, Joowon;Shim, Heejeong;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.12 no.4 / pp.91-98 / 2020
  • This study investigated the acoustic characteristics of the sustained vowel /a/ and sentence utterances produced by patients with muscle tension dysphonia (MTD) using cepstrum-based acoustic variables. Thirty-six women diagnosed with MTD and the same number of women with normal voices participated in the study, and the data were recorded and measured with ADSV™. The results demonstrated that, among all the variables, cepstral peak prominence (CPP) and CPP_F0 were statistically significantly lower in the MTD group than in the control group. On the GRBAS scale, overall severity (G) was most prominent in the voice quality of MTD patients, followed in order by roughness (R), breathiness (B), and strain (S). As these characteristics increased, a statistically significant negative correlation with CPP was observed. The MTD and control groups were then classified using the CPP and CPP_F0 variables. Statistical modeling with a Random Forest machine learning algorithm yielded much higher classification accuracy (100% on training data and 83.3% on test data) in the sentence reading task, with CPP proving to play the more crucial role in both the vowel and sentence reading tasks.

Laryngeal height and voice characteristics in children with autism spectrum disorders (자폐스펙트럼장애 아동의 후두 높이 및 음성 특성)

  • Lee, Jung-Hun;Kim, Go-Woon;Kim, Seong-Tae
    • Phonetics and Speech Sciences / v.13 no.2 / pp.91-101 / 2021
  • The purpose of this study was to investigate laryngeal characteristics in children with autism spectrum disorders (ASD). A total of 50 children participated, including eight children aged 2 to 4 years diagnosed with ASD and 42 normal controls of the same age. X-ray images of the midsagittal plane of the cervical spine and larynx were taken for all children, and the laryngeal positions of the ASD and control groups were compared. In addition, vowel prolongation samples were collected from the children and analyzed for acoustic parameters. X-rays showed that the height of the hyoid bone in the normal group was lowest at 3 years of age and ascended at 4 years of age. Nevertheless, the distance from the external acoustic meatus to the hyoid bone was longest at age 4. Four-year-olds with explosive language development showed laryngeal elevation and anteriorization. In contrast, the hyoid height of the ASD group was lower than that of the control group at all ages, and there was no difference in hyoid position between the ages. In the acoustic evaluation, PFR, vFo, and vAm were significantly higher in the ASD group than in the control group. The low laryngeal height of children with ASD may be associated with delayed language development. PFR, vFo, and vAm appear to be voice markers that distinguish children with ASD from typically developing children.