• Title/Summary/Keyword: Speakers

Search Result 1,239, Processing Time 0.023 seconds

Vocal acoustic characteristics of speakers with depression (우울증 화자 음성의 음향음성학적 특성)

  • Baek, Yeon-Sook;Kim, Se-Joo;Kim, Eun-Yeon;Choi, Yae-Lin
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.91-98
    • /
    • 2012
  • The purposes of this paper is to study the characteristics of compared to the speakers voice without depression and speakers with depression, and to propose a objective method for the measurement of the therapeutic effects as well as for diagnostics of depression based on the characteristics. The voice samples obtained from 11 female speakers with depression, aged from 20 to 40, diagnosed as having major depressive disorder by an psychiatrist were compared with those from 12 normal controls with matched sex, age, height, weight, education, smoking, and drinking. The voice samples are taken by a portable digital recorder(TASCAM DR-07, Japan) and analysed using the MDVP(Multi-Dimentional Voice Program) software module from CSL(Computerized Speech Lab, kay elemetrics, co, model 4100). The result of the investigation are as following. First, the average speaking fundamental frequency and loudness range of the speakers with depression group was statistically significantly lower than that of the control group. The pitch range of the control group was rather higher than that of the speakers with depression group, but without statistical significance. Overall speech rates have no statistical difference between two groups. Second, the average speaking fundamental frequency and loudness range have statistically significant negative correlation with Beck Depression Inventory, i. e. more severe depression exhibits lower average speaking fundamental frequency and loudness range. Other vocal parameters such as pitch range and overall speech rate have no statistically meaningful correlations with Beck Depression Inventory.

Acoustic Measurement of English read speech by native and nonnative speakers

  • Choi, Han-Sook
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.77-88
    • /
    • 2011
  • Foreign accent in second language production depends heavily on the transfer of features from the first language. This study examines acoustic variations in segments and suprasegments by native and nonnative speakers of English, searching for patterns of the transfer and plausible indexes of foreign accent in English. The acoustic variations are analyzed with recorded read speech by 20 native English speakers and 50 Korean learners of English, in terms of vowel formants, vowel duration, and syllabic variation induced by stress. The results show that the acoustic measurements of vowel formants and vowel and syllable durations display difference between native speakers and nonnative speakers. The difference is robust in the production of lax vowels, diphthongs, and stressed syllables, namely the English-specific features. L1 transfer on L2 specification is found both at the segmental levels and at the suprasegmental levels. The transfer levels measured as groups and individuals further show a continuum of divergence from the native-like target. Overall, the eldest group, students who are in the graduate schools, shows more native-like patterns, suggesting weaker foreign accent in English, whereas the high school students tend to involve larger deviation from the native speakers' patterns. Individual results show interdependence between segmental transfer and prosodic transfer, and correlation with self-reported proficiency levels. Additionally, experience factors in English such as length of English study and length of residence in English speaking countries are further discussed as factors to explain the acoustic variation.

  • PDF

Spectral and Cepstral Analyses of Esophageal Speakers (식도발성화자 음성의 spectral & cepstral 분석)

  • Shim, Hee-Jeong;Jang, Hyo-Ryung;Shin, Hee-Baek;Ko, Do-Heung
    • Phonetics and Speech Sciences
    • /
    • v.6 no.2
    • /
    • pp.47-54
    • /
    • 2014
  • The purpose of this study was to analyze spectral versus cepstral measurements in esophageal speakers. The comparison between the measurements in thirteen male esophageal speakers was compared with the control group of thirteen normal speakers using the sustained vowel /a/. The main results can be summarized as below: (a) the CPP and L/H ratio of the esophageal group were significantly lower than those of the control group (b) the CPP was significantly correlated with the spectral parameters such as jitter, shimmer, NHR and VTI, and (c) the ROC analysis showed that the threshold of 10.25dB for the CPP achieved a good classification for esophageal speakers, with 100% perfect sensitivity and specificity. Thus, it was known that cepstral-based acoustic measures such as CPP, may be more reliable predictors than other spectral-based acoustic measures such as jitter and shimmer. And it was found that cepstral-based acoustic measures were effective in distinguishing esophageal voice quality from normal voice quality. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation with laryngectomees.

A Comparative Study on Oral Fluency Between Korean Native Speakers and L2 Korean Learners in Speech Discourse - With Focus on Speech Rate, Pause, and Discourse Markers (발표 담화에서의 한국어 모어 화자와 한국어 학습자의 말하기 유창성 비교 연구 -발화 속도, 휴지, 담화표지를 중심으로-)

  • Lee, Jin;Jung, Jinkyung
    • Journal of Korean language education
    • /
    • v.29 no.4
    • /
    • pp.137-168
    • /
    • 2018
  • The purpose of this study is to prepare the basis for a more objective evaluation of oral fluency by comparing speech patterns of Korean native speakers and L2 Korean learners. For this purpose, the current study focused on the analysis of speech materials of the 21st century Sejong spoken corpus and Korean learner corpus. We compared the oral fluency of Korean native speakers and Korean learners based on speech rate, pause, and discourse markers. The results show that the pattern of Korean learners is different to that of Korean native speakers in all aspects of speech rate, pause, and discourse markers; even though proficiency of Korean leaners show increase, they could not reach the oral fluency level of Korean native speakers. At last, based on these results of the analysis, we added suggestions for setting the evaluation criteria of oral fluency of Korean learners.

Post-focus compression is not automatically transferred from Korean to L2 English

  • Liu, Jun;Xu, Yi;Lee, Yong-cheol
    • Phonetics and Speech Sciences
    • /
    • v.11 no.2
    • /
    • pp.15-21
    • /
    • 2019
  • Korean and English are both known to show on-focus pitch range expansion and post-focus pitch range compression (PFC). But it is not clear if this prosodic similarity would make it easy for Korean speakers to learn English focus prosody. In the present study, we conducted a production experiment using phone number strings to examine whether Korean learners of English produce a native-like focus prosody. Korean learners of English were classified into three groups (advanced, intermediate and low) according to their English proficiency and were compared to native speakers. Results show that intermediate and low groups of speakers did not increase duration, intensity, and pitch in the focus positions, nor did they compress those cues in the post-focus positions. Advanced speakers noticeably increased the acoustic cues in the focus positions to a similar extent as native speakers. However, their performance in post-focus positions was quite far from that of native speakers in terms of pitch and excursion size. These results thus demonstrate a lack of positive transfer of focus prosody from Korean to English in L2 learning, and learners may have to relearn it from scratch, which is consistent with a previous finding. More importantly, the results provide further support for the view proposed in other works that acoustic properties of PFC were not easily transferred from one language to another.

A Corpus-Based Analysis of Crosslinguistic Influence on the Acquisition of Concessive Conditionals in L2 English

  • Newbery-Payton, Laurence
    • Asia Pacific Journal of Corpus Research
    • /
    • v.3 no.1
    • /
    • pp.35-49
    • /
    • 2022
  • This study examines crosslinguistic influence on the use of concessive conditionals by Japanese EFL learners. Contrastive analysis suggests that Japanese native speakers may overuse the concessive conditional even if due to partial similarities to Japanese concessive conditionals, whose formal and semantic restrictions are fewer than those of English concessive conditionals. This hypothesis is tested using data from the written module of the International Corpus Network of Asian Learners of English (ICNALE). Comparison of Japanese native speakers with English native speakers and Chinese native speakers reveals the following trends. First, Japanese native speakers tend to overuse concessive conditionals compared to native speakers, while similar overuse is not observed in Chinese native speaker data. Second, non-nativelike uses of even if appear in contexts allowing the use of concessive conditionals in Japanese. Third, while overuse and infelicitous use of even if is observed at all proficiency levels, formal errors are restricted to learners at lower proficiency levels. These findings suggest that crosslinguistic influence does occur in the use of concessive conditionals, and that its particular realization is affected by L2 proficiency, with formal crosslinguistic influence mediated at an earlier stage than semantic cross-linguistic influence.

Acoustic analysis of Korean affricates produced by dysarthric speakers with cerebral palsy (뇌성마비 마비말장애 성인의 파찰음 실현 양상 분석)

  • Mun, Jihyun;Kim, Sunhee;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.13 no.2
    • /
    • pp.45-55
    • /
    • 2021
  • This study aims to analyze the acoustic characteristics of Korean affricates produced by dysarthric speakers with cerebral palsy. Korean fricatives and affricates are the consonants that are prone to errors in dysarthric speech, but previous studies have focused only on fricatives. For this study, three affricates /tɕ, tɕh, ͈tɕ/ appearing at word initial and intervocalic positions produced by six mild-moderate male speakers of spastic dysarthria are selected from a QOLT database constructed in 2014. The parameters representing the acoustic characteristics of Korean affricates were extracted by using Praat: frication duration, closure duration, center of gravity, variance, skewness, kurtosis, and central moment. The results are as follows: 1) frication duration of the intervocalic affricates produced by dysarthric speakers was significantly longer than that of the non-disordered speakers; 2) the closure duration of dysarthric speakers was significantly longer; 3) in the case of the center of gravity, there was no significant difference between the two groups; 4) the skewness of the dysarthric speakers was significantly larger; and 5) the central moment of dysarthric speakers was significantly larger. This study investigated the characteristics of the affricates produced by dysarthric speakers and differences with non-disordered speakers.

A study of L1 and L2 influences on the speech of Korean-English bilinguals: With special reference to VOT and F0 (한국어-영어 이중 언어 화자들의 L1과 L2 영향에 관한 연구: VOT와 F0와 관련해서)

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.13-26
    • /
    • 2015
  • Speech production studies have suggested that bilinguals who are L2-dominant are the most likely to suppress the influence of the first language (L1) on the second language (L2). The voice onset times (VOTs) and fundamental frequencies (f0s) of monolingual and bilingual speakers of English and Korean were examined to address the question whether cross language influences occur particularly in L2 predominant bilinguals and to compare their outcomes with those of L2 proficient bilinguals and monolinguals. A total of 28 speakers participated in this experiment and they produced English and Korean stops in the carrier sentence. In English, for voiceless aspirated and unaspirated stops, L2 predominant bilingual speakers produced VOTs that were significantly shorter than those of monolingual English speakers. The outcome was analogous in Korean speech. For aspirated and lax stops, they produced shorter Korean VOTs than monolingual speakers. The results of f0s were slightly different from those of VOTs. In English, L2 predominant bilinguals produced f0s that were not significantly different from those of monolingual English speakers. In Korean, however, they produced f0s that were significantly different from those of monolingual Korean speakers. Taken VOT and f0 into consideration together, the overall results suggest that, although they tend to show a corresponding pattern of monolinguals, L2 predominant bilinguals had cross language phonetic influences between L1 and L2, similar to L2 proficient bilinguals. Between the two acoustic cues, f0 seemed to be a more reliable cue than VOT to examine the influences.

INTONATION OF TAIWANESE: A COMPARATIVE OF THE INTONATION PATTERNS IN LI, IL, AND L2

  • Chin Chin Tseng
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.574-575
    • /
    • 1996
  • The theme of the current study is to study intonation of Taiwanese(Tw.) by comparing the intonation patterns in native language (Ll), target language (L2), and interlanguage (IL). Studies on interlanguage have dealt primarily with segments. Though there were studies which addressed to the issues of interlanguage intonation, more often than not, they didn't offer evidence for the statement, and the hypotheses were mainly based on impression. Therefore, a formal description of interlanguage intonation is necessary for further development in this field. The basic assumption of this study is that native speakers of one language perceive and produce a second language in ways closely related to the patterns of their first language. Several studies on interlanguage prosody have suggested that prosodic structure and rules are more subject to transfer than certain other phonological phenomena, given their abstract structural nature and generality(Vogel 1991). Broselow(1988) also shows that interlanguage may provide evidence for particular analyses of the native language grammar, which may not be available from the study of the native language alone. Several research questions will be addressed in the current study: A. How does duration vary among native and nominative utterances\ulcorner The results shows that there is a significant difference in duration between the beginning English learners, and the native speakers of American English for all the eleven English sentences. The mean duration shows that the beginning English learners take almost twice as much time (1.70sec.), as Americans (O.97sec.) to produce English sentences. The results also show that American speakers take significant longer time to speak all ten Taiwanese utterances. The mean duration shows that Americans take almost twice as much time (2.24sec.) as adult Taiwanese (1.14sec.) to produce Taiwanese sentences. B. Does proficiency level influence the performance of interlanguage intonation\ulcorner Can native intonation patterns be achieved by a non-native speaker\ulcorner Wenk(1986) considers proficiency level might be a variable which related to the extent of Ll influence. His study showed that beginners do transfer rhythmic features of the Ll and advanced learners can and do succeed in overcoming mother-tongue influence. The current study shows that proficiency level does play a role in the acquisition of English intonation by Taiwanese speakers. The duration and pitch range of the advanced learners are much closer to those of the native American English speakers than the beginners, but even advanced learners still cannot achieve native-like intonation patterns. C. Do Taiwanese have a narrower pitch range in comparison with American English speakers\ulcorner Ross et. al.(1986) suggests that the presence of tone in a language significantly inhibits the unrestricted manipulation of three acoustical measures of prosody which are involved in producing local pitch changes in the fundamental frequency contour during affective signaling. Will the presence of tone in a language inhibit the ability of speakers to modulate intonation\ulcorner The results do show that Taiwanese have a narrower pitch range in comparison with American English speakers. Both advanced (84Hz) and beginning learners (58Hz) of English show a significant narrower FO range than that of Americans' (112Hz), and the difference is greater between the beginning learners' group and native American English speakers.

  • PDF

Analysis of Acoustic Characteristics of Vowel and Consonants Production Study on Speech Proficiency in Esophageal Speech (식도발성의 숙련 정도에 따른 모음의 음향학적 특징과 자음 산출에 대한 연구)

  • Choi, Seong-Hee;Choi, Hong-Shik;Kim, Han-Soo;Lim, Sung-Eun;Lee, Sung-Eun;Pyo, Hwa-Young
    • Speech Sciences
    • /
    • v.10 no.3
    • /
    • pp.7-27
    • /
    • 2003
  • Esophageal Speech uses the esophageal air during phonation. Fluent esophageal speakers frequently intake air in oral communication, but unskilled esophageal speakers are difficult with swallowing lots of air. The purpose of this study was to investigate the difference of acoustic characteristics of vowel and consonants production according to the speech proficiency level in esophageal speech. 13 normal male speakers and 13 male esophageal speakers (5 unskilled esophageal speakers, 8 skilled esophageal speakers) with age ranging from 50 to 70 years old. The stimuli were sustained /a/ vowel and 36 meaningless two syllable words. Used vowel is /a/ and consonants were 18 : /k, n, t, m, p, s, c, $C^{h},\;k^{h},\;t^{h},\;p^{h}$, h, I, k', t', p', s', c'/. Fundermental frequency (Fx), Jitter, shimmer, HNR, MPT were measured with by electroglottography using Lx speech studio (Laryngograph Ltd, London, UK). 36 meaningless words produced by esophageal speakers were presented to 3 speech-language pathologists who phonetically transcribed their responses. Fx, Jitter, HNR parameters is significant different between skilled esophageal speakers and unskilled esophageal speakers (P<.05). Considering manner of articulation, ANOVA showed that differences in two esophageal speech groups on speech proficiency were significant; Glide had the highest number of confusion with the other phoneme class, affricates are the most intelligible in the unskilled esophageal speech group, whereas in the skilled esophageal speech group fricatives resulted highest number of confusions, nasals are the most intelligible. In the place of articulation, glottal /h/ is the highest confusion consonant in both groups. Bilabials are the most intelligible in the skilled esophageal speech, velars are the most intelligible in the unskilled esophageal speech. In the structure of syllable, 'CV+V' is more confusion in the skilled esophageal group, unskilled esophageal speech group has similar confusion in both structures. In unskilled esophageal speech, significantly different Fx, Jitter, HNR acoustic parameters of vowel and the highest confusions of Liquid, Nasals consonants could be attributed to unstable, improper contact of neoglottis as vibratory source and insufficiency in the phonatory air supply, and higher motoric demand of remaining articulation due to morphological characteristics of vocal tract after laryngectomy.

  • PDF