• Title/Summary/Keyword: Speaker characteristics

Search Result 257, Processing Time 0.027 seconds

The effect of word length on f0 intervals: Evidence from North Kyungsang children

  • Kim, Jungsun
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.107-116
    • /
    • 2015
  • The present experiment investigated the effect of word length on the length of f0 intervals for North Kyungsang children. In order to find out the lengths of the f0 intervals, the f0 values at the midpoints of vowels in words were measured. F0 estimates were computed as intervals consistent with the logarithmic scale corresponding to the number of syllables in the words. The results indicated that the mean f0 intervals in words of different lengths showed a significant difference for the HH in HH vs. HHL and the LH in LH vs. LLH for North Kyungsang children. Adult speakers from the North Kyungsang region significantly differed only within the HH in HH vs. HHL. Adult speakers made a noticeable contribution in this characteristic from the children. The result of the adult study was presented to confirm whether the children used a North Kyungsang dialect. With respect to individual speaker differences, the North Kyungsang children showed more or less consistent patterns in quantile-quantile plots for the HH vs. HHL, but for the HL vs. LHL and LH vs. LLH, there were more variations than for the HH vs. HHL. The individual speakers' variation was the largest for the HL vs. LHL and the smallest for HH vs. HHL. Considering these results, the effect of word length on f0 intervals tended to show pitch accent-type-specific characteristics in the process of prosodic acquisition.

Effects of Prosodic Strengthening on the Production of English High Front Vowels /i, ɪ/ by Native vs. Non-Native Speakers (원어민과 비원어민의 영어 전설 고모음 /i, ɪ/ 발화에 나타나는 운율 강화 현상)

  • Kim, Sahyang;Hur, Yuna;Cho, Taehong
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.129-136
    • /
    • 2013
  • This study investigated how acoustic characteristics (i.e., duration, F1, F2) of English high front vowels /i, ɪ/ are modulated by boundary- and prominence-induced strengthening in native vs. non-native (Korean) speech production. The study also examined how the durational difference in vowels due to the voicing of a following consonant (i.e., voiced vs. voiceless) is modified by prosodic strengthening in two different (native vs. non-native) speaker groups. Five native speakers of Canadian English and eight Korean learners of English (intermediate-advanced level) produced 8 minimal pairs with the CVC sequence (e.g., 'beat'-'bit') in varying prosodic contexts. Native speakers distinguished the two vowels in terms of duration, F1, and F2, whereas non-native speakers only showed durational differences. The two groups were similar in that they maximally distinguished the two vowels when the vowels were accented (F2, duration), while neither group showed boundary-induced strengthening in any of the three measurements. The durational differences due to the voicing of the following consonant were also maximized when accented. The results are discussed further in terms of phonetics-prosody interface in L2 production.

The change of vowel characteristics for the dysarthric speech along with speaking style (경도 마비말장애 환자의 발화 유형에 따른 모음 특성 비교)

  • Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.51-59
    • /
    • 2016
  • The purpose of present study is to examine differences between habitual speech (HS) and clear speech (CS) in individuals with mild dysarthria. Twelve speakers with mild dysarthria and twelve healthy control speakers read sentences in two speaking styles. Formant and intensity related values, triangular area, and center of gravity of /a/, /i/, and /u/ were measured. In addition, formant-ratio variables such as vowel space area(VSA), vowel articulatory index (VAI), formant centralization ratio (FCR) and F2i/F1u ratio (F2 ratio) were calculated. The results of repeated-measures ANOVA showed a significant difference in F2 of vowel /i/ and F2 energy of vowel /a/ between groups. Regarding formant energy, F2 energy of vowel /a/ were observed as meaningful variables between speaking styles. There were significant speaking style-by-group interactions for F2 energy of vowel /a/. These findings indicated that current parameters could discriminate healthy group and mild dysarthria group meaningfully and that speaker with dysarthria had larger clear speech benefit than healthy talkers. We also claim that various acoustic changes of clear speech may contribute to improving vowel intelligibility.

Speech sound and personality impression (말소리와 성격 이미지)

  • Lee, Eunyung;Yuh, Heaok
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.59-67
    • /
    • 2017
  • Regardless of their intention, listeners tend to assess speakers' personalities based on the sounds of the speech they hear. Assessment criteria, however, have not been fully investigated to indicate whether there is any relationship between the acoustic cue of produced speech sounds and perceived personality impression. If properly investigated, the potential relationship between these two will provide crucial insights on the aspects of human communications and further on human-computer interaction. Since human communications have distinctive characteristics of simultaneity and complexity, this investigation would be the identification of minimum essential factors among the sounds of speech and perceived personality impression. The purpose of this study, therefore, is to identify significant associations between the speech sounds and perceived personality impression of speaker by the listeners. Twenty eight subjects participated in the experiment and eight acoustic parameters were extracted by using Praat from the recorded sounds of the speech. The subjects also completed the Neo-five Factor Inventory test so that their personality traits could be measured. The results of the experiment show that four major factors(duration average, pitch difference value, pitch average and intensity average) play crucial roles in defining the significant relationship.

Design of Model to Recognize Emotional States in a Speech

  • Kim Yi-Gon;Bae Young-Chul
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.1
    • /
    • pp.27-32
    • /
    • 2006
  • Verbal communication is the most commonly used mean of communication. A spoken word carries a lot of informations about speakers and their emotional states. In this paper we designed a model to recognize emotional states in a speech, a first phase of two phases in developing a toy machine that recognizes emotional states in a speech. We conducted an experiment to extract and analyse the emotional state of a speaker in relation with speech. To analyse the signal output we referred to three characteristics of sound as vector inputs and they are the followings: frequency, intensity, and period of tones. Also we made use of eight basic emotional parameters: surprise, anger, sadness, expectancy, acceptance, joy, hate, and fear which were portrayed by five selected students. In order to facilitate the differentiation of each spectrum features, we used the wavelet transform analysis. We applied ANFIS (Adaptive Neuro Fuzzy Inference System) in designing an emotion recognition model from a speech. In our findings, inference error was about 10%. The result of our experiment reveals that about 85% of the model applied is effective and reliable.

Some articulatory reflexes observed in intervocalic consonantal sequences: Evidence from Korean place assimilation

  • Son, Minjung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.2
    • /
    • pp.17-27
    • /
    • 2020
  • This paper examines kinematic characteristics of /pk/ clusters, as compared to /kk/ and /pp/ with varying vowel contexts and speech rate. The results of EMMA data from eight Seoul-Korean speakers indicate as follows. Firstly, comparing /pk/ to /pp/ sequences, lips closing movement was faster and spatially greater in the /a/-to-/a/ context while temporally longer in the /i/-to-/i/ context. It was smaller in spatial displacement and shorter in temporal duration in /pk/ sequences. Peak velocity did not vary. Secondly, comparing /pk/ with /pp/ and /kk/ controls, lip aperture was less constricted in the /a/-to-/a/ context than /i/-to-/i/, but the maximum contact between the upper and lower lips was invariant across different vocalic contexts within /pk/ sequences (/apka/=/ipki/). Categorical reduction of C1 in /pk/ sequences fell in with the low-vowel and fast-rate conditions with across-/within-speaker variability. Gradient reduction of C1 was observed in all C1C2 types, being more frequent in fast rate. Lastly, the jaw articulator was a stable indicator of rate effects. The implication of the current study is that gestural reduction occurs with categorical reduction and general spatiotemporal weakening in the assimilating contexts, while quantitative properties of gestures may be a reason for gradient reduction, not necessarily confined to place assimilation.

A Basic Study on the Development of a Grading Scale of Discourse Competence in Korean Speaking Assessment -Focusing on the Scale of 'REFUSAL' Task (한국어 말하기 평가에서 '담화 능력' 등급 기술을 위한 기초 연구 -'부탁'에 대한 '거절하기' 과제를 중심으로-)

  • Lee, Haeyong;Lee, Hyang
    • Journal of Korean language education
    • /
    • v.29 no.3
    • /
    • pp.255-292
    • /
    • 2018
  • Most grading scales of Korean language proficiency tests are based on existing grading scales that are not empirically verified. The purpose of this study is to develop an empirically verified scale descriptor. The 'Performance data-driven approach' that is suggested by Fulcher (1987) was used to develop the detailed description of characteristics for each level of performance. This study is focused on the functional phase of speech samples analysis (coding data) to create explanatory categories of discourse skills into which individual observations of speech phenomena can be scored. The speech samples that were collected through this study demonstrated stages of speech that can be a foundation of a grading scale. The data used in the study was collected from 23 native speakers of Korean. Speech samples were recorded from simulated speaking tests using the 'REFUSAL' task, and transcribed for analysis. The transcript was analyzed using discourse analysis. The result showed that the 'REFUSAL' task needs to go through four functional phases in actual communication. Furthermore, this study found specific and detailed explanatory categories of discourse competence based on the actual native speaker's speech data. Such findings are expected to contribute to the development of more valid and reliable speaking assessment.

English vowel production conditioned by probabilistic accessibility of words: A comparison between L1 and L2 speakers

  • Jonny Jungyun Kim;Mijung Lee
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.1-7
    • /
    • 2023
  • This study investigated the influences of probabilistic accessibility of the word being produced - as determined by its usage frequency and neighborhood density - on native and high-proficiency L2 speakers' realization of six English monophthong vowels. The native group hyperarticulated the vowels over an expanded acoustic space when the vowel occurred in words with low frequency and high density, supporting the claim that vowel forms are modified in accordance with the probabilistic accessibility of words. However, temporal expansion occurred in words with greater accessibility (i.e., with high frequency and low density) as an effect of low phonotactic probability in low-density words, particularly in attended speech. This suggests that temporal modification in the opposite direction may be part of the phonetic characteristics that are enhanced in communicatively driven focus realization. Conversely, none of these spectral and temporal patterns were found in the L2 group, thereby indicating that even the high-proficiency L2 speakers may not have developed experience-based sensitivity to the modulation of sub-categorical phonetic details indexed with word-level probabilistic information. The results are discussed with respect to how phonological representations are shaped in a word-specific manner for the sake of communicatively driven lexical intelligibility, and what factors may contribute to the lack of native-like sensitivity in L2 speech.

A Study on the Teaching and Learning of Korean Modality Expressions (한국어의 양태 표현 교육 연구 : 한국어 '-(으)ㄹ 수 있다'와 중국어 '능(能)'의 대조를 중심으로)

  • Jiang, Fei
    • Korean Educational Research Journal
    • /
    • v.40 no.1
    • /
    • pp.17-42
    • /
    • 2019
  • Modality is the psychological attitude of the speaker, which is comprised by the sentences used in every language. Modality can be broadly categorized as perceptional modality and obligatory modality. This study summarizes the previous related literatures and theoretical branches of Korean linguistic studies. The study also proposes and classifies a modal concept on the Korean language, which is aimed at aiding Chinese people who are studying Korean. It further describes characteristics and expressions of modality in both the Chinese and Korean languages. This study aims to develop an effective teaching-learning program on the basis of the contrastive analysis between Korean language's modality, "-(으)ㄹ 수 있다," and the corresponding Chinese auxiliary verb, "能." Modality is a syntax item that reflects a speaker's subjective manner. There are many grammatical facets in Korean language books and teaching materials that are modal in nature. Further, modalities in Korean language are not only numerous but also have very rich meanings and functions. Based on the contrastive analysis, this study designs an effective teaching plan for Chinese people learning the Korean language. The designed system uses specific conversational occasions as the basis of learning, and it adapts the Korean language's modal system to classroom teaching. The system is expected to be effective during classroom teaching for demonstrating and learning modality in the Korean language.

  • PDF

A study on the connected-digit recognition using MLP-VQ and Weighted DHMM (MLP-VQ와 가중 DHMM을 이용한 연결 숫자음 인식에 관한 연구)

  • Chung, Kwang-Woo;Hong, Kwang-Seok
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.8
    • /
    • pp.96-105
    • /
    • 1998
  • The aim of this paper is to propose the method of WDHMM(Weighted DHMM), using the MLP-VQ for the improvement of speaker-independent connect-digit recognition system. MLP neural-network output distribution shows a probability distribution that presents the degree of similarity between each pattern by the non-linear mapping among the input patterns and learning patterns. MLP-VQ is proposed in this paper. It generates codewords by using the output node index which can reach the highest level within MLP neural-network output distribution. Different from the old VQ, the true characteristics of this new MLP-VQ lie in that the degree of similarity between present input patterns and each learned class pattern could be reflected for the recognition model. WDHMM is also proposed. It can use the MLP neural-network output distribution as the way of weighing the symbol generation probability of DHMMs. This newly-suggested method could shorten the time of HMM parameter estimation and recognition. The reason is that it is not necessary to regard symbol generation probability as multi-dimensional normal distribution, as opposed to the old SCHMM. This could also improve the recognition ability by 14.7% higher than DHMM, owing to the increase of small caculation amount. Because it can reflect phone class relations to the recognition model. The result of my research shows that speaker-independent connected-digit recognition, using MLP-VQ and WDHMM, is 84.22%.

  • PDF