• Title/Summary/Keyword: Voice Speakers

Search Result 170, Processing Time 0.025 seconds

An Acoustic Phonetic Study about Voice Imitation(2) -Focusing on Prosody Feature- (모방발화에 대한 음향음성학적 연구(2) -운율 특징을 중심으로-)

  • Park Miyoung;Park Jihye;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.56-60
    • /
    • 2003
  • The purpose of this paper is to research voice imitation. Voice imitation changes various phonetic feature. Also, in our experimental results, voice imitation has preferential prosody difference. For imitating voice, imitators change their fundamental frequency bandwidths for the most part. Imitative speakers change their high fundamental frequencies effectively while they maintain their low fundamental frequencies. Also, excellent group is distinctly superior to common group for imitating prosodic patterns. That is, the f0 bandwidth's change and the prosodic patterns are significant in imitating voice. But the low f0 is maintain by all speakers.

  • PDF

Voice onset time in English and Korean stops with respect to a sound change

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.13 no.2
    • /
    • pp.9-17
    • /
    • 2021
  • Voice onset time (VOT) is known to be a primary acoustic cue that differentiates voiced from voiceless stops in the world's languages. While much attention has been given to the sound change of Korean stops, little attention has been given to that of English stops. This study examines VOT of stop consonants as produced by English speakers in comparison to Korean speakers to see whether there is any VOT change for English stops and how the effects of stop, place, gender, and individual on VOT differ cross-linguistically. A total of 24 native speakers (11 Americans and 13 Koreans) participated in this experiment. The results showed that, for Korean, the VOT merger of lax and aspirated stops was replicated, and, for English, voiced stops became initially devoiced and voiceless stops became heavily aspirated. English voiceless stops became longer in VOT than Korean counterparts. The results suggest that, similar to Korean stops, English stops may also undergo a sound change. Since it is the first study to be revealed, more convincing evidence is necessary.

The relationship between cross language phonetic influences and L2 proficiency in terms of VOT

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.3-10
    • /
    • 2011
  • This study examined the production of aspirated stop consonants in Korean and English words to address how the influences differed particularly in terms of proficiency in L2 English. Voice onset times (VOTs) were measured from two American monolinguals and seven Korean speakers. The results showed that VOT patterns for both L1 and L2 stops differed according to their proficiency in L2 English. In L2 English, high proficient speakers produced VOTs that were similar to those of native speakers of English whereas low proficient speakers produced VOTs that were significantly longer than those of proficient speakers. In L1 Korean and L2 English, most of the proficient speakers produced VOTs similarly. Unlike previous findings, Korean VOTs were even shorter than English counterparts. The VOT shortening of aspirated stops in Korean was found for most of the proficient speakers. The findings of the present study suggest that cross language phonetic influences as well as the ongoing VOT shortening in Korean aspirated stops may be correlated with L2 proficiency. Since this is a pilot study with a small number of subjects for each proficiency group, further quantitative study is necessary to generalize.

  • PDF

A Study of the Correlation between Subjective and Objective Evaluation of Voice Disorders (음성장애 주관적 평가와 객관적 평가 간의 상관성 연구)

  • Lee, Ok-Bun;Kim, So-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.167-172
    • /
    • 2011
  • The purpose of this study was to examine the relationship between subjective and objective evaluation in speakers with voice disorders. Subjective evaluation indicates the self-reports of voice problems by dysphonic speakers. The relating protocol is the Voice Handicap Index (VHI) and the self-awareness index of voice problems (SAIVP-14). A total of 48 individuals with voice disorders replied to the questionnaire and participated in a voice assessment. Objective evaluations included the perceptual judgement of G grade in GRBAS, acoustic measurements (jitter, shimmer, NHR) by MDVP (CSL 4400), and aerodynamic measurements (MPT, MFR, psub) by PAS (Phonatory Aerodynamic System, KayPentax, USA). Pearson and Spearman correlations were used for the analysis. In the correlation with perceptual judgement (G grade) and VHI-Total, VHI-Physical, and SAIVP-14, there was a significant correlation, but the overall correlation was poor. NHR, jitter, and shimmer were significantly correlated with overall VHI and SAIVP-14. Specifically, the correlation with shimmer was stronger compared to the other measurements. In aerodynamic measures, MFR and MPT showed a significant correlation with VHI-Total, VHI-Emotional, and SAIVP-14, but their correlation was poor. The results of this study suggested that subjective evaluation of self voice problems is meaningfully correlated with objective evaluations, but more data in the multidimensional voice assessment should be collected and analyzed for the reliability and validity of the voice handicap questionnaire.

  • PDF

Effect of Age on the Voice Onset Time of Korean Stops in VCV contexts (연령에 따른 VCV 문맥에서 한국어 폐쇄음의 성대진동개시시간)

  • Lee, Seulgi;Lee, Youngmee
    • Phonetics and Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.37-44
    • /
    • 2015
  • This study investigated the effects of the age of Korean speakers, place of articulation, and phonation types on voice onset time (VOT) of stops. Twenty-five preschoolers, 25 schoolers, and 25 adults who had no history of speech and language impairment produced plosives in /VCV/ words in isolation. A three-way ($3{\times}3{\times}3$) mixed design was used with the age of speakers (preschoolers, schoolers, adults) as a between-subject factor, the place of articulation (bilabials, alveolars, velars) and phonation types (plain, tense, aspirated consonants) as a within-subject factor. The dependent measure was the VOT values. Results revealed that three main effects were statistically significant. Preschoolers exhibited longer VOTs than adults (p<.05). There were significant differences in VOTs among the place of articulation, showing that speakers had the longest VOTs for velars (velars > alvelars > bilabials) (all p<.05). In addition, the VOTs for aspirated consonants were longer than those for plain and tense consonants, and the differences were significant among three phonation types (aspirated > tense > plain) (all p<.05). The current results suggested that VOTs would be linked to age and development, and schoolers over the age of 11 years had achieved adult-like VOTs. Moreover, the place of articulation and phonation types in Korean stops showed marked factors in normal speakers' VOT patterns.

Acoustic Characteristics of Patients with Total Laryngectomees via Voice Rehabilitation Techniques (후두적출술 환자의 발성법에 따른 음향학적 특성)

  • Jang, Hyo-Ryung;Shim, Hee-Jeong;Ko, Do-Heung
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.25-32
    • /
    • 2013
  • This research is aimed at finding the acoustic characteristics of different voice rehabilitation techniques, the electrolaryx (EL), standard esophageal (SE), and tracheoesophageal (TE), used on 17 patients with laryngectomees. The analysis of the voice qualities was achieved using MDVP. In order to compare the acoustic characteristics, patients were asked to produce the vowel /a/ sound. The acoustic analysis included fundamental frequency (f0), jitter, shimmer, and noise-to-harmonic ratio (NHR). The main acoustic results showed no significant statistical differences between the average measurements of SE and TE speakers. It was found that the current study showed the same tendency found in previous studies. There was also a significant difference between SE and EL speakers. On the other hand, there were no significant statistical differences between the average measurements of TE and EL speakers on all acoustic measurements. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation for patients with laryngectomees. In future, the present findings and issues should be considered in the context of gender. Specifically, the number of women who are diagnosed with laryngeal cancer continues to rise and their acoustic characteristics may indeed differ from those of men.

A study on the voice onset times of the Seoul Corpus males in their twenties (서울 코퍼스 20대 남성의 성대진동 개시시간 연구)

  • Lee, Yuri;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.1-8
    • /
    • 2016
  • The purpose of this work is to examine the voice onset times (VOTs) of the three types of plosives from the Seoul Corpus male speakers in their twenties. In addition, the factors known to affect VOTs were analyzed, including the place and manner of articulation, speakers, location in words, type of following vowels and speech rates calculated from the three consecutive words. Much of the findings agreed with those from earlier studies on Korean and other languages and new discoveries were made.

The Relationship Between Voice and the Image Triggered by the Voice: American Speakers and American Listeners (목소리를 듣고 감지하는 인상에 대한 연구: 미국인화자와 미국인청자)

  • Moon, Seung-Jae
    • Phonetics and Speech Sciences
    • /
    • v.1 no.2
    • /
    • pp.111-118
    • /
    • 2009
  • The present study aims at investigating the relationship between voices and the physical images triggered by the voices. It is the final part of a four-part series and the results reported in the present study are limited to those of American speakers and American listeners. Combined with the results from previous studies (Moon, 2000; Moon, 2002; Tak, 2005), the results suggest that (1) there is a very strong, much higher than chance-level relationship between voices and the pictures chosen for the voices by the perception experiment subjects; (2) the more physical characteristics that are given, the better the chance for correctly matching voices with pictures; and (3) culture (in the present, language environment) seems to play a role in conjuring up the mental images from voices.

  • PDF

Voice Quality of Dysarthric Speakers in Connected Speech (연결발화에서 마비말화자의 음질 특성)

  • Seo, Inhyo;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.5 no.4
    • /
    • pp.33-41
    • /
    • 2013
  • This study investigated the perceptual and cepstral/spectral characteristics of phonation and their relationships in dysarthria in connected speech. Twenty-two participants were divided into two groups; the eleven dysarthric speakers were paired with matching age and gender healthy control participants. A perceptual evaluation was performed by three speech pathologists using the GRBAS scale to measure the cepstrual/spectral characteristics of phonation between the two groups' connected speech. Correlations showed dysarthric speakers scored significantly worse (with a higher rating) with severities in G (overall dysphonia grade), B (breathiness), and S (strain), while the smoothed prominence of the cepstral peak (CPPs) was significantly lower. The CPPs were significantly correlated with the perceptual ratings, including G, B, and S. The utility of CPPs is supported by its high relationship with perceptually rated dysphonia severity in dysarthric speakers. The receiver operating characteristic (ROC) analysis showed that the threshold of 5.08 dB for the CPPs achieved a good classification for dysarthria, with 63.6% sensitivity and the perfect specificity (100%). Those results indicate the CPPs reliably distinguished between healthy controls and dysarthric speakers. However, the CPP frequency (CPP F0) and low-high spectral ratio (L/H ratio) were not significantly different between the two groups.

An Integrated Framework for Modeling the Influential Factors Affecting the Use of Voice-Enabled IoT Devices: A Case Study of Amazon Echo

  • Temidayo Oluwapelumi Shofolahan;Juyoung Kang
    • Asia pacific journal of information systems
    • /
    • v.28 no.4
    • /
    • pp.320-349
    • /
    • 2018
  • Purpose: The application of IoT is finding continuous acceptance in our daily lives, particularly, smart speakers are making life easier and convenient for consumers. This research aims to develop and test an integrated model of factors influencing consumer's adoption of voice-enabled IoT devices. Design/methodology/approach: Based on the VAM, an integrated voice-enabled IoT device adoption model is proposed. Gender differences on five constructs relating with perceived value (perceived usefulness, perceived enjoyment, perceived security risk, perceived technicality and perceived cost) was also examined through PLS-MGA technique. The usage experience of consumers was also controlled in the integrated VAM. Findings: Result shows that Perceived-Usefulness, Perceived-Enjoyment and Perceived-Cost have a strong effect on Perceived-Value. However, Perceived-Technicality and Perceived-Security-Risk are non-influential and have no significant effect on PV. Additionally, Perceived-Value and Social-Influence plays a significant role in predicting adoption intention. Gender differences also exist in consumers perception of usefulness, enjoyment and cost. In comparison to the basic value-based adoption model, the integrated model provides more insight on consumers adoption of voice-enabled IoT devices. Originality/value: Using an integrated model, this study is one of the first scholarly attempt at modelling the influential factors for adopting smart speakers i.e., voice-enabled IoT devices, with implications for improved adoption.