• Title/Summary/Keyword: Voice Speakers

Search Result 172, Processing Time 0.022 seconds

Fundamental Frequencies in Korean Elderly Speakers (한국 정상 노인 음성의 기본주파수)

  • Kim, Sun-Hai;Ko, Do-Heung
    • Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.95-102
    • /
    • 2008
  • Multiple physical changes of the larynx and its components occur with age. Vocal pitch, commonly expressed through measures of fundamental frequency (Fo) relate to physical conditions of the larynx. Available data is lacking for the senescent voice, and should be applied to the of changes of elderly speakers' Fo characteristics. The purpose of this study was to investigate the Fo of normal elderly speaker's voice. A total of 406 normal elderly speakers (207 males and 199 females) participated in this experiment. Age ranged from 60 years to 89 years. The subjects were asked to produce sustained corner vowels (/a/ /i/ /u/) three times each and the data were analyzed using the MDVP of CSL. According to the results of this study, the mean Fo from the ages of 60's to 80's shows 143.95Hz(SD 13.94) for men and 185.42Hz (SD 15.29) for women. For men, a significant change is found as a function of age in the Fo (F=16.181, p<.05). A post-hoc Scheffe test revealed significant differences between the Fo data of subjects aged 60's and 70's, 60's and 80's. For women, a significant change is found as a function of age in the Fo (F=49.013, p<.05). A post-hoc $Scheff'{e}$ test revealed significant differences between the Fo data of subjects in their 60's and 70's, 70's and 80's, 60's and 80's. The Fo of men goes up from their 60's to 80's gradually, whereas the Fo of women goes down gradually until their 70's, and after their 70's it again increases. It has been known that diminishing estrogen levels in women in old age may be a factor in lowering Fo, whereas diminishing testosterone levels in men may contribute to a rising Fo. This result may be used as some meaningful guideline and lead the basic data to differentiate between normal aged voice and aged voice disorders.

  • PDF

Tonal development and voice quality in the stops of Seoul Korean

  • Yu, Hye Jeong
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.91-99
    • /
    • 2018
  • Korean stops are currently undergoing a tonogenetic sound change, as found in the Seoul dialect in which a merged VOT of aspirated and lax stops induces F0 to be the primary cue for distinguishing the two stops and the lax stops have lower F0 than the aspirated stops. In tonal languages, low tone is produced with a breathy voice. This study investigated whether there are changes in voice quality with respect to the tonogenetic sound change of Korean stops. Two age groups speaking the Seoul dialect participated in this study: five females and six males born in the 1940s and 1950s and nine females and eight males born in the 1980s and 1990s. This study replicated previous findings of VOT and F0 and further examined H1-H2, H1-A1, and H1-A2 to see how they correlate with the sound change. In the older and younger generations, H1-H2, H1-A1, and H1-A2 were significantly lower after the tense stops than after the aspirated and lax stops, but they were not significantly different after the aspirated and lax stops. However, the younger females exhibited some different results for H1-H2 and H1-A2 than the older generation. In the younger females, the H1-H2 mean was higher after the aspirated stops than it was after the lax stops at the vowel onset, and the H1-H2 difference increased at the vowel midpoint. Although there was an inter-speaker variation in the results of H1-H2 and H1-A1, analyses of individual speakers showed that the H1-H2 and H1-A1 were higher after the lax stops than after the aspirated stops in the younger female speakers. These results indicate that lax stops tend to be breathier than aspirated stops in the younger female speakers. They also indicate that changes in voice quality are on Korean stops with tonal sound change, but are still developing.

Proposal of Hostile Command Attack Method Using Audible Frequency Band for Smart Speaker (스마트 스피커 대상 가청 주파수 대역을 활용한 적대적 명령어 공격 방법 제안)

  • Park, Tae-jun;Moon, Jongsub
    • Journal of Internet Computing and Services
    • /
    • v.23 no.4
    • /
    • pp.1-9
    • /
    • 2022
  • Recently, the functions of smart speakers have diversified, and the penetration rate of smart speakers is increasing. As it becomes more widespread, various techniques have been proposed to cause anomalous behavior against smart speakers. Dolphin Attack, which causes anomalous behavior against the Voice Controllable System (VCS) during various attacks, is a representative method. With this method, a third party controls VCS using ultrasonic band (f>20kHz) without the user's recognition. However, since the method uses the ultrasonic band, it is necessary to install an ultrasonic speaker or an ultrasonic dedicated device which is capable of outputting an ultrasonic signal. In this paper, a smart speaker is controlled by generating an audio signal modulated at a frequency (18 to 20) which is difficult for a person to hear although it is in the human audible frequency band without installing an additional device, that is, an ultrasonic device. As a result with the method proposed in this paper, while humans could not recognize voice commands even in the audible band, it was possible to control the smart speaker with a probability of 82 to 96%.

Analysis of the Voice Quality in Emotional Speech Using Acoustical Parameters (음향 파라미터에 의한 정서적 음성의 음질 분석)

  • Jo, Cheol-Woo;Li, Tao
    • MALSORI
    • /
    • v.55
    • /
    • pp.119-130
    • /
    • 2005
  • The aim of this paper is to investigate some acoustical characteristics of the voice quality features from the emotional speech database. Six different parameters are measured and compared for 6 different emotions (normal, happiness, sadness, fear, anger, boredom) and from 6 different speakers. Inter-speaker variability and intra-speaker variability are measured. Some intra-speaker consistency of the parameter change across the emotions are observed, but inter-speaker consistency are not observed.

  • PDF

Voice Personality Transformation Using an Optimum Classification and Transformation (최적 분류 변환을 이용한 음성 개성 변환)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.5
    • /
    • pp.400-409
    • /
    • 2004
  • In this paper. a voice personality transformation method is proposed. which makes one person's voice sound like another person's voice. To transform the voice personality. vocal tract transfer function is used as a transformation parameter. Comparing with previous methods. the proposed method makes transformed speech closer to target speaker's voice in both subjective and objective points of view. Conversion between vocal tract transfer functions is implemented by classification of entire vector space followed by linear transformation for each cluster. LPC cepstrum is used as a feature parameter. A joint classification and transformation method is proposed, where optimum clusters and transformation matrices are simultaneously estimated in the sense of a minimum mean square error criterion. To evaluate the performance of the proposed method. transformation rules are generated from 150 sentences uttered by three male and on female speakers. These rules are then applied to another 150 sentences uttered by the same speakers. and objective evaluation and subjective listening tests are performed.

The Production and Perception of the Korean Stops by English Learners (영어권 화자의 국어 폐쇄음 발화와 지각)

  • Kim, Kee-Ho;Park, Yoon-Jin;Chun, Yun-Sil
    • Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.51-67
    • /
    • 2006
  • This study examined the acoustic properties of initial stops in Korean, produced by Korean native speakers and English Korean learners. The productions of Korean native speakers were compared with those of beginners and advanced learners of Korean. Fundamental frequency(F0) and Voice Onset Time(VOT) were measured in condition of one or two syllable words, containing word-initial lenis, fortis, and aspirated stops. English Korean Learners showed that they produced stops with relatively shorter VOT and lower F0, compared with those of Korean native speakers. In case of the manner of articulation, English Korean learners have production difficulties in order of lenis stops, aspirated stops, and fortis stops. In regard to the place of articulation, English Korean learners showed production troubles in order of labial stops, velar stops, and alveolar stops. In the experiment of perception, it is hard for English Korean learners to distinguish stops of lenis and aspirated. Therefore, the results of production experiment were almost consistent with those of the perception experiment. Finally, according to both groups of proficiency, the results demonstrated that the advanced learners produce or perceive Korean stops easier than the beginners.

  • PDF

The effects of length of residence (LOR) on voice onset time (VOT)

  • Kim, Mi-Ryoung
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.9-17
    • /
    • 2020
  • Changes in the first language (L1) sound system as a result of acquiring a second language (L2) (i.e., phonetic drift) have received considerable attention from a variety of speakers, settings, and environments. Less attention has been given to phonetic drift in adult speakers' L2 learning as their length of residence in America (LOR) increases. This study examines the effects of LOR on voice onset time (VOT) in L1 Korean stops. Three different groups of Korean adult learners of L2 English were compared to assess how malleable their L1 representations are in terms of LOR and whether there is any relationship between L1 change and L2 acquisition. The results showed that the effect of LOR was linguistically unimportant in the production of Korean stops. However, VOT merger as evidence of sound change in Korean stops were robust in the speech production of most of the female speakers across the groups. The results suggest that L2 English may not be the primary cause of L1 sound change. For generalizability, further study is necessary to see whether other acoustic cues show a similar pattern.

Korean Speakers' Pronunciation and Pronunciation Training of English Stops (한국인의 영어 폐쇄음 발화와 발화 훈련)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences
    • /
    • v.2 no.3
    • /
    • pp.29-36
    • /
    • 2010
  • The purposes of this study are (1) to see if language transfer effect is found in Korean speakers' pronunciation of English stops and to correct them and (2) to investigate the effectiveness of mimicry training and Speech Analyzer training on subjects' pronunciation of English stops. For these purposes, 20 Korean speakers' VOT values of English stops were measured using Speech Analyzer and their post-training production was compared with their pre-training production. The result shows that Korean speakers have no difficulty in correcting pronunciation errors of English voiceless stops and voiced stops and such a result indicates that language transfer effect is not noticed as expected. In addition, the result of pronunciation training shows that the training using Speech Analyzer is more effective than mimicry training.

  • PDF

A Study On Fomants of Voice Imitation (모방발화의 모음 포만트 연구)

  • Ahn, Byoung-Seob;Shin, Ji-Young;Kang, Sun-Mee
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.209-213
    • /
    • 2004
  • The aim of this paper is to analyze vowel in voice imitation, and to find the invariable phonetic features of the speaker. In this paper we examined the formants of vowel /a, u, i/. The results of the present are as follows : (1) Speakers change their vocal tract cavity features. (2) F1 changes easily compared to $F2{\sim}F3{\sim}F4$. (3) F3-F2 appears to be constituent for a speakers identification in vowel /a/ and F4-F2 in vowel /i/.

  • PDF

Effects of EAI and VAS on perceptual judgement and confidence rating by listeners for voice disorders (청지각적 평가 방식에 따른 음성장애 심한 정도 판단과 자가 신뢰도에 대한 차이)

  • Lee, Ok-Bun;Kim, Sun-Hee;Jeong, Hanjin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.5
    • /
    • pp.3046-3050
    • /
    • 2014
  • The purpose of the present study was to evaluate the effect of 7-point interval scale(EAI) and visual analogue scale(VAS) on perceptual judgement and the reliability of severity on voice problems by dysphonic speakers. 30 undergraduate students studying communication disorder were enrolled in the perceptual evaluation. Those listeners judged overall voice severity within the anchored(condition 1) and non-anchored scales(condition 2) for vowel prolongation and reading tasks by 25 speakers with voice disorder. The results of this study showed that the scores by VAS was significantly higher than EAI in both condition 1 and condition 2 for vowel prolongation and reading task. However, the scores by EAI method was higher than by VAS method on voice severity of vowel prolongation (condition 1) and reading task(condition 2). These results suggest auditory-perceptual scaling procedures must be more studied in the aspects of clinical application of voice disorder.