• Title/Summary/Keyword: fundamental frequency of speech

Search Result 205, Processing Time 0.03 seconds

Fundamental Frequencies in Korean Elderly Speakers (한국 정상 노인 음성의 기본주파수)

  • Kim, Sun-Hai;Ko, Do-Heung
    • Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.95-102
    • /
    • 2008
  • Multiple physical changes of the larynx and its components occur with age. Vocal pitch, commonly expressed through measures of fundamental frequency (Fo) relate to physical conditions of the larynx. Available data is lacking for the senescent voice, and should be applied to the of changes of elderly speakers' Fo characteristics. The purpose of this study was to investigate the Fo of normal elderly speaker's voice. A total of 406 normal elderly speakers (207 males and 199 females) participated in this experiment. Age ranged from 60 years to 89 years. The subjects were asked to produce sustained corner vowels (/a/ /i/ /u/) three times each and the data were analyzed using the MDVP of CSL. According to the results of this study, the mean Fo from the ages of 60's to 80's shows 143.95Hz(SD 13.94) for men and 185.42Hz (SD 15.29) for women. For men, a significant change is found as a function of age in the Fo (F=16.181, p<.05). A post-hoc Scheffe test revealed significant differences between the Fo data of subjects aged 60's and 70's, 60's and 80's. For women, a significant change is found as a function of age in the Fo (F=49.013, p<.05). A post-hoc $Scheff'{e}$ test revealed significant differences between the Fo data of subjects in their 60's and 70's, 70's and 80's, 60's and 80's. The Fo of men goes up from their 60's to 80's gradually, whereas the Fo of women goes down gradually until their 70's, and after their 70's it again increases. It has been known that diminishing estrogen levels in women in old age may be a factor in lowering Fo, whereas diminishing testosterone levels in men may contribute to a rising Fo. This result may be used as some meaningful guideline and lead the basic data to differentiate between normal aged voice and aged voice disorders.

  • PDF

Emotion Recognition Based on Frequency Analysis of Speech Signal

  • Sim, Kwee-Bo;Park, Chang-Hyun;Lee, Dong-Wook;Joo, Young-Hoon
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.2 no.2
    • /
    • pp.122-126
    • /
    • 2002
  • In this study, we find features of 3 emotions (Happiness, Angry, Surprise) as the fundamental research of emotion recognition. Speech signal with emotion has several elements. That is, voice quality, pitch, formant, speech speed, etc. Until now, most researchers have used the change of pitch or Short-time average power envelope or Mel based speech power coefficients. Of course, pitch is very efficient and informative feature. Thus we used it in this study. As pitch is very sensitive to a delicate emotion, it changes easily whenever a man is at different emotional state. Therefore, we can find the pitch is changed steeply or changed with gentle slope or not changed. And, this paper extracts formant features from speech signal with emotion. Each vowels show that each formant has similar position without big difference. Based on this fact, in the pleasure case, we extract features of laughter. And, with that, we separate laughing for easy work. Also, we find those far the angry and surprise.

Relationship between executive function and cue weighting in Korean stop perception across different dialects and ages

  • Kong, Eun Jong;Lee, Hyunjung
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.21-29
    • /
    • 2021
  • The present study investigated how one's cognitive resources are related to speech perception by examining Korean speakers' executive function (EF) capacity and its association with voice onset time (VOT) and f0 sensitivity in identifying Korean stop laryngeal categories (/t'/ vs. /t/ vs. /th/). Previously, Kong et al. (under revision) reported that Korean listeners (N = 154) in Seoul and Changwon (Gyeongsang) showed differential group patterns in dialect-specific cue weightings across educational institutions (college, high school, and elementary school). We follow up this study by further relating their EF control (working memory, mental flexibility, and inhibition) to their speech perception patterns to examine whether better cognitive ability would control attention to multiple acoustic dimensions. Partial correlation analyses revealed that better EFs in Korean listeners were associated with greater sensitivity to available acoustic details and with greater suppression of irrelevant acoustic information across subgroups, although only a small set of EF components turned out to be relevant. Unlike Seoul participants, Gyeongsang listeners' f0 use was not correlated with any EF task scores, reflecting dialect-specific cue primacy using f0 as a secondary cue. The findings confirm the link between speech perception and general cognitive ability, providing experimental evidence from Korean listeners.

An Acoustic Phonetic Study about Voice Imitation(2) -Focusing on Prosody Feature- (모방발화에 대한 음향음성학적 연구(2) -운율 특징을 중심으로-)

  • Park Miyoung;Park Jihye;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.56-60
    • /
    • 2003
  • The purpose of this paper is to research voice imitation. Voice imitation changes various phonetic feature. Also, in our experimental results, voice imitation has preferential prosody difference. For imitating voice, imitators change their fundamental frequency bandwidths for the most part. Imitative speakers change their high fundamental frequencies effectively while they maintain their low fundamental frequencies. Also, excellent group is distinctly superior to common group for imitating prosodic patterns. That is, the f0 bandwidth's change and the prosodic patterns are significant in imitating voice. But the low f0 is maintain by all speakers.

  • PDF

Robust Speech Recognition Algorithm of Voice Activated Powered Wheelchair for Severely Disabled Person (중증 장애우용 음성구동 휠체어를 위한 강인한 음성인식 알고리즘)

  • Suk, Soo-Young;Chung, Hyun-Yeol
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.6
    • /
    • pp.250-258
    • /
    • 2007
  • Current speech recognition technology s achieved high performance with the development of hardware devices, however it is insufficient for some applications where high reliability is required, such as voice control of powered wheelchairs for disabled persons. For the system which aims to operate powered wheelchairs safely by voice in real environment, we need to consider that non-voice commands such as user s coughing, breathing, and spark-like mechanical noise should be rejected and the wheelchair system need to recognize the speech commands affected by disability, which contains specific pronunciation speed and frequency. In this paper, we propose non-voice rejection method to perform voice/non-voice classification using both YIN based fundamental frequency(F0) extraction and reliability in preprocessing. We adopted a multi-template dictionary and acoustic modeling based speaker adaptation to cope with the pronunciation variation of inarticulately uttered speech. From the recognition tests conducted with the data collected in real environment, proposed YIN based fundamental extraction showed recall-precision rate of 95.1% better than that of 62% by cepstrum based method. Recognition test by a new system applied with multi-template dictionary and MAP adaptation also showed much higher accuracy of 99.5% than that of 78.6% by baseline system.

Perception of lenis and aspirated stops in Seoul Korean by younger and older male and female listeners (한국어 서울 방언의 평음과 격음 변별 지각에서 연령과 성별에 따른 차이)

  • Kim, Jeahong;Kim, Soan;Ahn, Joohee;Nam, Kichun;Choi, Jiyoun
    • Phonetics and Speech Sciences
    • /
    • v.12 no.4
    • /
    • pp.1-8
    • /
    • 2020
  • Traditionally it has been understood that the aspirated and lenis stops in Seoul Korean are distinguished primarily by voice onset time (VOT) and secondarily by other cues such as the fundamental frequency (F0) of the following vowel. However, recent studies on stop production have shown that the aspirated and lenis stops are currently merging in VOT and that they are now differentiated primarily by F0. In the present study, we examined whether the currently reported change in the production domain would be also found in the perception domain. To this end, an auditory identification task was conducted using speech materials of varying VOT and F0 values with young and older male and female Seoul listeners. Results revealed that all listener groups used both VOT and F0 to distinguish the lenis vs. aspirated stops but they used the F0 cue more reliably than the VOT cue in discriminating the stop contrast. The effects of gender and age were found only in the VOT cue (i.e., not in the F0 cue), with the greatest VOT cue weight in older males and the smallest in young females, which is in line with recent production studies.

The Acoustic Characteristics of Focus Associated with the Korean Particle' -man' (한국어 특수조사 ‘-만’에 연계된 초점의 음향음성학적 특성)

  • Choe, J.W.;Jeon, Y.S.;C., Y.;Park, S.B.;Kim, K.H.
    • Speech Sciences
    • /
    • v.5 no.2
    • /
    • pp.77-91
    • /
    • 1999
  • The purpose of this paper is to investigate the phonetic characteristics of the 'focus' phrases associated with the particle '-man' in Korean. The particle '-man' is a bound morpheme which, like other postpositions such as the subject marker '-ka' and the object marker '-lil', the so-called 'case markers' in Korean, typically attaches to a noun (phrase). The semantics of '-man' roughly corresponds to that of only, its counterpart in English, and is thus classified as a 'delimiter' (Yang 1973). It is assumed in this paper that '-man', like only in English, should have a 'focus' associated with it (von Stechow 1991, Rooth 1992). In general, '-man' attached phrases get the focus, but sometimes the association is not clear-cut, especially in the cases of emphatic use of '-man' or when the context strongly favors other phrase as the focus (Choe 1996). In this paper, we compare the phonetic characteristics of the '-man' marked phrases with those to which '-ka'/'-lil' is attached, and conclude that the focused '-man' phrases show higher fundamental frequencies than their equally focused 'case' -marked counterparts. However, when the context clearly forces the focus to fall on phrases other than the '-man' or '-ka'/'-lil' attached ones, there is no meaningful difference in fundamental frequency between the '-man' and '-ka'/'-lil' attached phrases. We also compare the phonetic characteristics of the regular use of '-man' with those of the emphatic '-man'. According to our experiments, the emphatic '-man' does not bring forth its phonetic effects, namely, higher fundamental frequencies, on the' -man' attached words or phrases but rather in various other ways such as higher fundamental frequencies in '-man', lengthening of the following word-initial syllable, or the inclusion of the following word in the same accentual phrase. Finally, it is claimed that '-man' associated focus phenomena, especially the emphatic use of '-man', show some typical acoustic characteristics of the other well-known focus phenomena, namely, wh-interrogatives.

  • PDF

Analysis of Japanese EEL Learners English Intonation - Japanese and English Compounds -

  • Taniguchi, Masaki
    • Proceedings of the KSPS conference
    • /
    • 2000.07a
    • /
    • pp.88-95
    • /
    • 2000
  • This paper attempts to investigate characteristic features of Japanese EFL learners' English intonation and how their Japanese accents are affecting their English intonation, focusing on a comparison between the accent patterns of Japanese compounds and the stress patterns of English compounds. It is based on research dedicated to helping to improve the teaching and learning of English intonation (prosody) for Japanese EFL learners. It examines the Fundamental Frequency (henceforth Fx) contours of two EFL college students, one specializing in English and the other in Japanese. Both of them may be considered upper intermediate EFL students with their TOEFL (Test of English as a Foreign Language) scores ranging between 500 and 550.

  • PDF

PITCH EXTRACTION USING AN APROXIMATELY IDEAL LOW-PASS FILTER

  • Matsuoka, T.;Sugama, A.;Onodera, E.;Ishida, Y.
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.931-936
    • /
    • 1994
  • Although an ideal low-pass filter is not physically realizable, it can be approximated on the basis of time reversal techniques. In this paper, we describe a method to approximately implement the ideal low-pass filter and apply it to the pitch extraction system. Experimental results show that our method is effective to estimate the fundamental frequency of the speech signal.

  • PDF

An acoustic phonetic study of Korean nuclear tones (국어 핵억양의 음향음성학적 연구)

  • Lee Ho-Young
    • MALSORI
    • /
    • no.38
    • /
    • pp.25-39
    • /
    • 1999
  • Korean intonation has been investigated mainly from the point of view of impressionistic phonetics and phonology. The purpose of this paper is to investigate Korean intonation especially nuclear tones, from the point of view of experimental phonetics. Since what we hear is not always the always as what we see in fundamental frequency contours, acoustic characteristics of Korean nuclear times are first discussed Based m quantitative data similar nuclear times are compared and the relationship between the nuclear ton and sentence type is investigated The relationship between the nuclear tone and the speaker's attitude is also discussed.

  • PDF