• Title/Summary/Keyword: fundamental frequency of speech

Search Result 203, Processing Time 0.023 seconds

Variance characteristics of speaking fundamental frequency and vocal intensity depending on utterance conditions (발화조건에 따른 기본주파수 및 음성강도 변동의 특징)

  • Lee, Moo-Kyung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.111-118
    • /
    • 2012
  • The purpose of this study was to characterize and determine variances of speaking fundamental frequency and vocal intensity depending on gender and three utterance conditions (spontaneous speech, reading, and counting). A total of 65 undergraduate students (32 male students, 33 female students) attending universities in Daegu, South Korea participated in this study. The subjects were all in their 20s. This study used KayPENTAX's Visi-Pitch IV (Model 3950) to measure the variances of speaking fundamental frequency (SFF0) and vocal intensity (VI). As a result, this study came to the following conclusions. First, it was found that both males and females showed no significant difference in SFF0 and vocal intensity among three utterance conditions. Second, this study sought to analyze differences in the variances of SFF0 between males and females. As a result, it was found that females showed significantly higher levels of four measured variances (SFF0 $SD^{**}$, SFF0 $range^{***}$, Min $SFF0^{***}$ and Max $SFF0^{***}$) than males on spontaneous speech. However, it was found that there was no significant difference between males and females in SFF0 range on reading or in SFF0 SD and SFF0 range on counting. It was found that there was no significant difference between males and females in the level of measured variances of vocal intensity depending on utterance conditions. Finally, this study made a comparison and analysis on differences in the variances of SFF0 and vocal intensity among utterance conditions. As a result, it was found that all the measured variances of SFF0 in males were most significantly reduced depending upon spontaneous speech which was followed by reading and counting respectively (SFF0 SD: p<.001, SFF0 range: p<.05, Max SFF0: p<.05). Females however, show no significant difference in the measured variances of SFF0 depending upon three utterance conditions. It was also found that the measured variances of vocal intensity in females were most significantly reduced depending on spontaneous speech that was followed by reading and counting (VI SD: p<.001, VI range: p<.001, Min VI: p<.01 Max VI: p<.05), while males showed no significant difference in the measured variances of vocal intensity depending on three utterance conditions. In sum, these findings suggest that variances of SFF0 in males are affected by three utterance conditions, while variances of vocal intensity in females are affected by three utterance conditions.

F-ratio of Speaker Variability in Emotional Speech

  • Yi, So-Pae
    • Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.63-72
    • /
    • 2008
  • Various acoustic features were extracted and analyzed to estimate the inter- and intra-speaker variability of emotional speech. Tokens of vowel /a/ from sentences spoken with different modes of emotion (sadness, neutral, happiness, fear and anger) were analyzed. All of the acoustic features (fundamental frequency, spectral slope, HNR, H1-A1 and formant frequency) indicated greater contribution to inter- than intra-speaker variability across all emotions. Each acoustic feature of speech signal showed a different degree of contribution to speaker discrimination in different emotional modes. Sadness and neutral indicated greater speaker discrimination than other emotional modes (happiness, fear, anger in descending order of F-ratio). In other words, the speaker specificity was better represented in sadness and neutral than in happiness, fear and anger with any of the acoustic features.

  • PDF

Development of Integrated Speech Training Aids for Hearing Impaired (청각 장애인용 통합형 발음 훈련 기기의 개발)

  • 박상희;김동준
    • Journal of Biomedical Engineering Research
    • /
    • v.13 no.4
    • /
    • pp.275-284
    • /
    • 1992
  • Development of Integrated Speech Training Aids for Hearing Impaired In this study, a spepch lralnlng aids that can do real-time display of vocal tract shape and other speech parameters together in a single system is implemenLed and self-training program for this system is developed. To estimate vocal tract shape, speech production process is assumed to be AR model. Through LPC analysis, vocal tract shape, intensity, and log spcclrum are calculated. And, fundamental frequency and nasality are measured using vibration sensors.

  • PDF

The Analysis of Tracheoesophageal Voice after Near-Total Laryngectomy and Implantation of Provox Prosthesis (후두근전적출술과 Provox 삽입술 후 기관식도발성에 관한 연구)

  • Choi, In-Ja;Choi, Young-Soo;Kim, Jin-Hwan;Ahn, Hwoe-Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.2
    • /
    • pp.141-144
    • /
    • 2004
  • Background and Objectives : To compare acoustic, aerodynamic analysis of voice and intelligibility score in patients with near-total laryngectomy and implantation of Provox prothesis. Material and Methods : In order to evaluate the voice characteristics, acoustic, aerodynamic parameter and speech intelligibility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total bility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total laryngectomy and 10 adults normal speaker. Acoustic analysis was carried out using CSL and aerodynamic analysis was carried out using Aerophon II. Speech sample was recorded and 10 listener was scored for speech intelligibility using a percentage of words correctly identified. Results. Fundamental frequency($F_0$), intensity, jitter, shimmer, maximal phonation time(MPT), subglottic air pressure were used for parameters for voice analysis. There were no significant difference between two group except on fundamental frequency and shimmer. The fundamental frequency was higher in patients with near-total laryngectomy and shimmer was higher in patients after implantation of Provox prosthesis with total laryngectomy. In addition, speech intelligibility was no significant difference between two groups. Conclusion : This results confirm that near-total laryngectomy and implantation of Provox prosthesis provides good voice rehabilitation.

  • PDF

Pitch Extraction of Speech Signals by the Harmonics analysis (고조파 분석에 의한 음성신호의 피치 검출)

  • Kim, Kee-Hee;Choi, Jung-Ah;Bae, Myung-Jin;Ann, Sou-Guil
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1610-1614
    • /
    • 1987
  • The harmonies of the fundamental frequency in speech signal make a minute line spectrum in frequency domain. In this paper, we propose a new algorithm to detect a pitch interval in voiced sound based on the fact that the number of harmonies can represent the period of the pitch in the time domain.

  • PDF

Shimmer Change According to Fundamental Frequency Variation of Korean Normal Adults

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.143-152
    • /
    • 2003
  • The present study was performed to investigate change in shimmer according to $F_{0}$ variation precisely, and to offer suggestions for a clinical application. The analysis for the present study was done by the fundamental frequency ($F_{0}$) and shimmer measurement results of the previous 120 Korean normal adults' voice study of Pyo et al. (2002), used three vowels, /i/, /a/, /and /u/. Through the analysis of 60 female samples from the previous study, we found that $F_{0}$ of the vowels was the highest in /u/, and the lowest in /a/, but, on the contrary, shimmer was highest in /a/and lowest in /u/. Thirty of 60 subjects showed such an inverse relationship between $F_{0}$ and shimmer, as a whole. In the vowel /a/, 47 of 60 subjects showed the increased $F_{0}$ and decreased shimmer, in /i/, 32 subjects, and in /u/, 33 subjects showed the same results. The decrease in shimmer means the improvement of voice quality, so by these results, we expect to answer the question why the patients with spasmodic dysphonia can improve their voice quality with increased pitched voice production.

  • PDF

Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms (스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화)

  • Byeon, Haewon
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.9
    • /
    • pp.193-198
    • /
    • 2019
  • This study analyzed the changes of vowel formant, voice intensity, and fundamental frequency of vowels for 11 months using acoustochemical spectrogram analysis of women diagnosed with amyotrophic lateral sclerosis (ALS). The test word was a vowel /a, i, u/ and a diphthong /h + ja + da/, /h + wi + da/, and /h +ɰi+ da/. Speech data were collected through the word reading task presented on the monitor using 'Alvin' program, and the recording environment was set to 5,500 Hz for the nyquist frequency and 11,000 Hz for the sampling rate. The records were analyzed by using spectrograms to vowel formants, voice intensity, and fundamental frequency. As a result of analysis, the fundamental frequency and intensity of the ALS process were decreased and the formant slope of the diphthong was decreased rather than the formant change in the vowel. This result suggests that the vowel distortion of ALS due to disease progression is due to the decrease of tongue and jaw co morbidity.

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean (한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구)

  • Kwon, Soon-Il;Park, Ji-Hyung;Park, Neung-Soo
    • The KIPS Transactions:PartB
    • /
    • v.15B no.6
    • /
    • pp.595-602
    • /
    • 2008
  • The focused word of each sentence is a help in recognizing and understanding spoken Korean. To find the method of focused word spotting at spoken speech signal, we made an analysis of the average and variance of Fundamental Frequency and the average energy extracted from a focused word and the other words in a sentence by experiments with the speech data from 100 spoken sentences. The result showed that focused words have either higher relative average F0 or higher relative variances of F0 than other words. Our findings are to make a contribution to getting prosodic characteristics of spoken Korean and keyword extraction based on natural language processing.

2.4kbps Speech Coding Algorithm Using the Sinusoidal Model (정현파 모델을 이용한 2.4kbps 음성부호화 알고리즘)

  • 백성기;배건성
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.3A
    • /
    • pp.196-204
    • /
    • 2002
  • The Sinusoidal Transform Coding(STC) is a vocoding scheme based on a sinusoidal model of a speech signal. The low bit-rate speech coding based on sinusoidal model is a method that models and synthesizes speech with fundamental frequency and its harmonic elements, spectral envelope and phase in the frequency region. In this paper, we propose the 2.4kbps low-rate speech coding algorithm using the sinusoidal model of a speech signal. In the proposed coder, the pitch frequency is estimated by choosing the frequency that makes least mean squared error between synthetic speech with all spectrum peaks and speech synthesized with chosen frequency and its harmonics. The spectral envelope is estimated using SEEVOC(Spectral Envelope Estimation VOCoder) algorithm and the discrete all-pole model. The phase information is obtained using the time of pitch pulse occurrence, i.e., the onset time, as well as the phase of the vocal tract system. Experimental results show that the synthetic speech preserves both the formant and phase information of the original speech very well. The performance of the coder has been evaluated in terms of the MOS test based on informal listening tests, and it achieved over the MOS score of 3.1.

Characteristics of Connected Speech in ADSD (내전형 연축성 발성장애의 연속 발화 특성)

  • Hwang, Yon-Shin;Kim, Jae-Ok;Choi, Hong-Shik
    • Phonetics and Speech Sciences
    • /
    • v.1 no.1
    • /
    • pp.93-98
    • /
    • 2009
  • The aim of this study was to investigate voice characteristics of adductive spasmodic dysphonia(ADSD) by measuring electroglottal and acoustic examination at the sentence level. The clinical records of 86 ADSD female patients (age group of $20{\sim}50$ years) and the control records of 86 normal females (age group of $20{\sim}40$ years) were recorded by speech studio(Laryngograph Ltd., UK). An independent t-test was used to compare ADSD and normal group. Results were as follows. (1) Fundamental frequency($F_0$) was significantly decreased in ADSD compared with normal group. (2) Irregularity of frequency and closed quotient(CQ) was significantly increased in ADSD compared with normal group. (3) Voiceless duration increased and voiced duration was significantly decreased in ADSD compared with normal group. (4) Fricative duration was increased in ADSD compared with normal group but it wasn't significant. In conclusion, strained, tight and choked voice shows an increase of CQ, tremor voice shows an increase of irregularity of frequency and less feminine voice shows decrease of $F_0$. Increase of voiceless duration and fricative duration and decrease of voiced duration related with diminution speech intelligibility.

  • PDF