• Title/Summary/Keyword: formant characteristics


Voice Similarities between Sisters

  • Ko, Do-Heung
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.43-50
    • /
    • 2001
  • This paper deals with voice similarities between sisters, who are presumed to share physiological characteristics inherited from the same biological mother. Nine pairs of sisters who are believed to have similar voices participated in this experiment. The speech samples obtained from one pair of sisters were excluded from the analysis because their perceptual score was relatively low. The words were measured both in isolation and in context, and the subjects were asked to read the text five times with an interval of about three seconds between readings. Recordings were made at natural speed in a quiet room. Pitch and formant frequencies were analyzed using CSL (Computerized Speech Lab) and PCQuirer. The data for initial vowels were found to be much more similar and homogeneous than those for vowels in other positions. The acoustic data showed that voice similarities are strikingly high in both pitch and formant frequencies. It is assumed that the statistical data obtained from this experiment can be used as a guideline for modelling speaker identification and speaker verification.


Characteristics of Speech Intelligibility and the Vowel Space in Patients with Parkinson's disease (파킨슨병 환자의 말 명료도와 모음 공간 특성)

  • Shim, Hee-Jeong;Park, Won-Kyoung;Ko, Do-Heung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.161-169
    • /
    • 2012
  • The purpose of this study was to investigate the characteristics of speech intelligibility of spontaneous speech and the vowel space parameters in patients with Parkinson's disease (PD). Ten PD patients (M=5, F=5) and a corresponding control group of ten normal adults participated in this study. First, subjects were asked to tell a story about their hometown and youth in order to analyze speech intelligibility. Second, the subjects were asked to repeat four vowels (/a/, /i/, /u/, /e/) five times in order to compare their vowel spaces. The results were as follows: (1) The speech intelligibility of the PD group was lower than that of the control group. (2) Four parameters, namely vowel area, vowel articulatory index, formant centralization ratio, and F2i/F1u ratio, differed significantly between the groups. For instance, vowel area and F2 ratio were wider and higher, respectively. As a result, a decrease in speech intelligibility of patients with PD is likely to show different types of errors from the normal group. The results of this research are meaningful in the sense that they could provide an objective standard for speech intelligibility and vowel space parameters.
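
The four vowel space parameters named in this abstract can be illustrated with a minimal Python sketch. It uses the definitions commonly cited in the dysarthria literature: FCR = (F2u + F2a + F1i + F1u) / (F2i + F1a), VAI as its reciprocal, and the shoelace formula for the area of the /i, e, a, u/ quadrilateral. The abstract's "F2i/F1u" is taken here to be the usual F2i/F2u ratio. These are assumptions, as are the function names and the formant values, which are invented for illustration and are not data from the paper.

```python
def polygon_area(points):
    """Shoelace formula; points are corners listed in order around the polygon."""
    total = 0.0
    n = len(points)
    for k in range(n):
        x1, y1 = points[k]
        x2, y2 = points[(k + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def vowel_space_metrics(formants):
    """formants maps each of 'a', 'i', 'u', 'e' to a (F1, F2) pair in Hz."""
    f1 = {v: formants[v][0] for v in formants}
    f2 = {v: formants[v][1] for v in formants}
    # Corners in the (F2, F1) plane, ordered around the quadrilateral.
    corners = [(f2[v], f1[v]) for v in ("i", "e", "a", "u")]
    fcr = (f2["u"] + f2["a"] + f1["i"] + f1["u"]) / (f2["i"] + f1["a"])
    return {
        "vowel_area_hz2": polygon_area(corners),
        "fcr": fcr,               # rises as vowels centralize
        "vai": 1.0 / fcr,         # falls as vowels centralize
        "f2i_f2u": f2["i"] / f2["u"],
    }


# Hypothetical formant values (Hz), for illustration only.
example = {"a": (850, 1300), "i": (300, 2300), "u": (350, 900), "e": (500, 2000)}
metrics = vowel_space_metrics(example)
print(metrics)
```

Because FCR and VAI are reciprocals, reporting both mainly helps comparison with studies that use one or the other.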

Vocal Characteristics and Differences in Gender and Voice Classification among Classical Singers (성악가의 성별 및 성종에 따른 발성적 특징과 차이)

  • Nam, Do-Hyun;Kim, Wha-Soak
    • Phonetics and Speech Sciences
    • /
    • v.1 no.2
    • /
    • pp.163-171
    • /
    • 2009
  • This study investigated vocal characteristics and differences by gender and voice classification among classical singers. Twenty-three female singers (M = 23.1 yrs, SD = 3.6 yrs, average 6.3 yrs singing experience, all classified as sopranos) and twenty male singers (M = 25.2 yrs, SD = 3.6 yrs, average 6.3 yrs singing experience, 8 tenors, 12 baritones) were recruited to participate in the present study. Speaking fundamental frequency (F0), closed quotient (CQ), maximum phonation time (MPT), breathing types, maximum inspiratory pressure (MIP), maximum expiratory pressure (MEP), and singers' formants were measured. In addition, vibratory patterns were observed using stroboscopy. Speaking F0, singing CQ, breathing types, formant frequency in singers' formants, MIP, MEP, and MPT differed significantly between genders. Generally, singers' formants were observed in male singers, and the pattern of singers' formants differed between tenors and baritones. Singing CQ values were lower than speaking CQ values in the female singers (P<.001). Furthermore, MEP, MIP, and singing CQ were significantly lower for female singers than for male singers (P<.001). MPT and speaking F0, however, were not significantly different between tenors and baritones.


Energy-Dependent Preemphasis for Speech Signal Preprocessing (음성신호 전처리를 위한 에너지 의존 프리엠퍼시스)

  • Kim, Dong-Jun;Park, Sang-Hui
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.3
    • /
    • pp.18-25
    • /
    • 1997
  • This study describes a modified preemphasis formula, which we call energy-dependent preemphasis (EDP). It uses the normalized short-term energy of the speech signal, under the assumption that the source characteristics of the glottal pulses and the radiation characteristics of the lips are approximately proportional to the energy of the speech signal. Using this method, speech analyses such as AR spectrum estimation and formant detection are performed on the nonstationary onsets of 5 Korean single vowels. The results are compared with two conventional preemphasis methods. We found that the proposed preemphasis gave enhanced spectral shapes and more accurate formant frequencies, and avoided the overlapping of two adjacent formants.
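
The abstract does not state the EDP formula, so the following Python sketch only illustrates the general idea. It shows a conventional first-order preemphasis filter, y[n] = x[n] - a·x[n-1], next to a hypothetical variant in which the coefficient a is scaled by the normalized short-term energy of the current frame. The linear energy-to-coefficient mapping, the frame length, and all function names are assumptions made for illustration, not the authors' method.

```python
def fixed_preemphasis(x, a=0.95):
    """Conventional first-order preemphasis with a constant coefficient."""
    y, prev = [], 0.0
    for s in x:
        y.append(s - a * prev)
        prev = s
    return y


def short_term_energy(x, frame_len):
    """Sum of squared samples in each non-overlapping frame."""
    return [sum(s * s for s in x[i:i + frame_len])
            for i in range(0, len(x), frame_len)]


def energy_dependent_preemphasis(x, frame_len=4, a_max=0.95):
    """Hypothetical variant: coefficient follows normalized frame energy."""
    energies = short_term_energy(x, frame_len)
    peak = max(energies, default=0.0) or 1.0  # avoid division by zero
    y, prev = [], 0.0
    for n, s in enumerate(x):
        a = a_max * energies[n // frame_len] / peak  # assumed linear mapping
        y.append(s - a * prev)
        prev = s
    return y


# Toy signal: a quiet frame followed by a loud frame.
signal = [0.0, 0.1, 0.1, 0.0, 1.0, 1.0, 1.0, 1.0]
fixed = fixed_preemphasis(signal)
adaptive = energy_dependent_preemphasis(signal)
print(fixed)
print(adaptive)
```

In this toy signal the loud frame receives the full coefficient, while the quiet frame is left almost unfiltered. That is one plausible reading of "energy-dependent"; the paper itself should be consulted for the actual formula.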


A Study on Correlation between Sasang Constitution and Speech Features (사상체질과 음성특징과의 상관관계 연구)

  • Kwon, Chul-Hong;Kim, Jong-Yeol;Kim, Keun-Ho;Han, Sung-Man
    • Journal of Haehwa Medicine
    • /
    • v.19 no.2
    • /
    • pp.219-228
    • /
    • 2011
  • Objective: Sasang constitutional medicine uses voice characteristics to diagnose a person's constitution. In this paper we propose methods to analyze Sasang constitution using speech information technology. That is, this study aims to establish the relationship between Sasang constitutions and their corresponding voice characteristics by investigating various speech variables. Materials & Methods: Voice recordings were obtained from 1,406 speakers whose constitutions had already been diagnosed by experts in the field. A total of 144 speech features obtained from five vowels and a sentence are used. The features include pitch, intensity, formant, bandwidth, MDVP, and MFCC related variables for each constitution. We analyze the speech variables and test whether there are statistically significant differences among the three constitutions. Results: The main speech variables classifying the three constitutions are related to pitch and MFCCs for males, and to formants and MFCCs for females. The correct decision rate is 73.7% for male Soeumin, 63.3% for male Soyangin, 57.3% for male Taeumin, 74.0% for female Soeumin, 75.6% for female Soyangin, 94.3% for female Taeumin, and 73.0% on average. Conclusion: Experimental results show a statistically significant correlation between some speech variables and the constitutions.

The identification of /l/ in Spanish and French

  • Jorge A. Gurlekian;Benoit Jacques;Miguelina Guirao
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.521-528
    • /
    • 1996
  • This presentation explores the perceptual characteristics of the lateral sound /l/ in CV syllables. In initial position, /l/ has well-marked formant transitions, which raises two questions: 1) Do these formant structures depend on the following vowel? 2) Do the formant transitions give an additional cue for identification? Considering that the French vocalic system has a greater variety of vowels than Spanish, several experiments were designed to verify to what extent a wider range of vocalic timbres contributes to the perception of /l/. Natural productions of /l/ in Argentine Spanish and Canadian French CV syllables were recorded, where V was successively /i, e, a, o, u/ for Spanish and /i, e, ɛ, a, ɑ, o, u, y, ø/ for French. For each item, the segment C was kept and V was replaced, by cutting and splicing, with each of the remaining vowels without transitions. Results of the identification tests for Spanish show that natural /l/ segments with low F1 and high F3 and F4 can be clearly identified in the /i, e, u/ vowel contexts without transitions. For French subjects, the combination of /l/ with a vowel without transitions yielded correct identifications for its own original vowel context in /e, ɛ, y, ø/. For both languages, in all these combinations, F1 values remained rather steady along the syllable. In the case of /o, u/, the F2 difference very likely led to a variety of perceptions of the original /l/. For example, in /lu/, French subjects reported some identifications of /l/ as a vowel, mainly /y/. Our observations reinforce the importance of F1 as a relevant cue for /l/, and the influence of the relative distance between the formant frequencies of both components.


How to Express Emotion: Role of Prosody and Voice Quality Parameters (감정 표현 방법: 운율과 음질의 역할)

  • Lee, Sang-Min;Lee, Ho-Joon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.11
    • /
    • pp.159-166
    • /
    • 2014
  • In this paper, we examine the role of emotional acoustic cues, including both prosody and voice quality parameters, in modifying the sense of a word. To extract the prosody and voice quality parameters, we used 60 pieces of speech data spoken by six speakers in five different emotional states. We analyzed eight different emotional acoustic cues and used discriminant analysis to find the dominant sequence of acoustic cues. As a result, we found that anger is closely related to intensity level and 2nd formant bandwidth range; joy is related to the positions of the 2nd and 3rd formants and to intensity level; sadness is strongly related only to prosody cues such as intensity level and pitch level; and fear is related to pitch level and to the 2nd formant value and its bandwidth range. These findings can be used as a guideline for fine-tuning an emotional spoken language generation system, because these distinct sequences of acoustic cues reveal the subtle characteristics of each emotional state.

Characteristics of 2 to 4 year old Korean children's production of monophthongs and diphthongs (만 2-4세 한국 아동의 단모음과 이중모음 산출 특징)

  • Song, Inmi;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.65-74
    • /
    • 2018
  • The purpose of this study is to investigate age-specific features of 2;1- to 4;1-year-olds' production of monophthongs and diphthongs through both auditory perceptual analysis and acoustic analysis. Test material included {vowel+'da'} items consisting of 7 monophthongs and 10 diphthongs, and meaningful words beginning with vowels. The percentage of correct vowels was used for perceptual analysis, and Praat (5.2.12) was used for acoustic analysis of variables related to monophthongs and diphthongs. The results of this study are as follows: First, perceptual analysis showed that children aged 2;1 to 2;8 years differed significantly in the accuracy of both monophthongs and diphthongs from those aged 2;9 to 3;4 years and those aged 3;5 to 4;1 years. Second, acoustic analysis showed that the formants (F1 and F2) of monophthongs generally tended to decrease as age increased. In terms of F2 differentiation slope and regression slope, which were diphthong-related variables, the age group of 3;5 to 4;1 years showed a large general slope change.

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers - Possibility of a Second Singer's Formant-

  • Jin, Sung-Min;Kwon, Young-Kyung;Song, Yun-Kyung
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.99-109
    • /
    • 2003
  • The purpose of this study was to analyze and compare acoustically the head register and chest register of singers. Fifteen healthy tenor-major students participated. Fifteen healthy untrained adults were chosen as the control group for this study. Long-term average (LTA) power spectra using the fast Fourier transform (FFT) algorithm and linear predictive coding (LPC) filter responses were obtained from /a/ sustained in both head (G4, 392 Hz) and chest (C3, 131 Hz) registers. Statistical analysis was performed using the Mann-Whitney test. In the LTA power spectrum, the head register of singers showed increased energy in the 2.2-3.4 kHz band (p<0.01) and the 7.5-8.4 kHz band (p<0.01, p<0.05). The chest register of singers showed increased energy in the 2.2-3.1 kHz band (p<0.01), the 7.8-8.4 kHz band (p<0.05), and around 9.6 kHz (p<0.01). The LTA power spectrum revealed a peak of acoustic energy around 2,500 Hz, known as the singer's formant, and another peak of acoustic energy around 8,000 Hz in the singer's voice.
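
A long-term average power spectrum of the kind described here can be sketched in Python as the mean of short-time power spectra over overlapping windowed frames. To stay self-contained, this sketch uses a naive DFT from the standard library in place of the FFT used in the study. The frame length, hop size, Hann window, and the synthetic 1 kHz test tone are all illustrative assumptions, not the paper's settings.

```python
import cmath
import math


def hann(n):
    """Symmetric Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * k / (n - 1)) for k in range(n)]


def power_spectrum(frame):
    """Power at bins 0..N/2 of a Hann-windowed frame (naive DFT, O(N^2))."""
    n = len(frame)
    windowed = [s * w for s, w in zip(frame, hann(n))]
    power = []
    for b in range(n // 2 + 1):
        acc = sum(windowed[k] * cmath.exp(-2j * math.pi * b * k / n)
                  for k in range(n))
        power.append(abs(acc) ** 2)
    return power


def ltas_db(signal, frame_len=64, hop=32):
    """Average the frame power spectra, then convert to decibels."""
    starts = range(0, len(signal) - frame_len + 1, hop)
    spectra = [power_spectrum(signal[s:s + frame_len]) for s in starts]
    if not spectra:
        raise ValueError("signal is shorter than one frame")
    mean_power = [sum(col) / len(spectra) for col in zip(*spectra)]
    return [10 * math.log10(p + 1e-12) for p in mean_power]


# Synthetic 1 kHz tone sampled at 8 kHz (illustrative only).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(512)]
ltas = ltas_db(tone)
peak_bin = max(range(len(ltas)), key=ltas.__getitem__)
peak_hz = peak_bin * sample_rate / 64
print(peak_hz)
```

For real recordings, an FFT routine and a much longer frame would be the practical choice. Band levels such as 2.2-3.4 kHz could then be compared by averaging the bins that fall inside each band.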


Acoustic and Stroboscopic Characteristics of Normal Person's Voices with Advancing Age (연령증가에 따른 정상 노인의 음향분석학적 특징)

  • 진성민;권기환;강현국
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.8 no.1
    • /
    • pp.44-48
    • /
    • 1997
  • Anatomic and physiological changes of the larynx with advancing age result in morphologic changes of the vocal folds and reduced control of the phonatory mechanism in elderly individuals, and are reflected in increased instability of the fundamental frequency (Fo). The purpose of this study is to improve current understanding of the acoustic and stroboscopic characteristics of normal elderly persons' voices. First, phonated /a/ vowel productions by 40 normal adults (20 to 40 years, 20 men and 20 women) and 40 normal elderly persons (60 to 80 years, 20 men and 20 women) were analyzed using CSL (model 4300B) acoustic analysis software to obtain acoustic measures related to fundamental frequency stability and vocal resonance characteristics. Second, stroboscopic images of vocal fold behavior in all subjects were analyzed by experienced specialists. In the men, fundamental frequency variation (vFo) (p<0.01), jitter (p<0.05), and shimmer (p<0.05) for the older group were significantly higher than the values for the adult group. In the stroboscopic findings, vocal fold edema was a significant finding in aged men (15%). In the women, vFo (p<0.05), jitter (p<0.05), and noise-to-harmonic ratio (NHR) (p<0.05) for the older group were significantly higher than the values for the adult group, and first formant frequency (F1) (p<0.01) and second formant frequency (F2) (p<0.01) for the older group were significantly lower than the values for the adult group. In the stroboscopic findings, vocal fold atrophy was a significant finding in aged women (25%). Frequency stability, as reflected by vFo, jitter, shimmer, and NHR, decreases with advancing age in men and women, and spectral analysis of phonated /a/ vowel productions reveals a lowering of F1 and F2 with advancing age, especially in aged women. Change in the mass of the vocal folds, due to atrophy or edema, is considered to be the greatest factor in these acoustic changes.
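
The stability measures named in this abstract can be illustrated with common textbook definitions. Local jitter is the mean absolute difference between consecutive glottal periods divided by the mean period. Local shimmer is the same ratio applied to cycle peak amplitudes. Fo variation is the standard deviation of Fo divided by its mean. The Python sketch below implements these definitions. Whether CSL computes its parameters in exactly this way is an assumption, and the period and amplitude values are invented for illustration.

```python
import statistics


def local_perturbation_percent(values):
    """Mean absolute cycle-to-cycle difference over the mean value, in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * (sum(diffs) / len(diffs)) / (sum(values) / len(values))


def jitter_percent(periods_s):
    """Local jitter from a list of glottal periods in seconds."""
    return local_perturbation_percent(periods_s)


def shimmer_percent(peak_amplitudes):
    """Local shimmer from a list of per-cycle peak amplitudes."""
    return local_perturbation_percent(peak_amplitudes)


def f0_variation_percent(periods_s):
    """Coefficient of variation of Fo (population standard deviation / mean)."""
    f0 = [1.0 / p for p in periods_s]
    return 100.0 * statistics.pstdev(f0) / statistics.mean(f0)


# Hypothetical cycles: periods alternate between 10 ms and 11 ms.
periods = [0.010, 0.011, 0.010, 0.011]
amplitudes = [1.0, 0.9, 1.0, 0.9]
print(jitter_percent(periods), shimmer_percent(amplitudes),
      f0_variation_percent(periods))
```

A perfectly periodic voice would give 0% on all three measures. Higher values correspond to the reduced frequency stability that the abstract reports for the older groups.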
