• Title/Summary/Keyword: fundamental frequency of speech

Search Result 205, Processing Time 0.024 seconds

A Phonetic Analysis of Yodel Singing by the Electroglottographic(EGG) Measurement (요들송에 대한 전기성문파형검사(EGG)를 이용한 발성학적 접근)

  • Suh, D.;Choi, H.S.
    • Speech Sciences
    • /
    • v.7 no.2
    • /
    • pp.113-126
    • /
    • 2000
  • A comparative phonetic analysis of Yodel singing and Belcanto singing by the electroglottographic(EGG) measurement was done in three singers. One professional tenor singer(SDI) who is also well trained in Yodel singing, another yodler(KWS) who is not so trained in Belcanto singing, and the other training tenor singer(CSK) who is not well trained both yodel and Belcanto singing. Closed quotient(CQ), speed quotient(SQ) and fundamental frequency (F0) at the initial modal part(I) , middle falsetto part(M), and final modal part(F) of the same phrase were measured by EGG machine and program(Kay model 4338). In the middle part, not only CQ but also SQ of the Yodel singing were much smaller than that of Belcanto singing in all three singers. However, accuracy of parameters in Belcanto singing of the yodler(KWS) and both Yodel singing and Belcanto singing of the training singer(CSK) were inferior to that of trained tenor singer(SDI). Possible advantages of utilizing Yodel singing training under the guidance of feedback control by the EGG for hyperfunctional voice disorders such as vocal nodules were discussed.

  • PDF

An Acoustical Study of Korean Diphthongs (한국어 이중모음의 음향학적 연구)

  • Yang Byeong-Gon
    • MALSORI
    • /
    • no.25_26
    • /
    • pp.3-26
    • /
    • 1993
  • The goals of the present study were (3) to collect and analyze sets of fundamental frequency (F0) and formant frequency (F1, F2, F3) data of Korean diphthongs from ten linguistically homogeneous speakers of Korean males, and (2) to make a comparative study of Korean monophthongs and diphthongs. Various definitions, kinds, and previous studies of diphthongs were examined in the introduction. Procedures for screening subjects to form a linguistically homogeneous group, time point selection and formant determination were explained in the following section. The principal findings were as follows: 1. Much variation was observed in the ongliding part of diphthongs. 2. F2 values of (j) group descended while those of [w] group ascended, 3. The average duration of diphthongs were about 110 msec, and there was not much variation between speakers and diphthongs. 4. In a comparative study of monophthongs and diphthongs, Fl and F2 values of the same offgliding part at the third time point almost converged. 5. The gliding of diphthongs was very short beginning from the h-noise. Perceptual studies using speech synthesis are desirable to find major parameters for diphthongs. The results of the present study wi11 be useful in the area of automated speech recognition and computer synthesis of speech.

  • PDF

The Production and Perception of Focus in English Yes- No Questions (영어 가부 의문문 초점 발화와 지각)

  • Jeon, Yoon-Shil;Oh, Sei-Poong;Kim, Kee-Ho
    • Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.111-128
    • /
    • 2004
  • In English, a focused word with new information receives a pitch accent. This paper examines how English native speakers and Korean speakers produce and perceive focus in English yes-no questions. The production experiments show that native speakers realize an appropriate intonation of yes-no questions, in which a focused word has a low pitch accent followed by a high phrasal accent and a high boundary tone. However, Korean speakers usually give a high tone to a focused word. In a like manner, the perception experiments show that English native speakers judge a word with a low tone to be focused, while Korean speakers have difficulty in comprehending a focused word realized as a low tone. And it is found that Korean speakers tend to perceive low tones on sentence initial and final focused words better than those on sentence medial focused words, and they often perceive a word with a relatively high fundamental frequency or a sharp rise of fundamental frequency as a focused word. This paper shows that Korean speakers have trouble to produce and perceive an appropriate tonal pattern of a focused yes-no question, and that can cause confusion in a conversation with native speakers.

  • PDF

Aerodynamic Characteristics, Vocal Efficiency, and Closed Quotient Differences according to Fundamental Frequency Fixation (음도 고정 유무에 따른 공기역학, 음성효율성 및 성대접촉률 차이)

  • Kim, Jaeock
    • Phonetics and Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.19-26
    • /
    • 2013
  • The aerodynamic characteristics (subglottal pressure (Ps) and mean airflow rate (MFR)), fundamental frequency (Fo), intensity (I), vocal efficiency (VE), and closed quotient (CQ) were compared during a sustained vowel /o/ sound under three conditions: in a comfortable loudness and pitch level (condition 1), in a maximum loudness level with a fixed pitch (condition 2), and in a maximum loudness level without a fixed pitch (condition 3). Also, multiple regression analyses were done to measure the aerodynamic characteristics affect on the VE and the CQ in each condition. The results showed the Fo, Ps, MFR, VE, and CQ increased as I increased with and without fixed pitch. Most notably, VE in condition 3 was the highest of all the conditions, but CQ was not very high. By the results of multiple regression analysis, VE was significantly affected by I and Ps in all conditions; Fo was the other main key for affecting VE in high pitch. However, none of the aerodynamic characteristics significantly affected CQ. As I increases, Fo should be increased by increasing Ps and VE. Therefore, researchers should consider and specify an a priori to Fo, Ps, and I when measuring VE to examine the complex and delicate vocal mechanism.

PRAAT Software: A Spech Interaction Tool to Analyze Teacher Voices (PRAAT 소프트웨어: 교사 목소리 분석을 위한 맞춤법 상호작용 도구)

  • Kidd, Ella Jane
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.9
    • /
    • pp.158-165
    • /
    • 2019
  • Through the use of speech software technology, this paper examines the effects of voice interactions within the inner circle of English. The fundamental frequency (F0) was obtained by analyzing native speakers (aged 30-55) speech effects based on nationality, age, and gender. The findings within this study reveal that the Caucasian British female (age 33) and the Caucasian American male (age 55) produced the most interactive speech. The contributing factor is the students' experience with various language styles throughout their language acquisition studies. The results of this study are compatible with $Traunm{\ddot{u}}eller$ & Eriksson (1995) and previous studies which agree that continuous speech above average is paramount towards student engagement and interactions.

A Study of Extracting Acoustic Parameters for Individual Speakers (개별화자의 음성파라미터 추출에 관한 연구: 음성파라미터의 상관관계를 중심으로)

  • Ko, Do-Heung
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.129-143
    • /
    • 2003
  • Fundamental frequency (Fo), jitter, shimmer, and harmonics-to-noise ratio (NHR) have been measured to see their interactions between the parameters using Multi-Dimensional Voice Program (MDVP). 100 Korean normal adults (50 males and 50 females) ranging from their early 20's to their early 30's produced the eight sustained vowels including /a/, /i/, /u/, /c/, /e/,/$\varepsilon$/, /i/, and /e/. The subjects were asked to read the above vowels five times in isolation with the interval of five seconds, respectively. Male voices, on the average, showed 130.7 Hz in Fo, 0.6696% in jitter, 1.8151% in shimmer, and 0.12 in NHR, while female voices showed 232.8 Hz in Fo, 0.9222% in jitter, 1.9199% in shimmer, and 0.1098 in NHR. As to the correlation coefficient, it was found that for male speakers jitter vs. shimmer, shimmer vs. NHR, Fo vs. shimmer, and Fo vs. NHR are statistically significant. It was found that for female subjects jitter vs. shimmer and Fo vs. shimmer are statistically significant. However, it is concluded that the correlation coefficient in females are not meaningful in a practical way though they are all statistically significant.

  • PDF

Differences in Speaking Fundamental Frequency for Voice Classification and Closed Quotient between Speaking and Singing (성종에 따른 발화 기본주파수와 발화 및 성악발성 시 성대접촉률의 차이 비교)

  • Nam, Do-Hyun;Choi, Hong-Shik
    • Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.147-157
    • /
    • 2008
  • Habitual speaking fundamental frequency (sF0) plays an important role in determining the voice classification, which can be presented differently depending on the vocal fold length and language habits. The purpose of this study, therefore, was to compare the differences in sF0 for voice classification and closed quotient between speaking and singing. Seventeen singers (7 sopranos, 5 tenors, 5 baritones, mean age 25.1 years) with no evidence of vocal folds pathology were participated. sF0 and closed quotient (CQ) both in speaking and in singing (A3-A5 with soprano, A2-A4 with tenor and baritone) were measured using SPEAD program and electroglottography. No significant differences were observed for sF0 between tenor and baritone groups (p> 0.05). However, CQ in singing was significantly different among three groups (p< 0.05), but CQ in speaking was not (p> 0.05). Furthermore, CQ was significantly different with both soprano (p< 0.01) and tenor groups ((P= 0.02) whereas baritone group revealed there is no difference when compared between speaking and singing. No significant differences in sF0 between tenor and baritone participants may result from decision-making for voice classification by experience and should measure sF0 before determining the voice classification.

  • PDF

The f0 distribution of Korean speakers in a spontaneous speech corpus

  • Yang, Byunggon
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.31-37
    • /
    • 2021
  • The fundamental frequency, or f0, is an important acoustic measure in the prosody of human speech. The current study examined the f0 distribution of a corpus of spontaneous speech in order to provide normative data for Korean speakers. The corpus consists of 40 speakers talking freely about their daily activities and their personal views. Praat scripts were created to collect f0 values, and a majority of obvious errors were corrected manually by watching and listening to the f0 contour on a narrow-band spectrogram. Statistical analyses of the f0 distribution were conducted using R. The results showed that the f0 values of all the Korean speakers were right-skewed, with a pointy distribution. The speakers produced spontaneous speech within a frequency range of 274 Hz (from 65 Hz to 339 Hz), excluding statistical outliers. The mode of the total f0 data was 102 Hz. The female f0 range, with a bimodal distribution, appeared wider than that of the male group. Regression analyses based on age and f0 values yielded negligible R-squared values. As the mode of an individual speaker could be predicted from the median, either the median or mode could serve as a good reference for the individual f0 range. Finally, an analysis of the continuous f0 points of intonational phrases revealed that the initial and final segments of the phrases yielded several f0 measurement errors. From these results, we conclude that an examination of a spontaneous speech corpus can provide linguists with useful measures to generalize acoustic properties of f0 variability in a language by an individual or groups. Further studies would be desirable of the use of statistical measures to secure reliable f0 values of individual speakers.

A Study on the Phonetic Parameters Used on the Voice Imitation (모방의 대상이 되는 음성적 특성에 관한 연구)

  • Park Jihye;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.187-190
    • /
    • 2003
  • The purpose of this paper is to research the phonetic parameters used on the voice imitation. First of all, the fundamental frequency is imitated effectively. Distinctive prosodic patterns are used repeatedly on the voice imitation. Speaking rate is used in special measure in case the target speaker has extraordinary speaking rate. Also formant frequency is imitated variously. In sum, distinctive characteristics perceived by listener are used on voice imitation.

  • PDF

A Production and Perception Experiment of Korean Alveolar Fricatives

  • Yoon, Kyu-Chul
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.169-184
    • /
    • 2002
  • Korean has two types of voiceless alveolar fricatives: a non-tense fricative /$S^{h}$ and a tense fricative /s'/. Twenty native speakers of Korean produced five pairs of isolated words containing word initial $S^{h}V$ and /s'V/ sequences where V was any one of five (/a, e, i, o, u/) of Korean vowels. Acoustic measures such as duration, fricative noise prominent frequency, energy change of following vowel, and fundamental frequency at vowel onset were examined. Results showed that among the parameters, aspiration noise duration of /s'/ in mid and low vowel contexts was less than 21 ms. In a perception experiment, where only the aspiration noise interval of the /$S^{h}$/ tokens was incrementally reduced, some listeners shifted perception from /$S^{h}$/ to /s'/.

  • PDF