• Title/Summary/Keyword: fundamental frequency($f_0$)


The First Formant Characteristics in Vocalize of One Soprano (소프라노 1인의 모음곡 발성 시 제 1 포먼트의 변화양상)

  • Song, Yun-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.10-14
    • /
    • 2005
  • Background and Objectives: Vowels are characterized by their formant patterns. The first formant (F1) is determined by the high-low placement of the tongue, and the second formant (F2) by its front-back placement. The fundamental frequency (F0) of a soprano often exceeds the normal frequency of the first formant. Vocal intensity is boosted when F0 is high and a harmonic coincides with a formant; this is called formant tuning. Experienced singers therefore learn to tune their formants over a reasonable range by lowering the tongue to maximize vocal intensity. The current study aimed to identify formant tuning in one experienced soprano by comparing the first formants of the vowel [i] in three types of voice production: speech, ascending scale, and vocalize. Materials and Method: All voice recordings of the vowel [i] in speech, ascending scale (from F4 to A4), and vocalize ("Ridente la calma") were made with a digital audio tape recorder in a sound-treated room. The captured data were analyzed by long-term average (LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B). Results: The first formant of the vowel [i] was 238Hz in speech. In the ascending scale it was 377Hz, 405Hz, and 453Hz at F4 (349Hz), G4 (392Hz), and A4 (440Hz), and 722Hz, 820Hz, and 918Hz at F5 (698Hz), G5 (784Hz), and A5 (880Hz), respectively. In vocalize, the first formants of [i] were 380Hz, 398Hz, and 453Hz at F4, G4, and A4, and 720Hz, 821Hz, and 890Hz at F5, G5, and A5, respectively. Conclusion: These results show that the first formant in the ascending scale and in vocalize stayed above the fundamental frequency at high pitch. This finding implies that the formant tuning of the vowel [i] observed in the ascending scale also occurred in vocalize.
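
The comparison between note pitch and first formant in the Results can be checked with a short calculation. The sketch below is not the authors' analysis: it assumes equal temperament with A4 = 440 Hz, and the helper `note_to_hz` is a hypothetical name introduced for illustration. The F1 values are the ascending-scale figures quoted in the abstract.

```python
# Equal-tempered note frequencies (assumption: A4 = 440 Hz) compared with the
# ascending-scale first-formant values quoted in the abstract above.

# Semitone offsets of the natural notes relative to A in the same octave.
NOTE_OFFSETS = {"C": -9, "D": -7, "E": -5, "F": -4, "G": -2, "A": 0, "B": 2}


def note_to_hz(name: str, a4: float = 440.0) -> float:
    """Return the equal-tempered frequency of a natural note such as 'F4'."""
    letter, octave = name[0], int(name[1:])
    semitones = NOTE_OFFSETS[letter] + 12 * (octave - 4)
    return a4 * 2 ** (semitones / 12)


# F1 (Hz) of [i] in the ascending scale, as reported in the abstract.
reported_f1 = {"F4": 377, "G4": 405, "A4": 453, "F5": 722, "G5": 820, "A5": 918}

for note, f1 in reported_f1.items():
    f0 = note_to_hz(note)
    print(f"{note}: F0 = {f0:6.1f} Hz, F1 = {f1} Hz, F1 - F0 = {f1 - f0:+.1f} Hz")
```

Under these assumptions every reported F1 lies above the corresponding note frequency, which is the pattern the Conclusion describes.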


A Basic Study on the Conversion of Sound into Color Image using both Pitch and Energy

  • Kim, Sung-Ill
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.2
    • /
    • pp.101-107
    • /
    • 2012
  • This study describes a proposed method of converting an input sound signal into a color image by emulating human synesthetic skills, which make it possible to associate a sound source with a specific color image. As a first step of sound-to-image conversion, features such as fundamental frequency (F0) and energy are extracted from the input sound source. A musical scale and an octave are then calculated from the F0 signal, so that scale, energy, and octave can be converted into the three elements of the HSI model: hue, saturation, and intensity, respectively. Finally, a color image in the BMP file format is created as the output of the HSI-to-RGB conversion. We built a basic system based on the proposed method using standard C programming. The simulation results revealed that the output BMP images created from input sound sources have diverse hues corresponding to changes in the F0 signal, where the hue elements have different intensities depending on the octave, with a minimum frequency of 20Hz. Furthermore, the output images have various levels of chroma (or saturation), which is converted directly from the energy.
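
The pipeline described above (F0 to scale and octave, then scale, energy, and octave to hue, saturation, and intensity, then HSI to RGB) can be sketched as follows. This is only an illustration and not the paper's C implementation: the 30-degree hue step per semitone, the ten-octave intensity range, the energy normalization, and all function names are assumptions. Only the 20 Hz minimum frequency and the scale/energy/octave to H/S/I assignment come from the abstract. The HSI-to-RGB step uses the standard sector formulas.

```python
import math


def f0_to_scale_octave(f0_hz, f_min=20.0):
    """Map F0 to (scale step 0..11, octave index), counting semitones from f_min."""
    semitone = int(round(12 * math.log2(f0_hz / f_min)))
    return semitone % 12, semitone // 12


def hsi_to_rgb(h_deg, s, i):
    """Standard HSI-to-RGB conversion; s and i in [0, 1], result as 8-bit values."""
    h = h_deg % 360
    sector = int(h // 120)                      # 0: RG, 1: GB, 2: BR sector
    h_rad = math.radians(h - 120 * sector)
    low = i * (1 - s)
    high = i * (1 + s * math.cos(h_rad) / math.cos(math.radians(60) - h_rad))
    mid = 3 * i - (low + high)
    r, g, b = [(high, mid, low), (low, high, mid), (mid, low, high)][sector]
    # Clamp, because bright saturated inputs can fall outside the RGB cube.
    return tuple(min(255, max(0, round(255 * c))) for c in (r, g, b))


def sound_to_rgb(f0_hz, energy, max_energy, n_octaves=10):
    """Assumed mapping: scale -> hue, energy -> saturation, octave -> intensity."""
    scale, octave = f0_to_scale_octave(f0_hz)
    hue = scale * 30.0                          # 12 scale steps over 360 degrees
    saturation = min(1.0, energy / max_energy)
    intensity = min(1.0, max(0.0, (octave + 1) / n_octaves))
    return hsi_to_rgb(hue, saturation, intensity)


# Hypothetical frame: 220 Hz at 60% of the maximum observed energy.
print(sound_to_rgb(220.0, energy=0.6, max_energy=1.0))
```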

Core Loss Analysis of Non-oriented Electrical Steel Under Magnetic Induction Including Higher Harmonics

  • Cho, Chuhyun;Son, Derac;Cho, Youk
    • Journal of Magnetics
    • /
    • v.6 no.2
    • /
    • pp.66-69
    • /
    • 2001
  • The actual magnetic induction waveform of cores in electrical machines is not sinusoidal, i.e., higher harmonics are always included. The core loss in actual electrical machines therefore differs from the core loss measured by the standard method, because the standard testing method requires a sinusoidal magnetic induction waveform. Core loss analysis under higher harmonic induction is thus important in electric machine design. In this work we measured the core loss when a hysteresis loop contains only one period of an ac minor loop at a higher harmonic frequency, as a function of the position of the ac minor loop relative to the fundamental harmonic frequency. From this experiment, the core loss $P_c(B_0, f_0, B_h, nf_0)$ under a higher harmonic magnetic induction $B_h$ could be expressed as a linear combination of the core loss at the fundamental harmonic frequency $P_c(B_0, f_0)$, the core loss of the ac minor loop in the zero induction region of the major hysteresis loop $P_{cL}(B_h, nf_0)$, and the core loss of an ac minor loop in the high induction region of the major hysteresis loop $P_{cH}(B_h, nf_0)$, i.e., $P_c(B_0, f_0, B_h, nf_0) = P_c(B_0, f_0) + (n-1)\left[k_1(B_0)\,P_{cL}(B_h, nf_0) + (1 - k_1(B_0))\,P_{cH}(B_h, nf_0)\right]$. This formula will be useful for electrical machine designers and is an effective method for predicting core loss that includes higher harmonic induction.
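
The linear-combination formula in the abstract above can be evaluated directly once its four inputs are known. The sketch below only restates that formula in code. The function name and the numeric values are hypothetical, chosen for illustration, and are not measurements from the paper.

```python
def core_loss_with_harmonic(p_c_fund, p_cl, p_ch, n, k1):
    """Core loss under an n-th harmonic, following the formula in the abstract.

    p_c_fund : core loss at the fundamental, P_c(B_0, f_0)
    p_cl     : minor-loop loss in the zero induction region, P_cL(B_h, n f_0)
    p_ch     : minor-loop loss in the high induction region, P_cH(B_h, n f_0)
    n        : harmonic order
    k1       : weighting factor k_1(B_0), expected to lie between 0 and 1
    """
    return p_c_fund + (n - 1) * (k1 * p_cl + (1 - k1) * p_ch)


# Hypothetical values in W/kg, for illustration only.
total = core_loss_with_harmonic(p_c_fund=2.0, p_cl=0.10, p_ch=0.05, n=5, k1=0.5)
print(f"core loss with 5th harmonic: {total:.3f} W/kg")
```

With n = 1 the harmonic term vanishes and the function returns the fundamental loss unchanged, which is a quick sanity check on the formula.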


F0 as a primary cue for signaling word-initial stops of Seoul Korean (서울 방언 어두 폐쇄음의 후속모음 F0)

  • Byun, Hi-Gyung
    • Phonetics and Speech Sciences
    • /
    • v.8 no.1
    • /
    • pp.25-36
    • /
    • 2016
  • Previous studies showed that the voice onset times (VOT) of aspirated and lenis stops have merged, and that post-stop fundamental frequency (F0) has emerged as the primary cue for distinguishing the two stops among younger speakers and in female speech. The purpose of this study is to demonstrate that the VOT merger of aspirated and lenis stops occurs after the F0 difference between the two stops has stabilized. In other words, unless post-stop F0, which is a redundant feature, is fully developed, the VOT merger is unlikely to happen. Females acquired a stable F0 difference in stops earlier than males. Therefore, the VOT merger could happen, and as a result, females could take the lead in shifting from VOT to F0 in initial stops. This study also shows that speakers who have acquired F0 as a primary cue make full use of F0 to distinguish lenis stops from the two other stops (aspirated and fortis).

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean (한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구)

  • Kwon, Soon-Il;Park, Ji-Hyung;Park, Neung-Soo
    • The KIPS Transactions:PartB
    • /
    • v.15B no.6
    • /
    • pp.595-602
    • /
    • 2008
  • The focused word of each sentence helps in recognizing and understanding spoken Korean. To find a method for spotting focused words in speech signals, we analyzed the average and variance of the fundamental frequency (F0) and the average energy extracted from the focused word and from the other words in a sentence, using speech data from 100 spoken sentences. The results showed that focused words have either a higher relative average F0 or a higher relative variance of F0 than other words. Our findings contribute to characterizing the prosody of spoken Korean and to keyword extraction based on natural language processing.
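
The finding above suggests a simple heuristic: compare each word's mean F0 and F0 variance with those of the whole sentence. The sketch below illustrates that idea and is not the authors' procedure. It assumes word boundaries and per-word F0 contours are already available, and the function names, the normalization by sentence-level statistics, and the sample contours are all hypothetical.

```python
from statistics import fmean, pvariance


def relative_f0_stats(words):
    """Return (word, relative mean F0, relative F0 variance) for each word.

    `words` is a list of (label, f0_contour_in_hz) pairs; values are relative
    to the mean and variance over the whole sentence (variance must be > 0).
    """
    all_f0 = [f for _, contour in words for f in contour]
    sent_mean, sent_var = fmean(all_f0), pvariance(all_f0)
    return [(w, fmean(c) / sent_mean, pvariance(c) / sent_var) for w, c in words]


def focus_candidates(words):
    """Words with the highest relative mean F0 or the highest relative variance."""
    stats = relative_f0_stats(words)
    top_mean = max(stats, key=lambda s: s[1])[0]
    top_var = max(stats, key=lambda s: s[2])[0]
    return {top_mean, top_var}


# Hypothetical three-word sentence with voiced-frame F0 values in Hz.
sentence = [
    ("word_a", [180.0, 182.0, 181.0]),
    ("word_b", [230.0, 250.0, 240.0]),
    ("word_c", [170.0, 165.0, 160.0]),
]
print(focus_candidates(sentence))
```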

A Study of Peak Finding Algorithms for the Autocorrelation Function of Speech Signal

  • So, Shin-Ae;Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young;Park, Ji Su
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.12
    • /
    • pp.131-137
    • /
    • 2016
  • In this paper, peak-finding algorithms are proposed for the autocorrelation function (ACF), which is widely used for detecting the pitch of voiced signals. It is well known that estimating the fundamental frequency (F0) of a speech signal is both important and difficult. The proposed algorithms exploit several characteristics of the ACF, such as its monotonically increasing behavior. They may therefore be able to estimate the fundamental frequency reliably and correctly as long as the autocorrelation function of the speech signal is accurate. Since the proposed algorithms may reduce computational complexity, they can be applied to real-time processing. Speech data composed of Korean emotion-expressing words are used to evaluate their performance. The pitches are measured to compare the performance of the proposed algorithms.
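
For orientation, the baseline that such algorithms refine is to pick the strongest ACF peak within a plausible lag range. The sketch below shows that generic baseline and not the peak-finding algorithms proposed in the paper. The search range of 60 to 400 Hz, the function names, and the synthetic test signal are assumptions.

```python
import math


def autocorrelation(x, max_lag):
    """Unnormalized autocorrelation of x for lags 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def estimate_f0(x, fs, f_min=60.0, f_max=400.0):
    """Estimate F0 as fs / lag of the strongest local ACF maximum in range."""
    min_lag = max(1, int(fs / f_max))
    max_lag = int(fs / f_min)
    acf = autocorrelation(x, max_lag + 1)   # one extra lag for the neighbor test
    peaks = [lag for lag in range(min_lag, max_lag + 1)
             if acf[lag - 1] < acf[lag] >= acf[lag + 1]]
    if not peaks:
        return None                         # no periodicity found (e.g. unvoiced)
    best = max(peaks, key=lambda lag: acf[lag])
    return fs / best


# Synthetic voiced-like frame: 200 Hz fundamental plus a weaker second harmonic.
fs = 8000
frame = [math.sin(2 * math.pi * 200 * t / fs)
         + 0.5 * math.sin(2 * math.pi * 400 * t / fs) for t in range(400)]
print(estimate_f0(frame, fs))
```

The frame must be longer than the largest lag searched (here `fs / f_min`, about 133 samples), otherwise the ACF at long lags has too few terms to be meaningful.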

A Study on Acoustical Properties of Soprano′s Singing (소프라노의 성악 발성에 대한 음향학적 특징 연구)

  • 임동철;문소연;이행세
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.60-64
    • /
    • 2000
  • This paper studies the relation between the fundamental frequency (F0) and the formants of simple vowels in the Korean language sung by sopranos. It is shown that, in soprano singing, the F0 of a vowel affects its formants. For this reason, the formants of simple vowels sung by sopranos must be considered across the entire soprano singing range. We recorded the five simple vowel sounds /a/, /e/, /i/, /o/, and /u/ sung by five professional sopranos from A3 (220.0Hz) to A5 (880.0Hz) in the major scale and compared the formants of the sung vowels with those of spoken vowels. We observed that F1 and F2 of sung vowels were stable at low F0 (lower than B4), but at high F0 (higher than B4) F1 and F2 lost their stability. In the case of /a/, /o/, and /u/, the slope of the F1-F2 graph was about 2.6, and those of the F0-F2 and F0-F1 graphs were 2.2-2.5 and 0.7-1.0, respectively. As F0 increased, the F1 and F2 of the sung vowels /a/, /e/, /i/, /o/, and /u/ became almost the same. At A5, the F1 and F2 of the five sung vowels had the same values. These results suggest that the relation between F0 and the formants can be used to synthesize soprano singing vowels.


The relationship between fluency levels and suprasegmentals according to the sentence types in the English read speech by Korean middle school English learners (한국 중학생의 영어 읽기 발화에서 문장유형에 따른 유창성 등급과 초분절 요소의 관계)

  • Kim, Hwa-Young
    • Phonetics and Speech Sciences
    • /
    • v.14 no.3
    • /
    • pp.51-66
    • /
    • 2022
  • This study aims to help Korean learners of English improve their pronunciation by revealing which suprasegmentals bring their reading of English sentences closer to that of native English speakers. To this end, Korean middle school English learners were selected as subjects, and research data were gathered by sentence type (declarative, interrogative, imperative, and exclamative) as well as by syllable. Among the suprasegmentals, speech rate, pause frequency, pause duration, F0 range, and rhythm were used to analyze these English sentence utterances. Mean analysis, correlation analysis, and regression analysis were performed. The results showed that speech rate, pause frequency, pause duration, and F0 range affected the evaluation of fluency levels. In the regression analysis between all suprasegmentals and fluency levels, the suprasegmentals that most affected fluency levels were speech rate and F0 range. Rhythm had no meaningful relation with fluency levels. Therefore, when teaching English pronunciation, it is necessary to teach students to increase their speech rate and F0 range. In addition, students should be trained to reduce both the number and the duration of pauses during utterance to improve their fluency. It is noteworthy that, of the four sentence types, exclamative sentences were produced with a faster speech rate, fewer pauses, shorter pause duration, and higher rhythm values.

Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech (한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리)

  • Hong, Mun-Ki;Shin, Ji-Young;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.7-25
    • /
    • 2003
  • Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration, and loudness. Pitch, which is realized by the fundamental frequency (F0), is the most important element for the quality of synthesized speech. However, the previous method for predicting F0 has some problems. If voiced and unvoiced sounds are not classified correctly, the result is a wrong pitch prediction, a wrong triphone unit when synthesizing the voiced and unvoiced sounds, and audible clicks or vibration. These errors are common at transitions from a voiced sound to an unvoiced sound or from an unvoiced sound to a voiced sound. The problem cannot be resolved by grammar-based methods, and it strongly affects the synthesized sound. Therefore, to obtain correct pitch values consistently, in this paper we propose a new model for predicting and classifying voiced and unvoiced sounds using the CART tool.


Acoustic Characteristics of Vowels in Korean Distant-Talking Speech (한국어 원거리 음성의 모음의 음향적 특성)

  • Lee Sook-hyang;Kim Sunhee
    • MALSORI
    • /
    • v.55
    • /
    • pp.61-76
    • /
    • 2005
  • This paper aims to analyze the acoustic characteristics of vowels produced in a distant-talking environment. The analysis was performed using a statistical method. The influence of gender and speaker on the variation was also examined. The speech data used in this study consist of 500 distant-talking words and 500 normal words from 10 speakers (5 males and 5 females). The acoustic features selected for the analysis were the duration, the formants (F1 and F2), the fundamental frequency (F0), and the total energy. The results showed that the duration, F0, F1, and the total energy increased in distant-talking speech compared to normal speech; female speakers showed a larger increase in all features except for the total energy and the fundamental frequency. In addition, speaker differences were observed.
