• Title/Summary/Keyword: fundamental frequency

Search Result 1,615, Processing Time 0.027 seconds

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean (한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구)

  • Kwon, Soon-Il;Park, Ji-Hyung;Park, Neung-Soo
    • The KIPS Transactions:PartB
    • /
    • v.15B no.6
    • /
    • pp.595-602
    • /
    • 2008
  • The focused word of each sentence is a help in recognizing and understanding spoken Korean. To find the method of focused word spotting at spoken speech signal, we made an analysis of the average and variance of Fundamental Frequency and the average energy extracted from a focused word and the other words in a sentence by experiments with the speech data from 100 spoken sentences. The result showed that focused words have either higher relative average F0 or higher relative variances of F0 than other words. Our findings are to make a contribution to getting prosodic characteristics of spoken Korean and keyword extraction based on natural language processing.

New development of artificial record generation by wavelet theory

  • Amiri, G. Ghodrati;Ashtari, P.;Rahami, H.
    • Structural Engineering and Mechanics
    • /
    • v.22 no.2
    • /
    • pp.185-195
    • /
    • 2006
  • Nowadays it is very necessary to generate artificial accelerograms because of lack of adequate earthquake records and vast usage of time-history dynamic analysis to calculate responses of structures. According to the lack of natural records, the best choice is to use proper artificial earthquake records for the specified design zone. These records should be generated in a way that would contain seismic properties of a vast area and therefore could be applied as design records. The main objective of this paper is to present a new method based on wavelet theory to generate more artificial earthquake records, which are compatible with target spectrum. Wavelets are able to decompose time series to several levels that each level covers a specific range of frequencies. If an accelerogram is transformed by Fourier transform to frequency domain, then wavelets are considered as a transform in time-scale domain which frequency has been changed to scale in the recent domain. Since wavelet theory separates each signal, it is able to generate so many artificial records having the same target spectrum.

A Study of Peak Finding Algorithms for the Autocorrelation Function of Speech Signal

  • So, Shin-Ae;Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young;Park, Ji Su
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.12
    • /
    • pp.131-137
    • /
    • 2016
  • In this paper, the peak finding algorithms corresponding to the Autocorrelation Function (ACF), which are widely exploited for detecting the pitch of voiced signal, are proposed. According to various researchers, it is well known fact that the estimation of fundamental frequency (F0) in speech signal is not only very important task but quite difficult mission. The proposed algorithms, presented in this paper, are implemented by using many characteristics - such as monotonic increasing function - of ACF function. Thus, the proposed algorithms may be able to estimate both reliable and correct the fundamental frequency as long as the autocorrelation function of speech signal is accurate. Since the proposed algorithms may reduce the computational complexity it can be applied to the real-time processing. The speech data, is composed of Korean emotion expressed words, is used for evaluation of their performance. The pitches are measured to compare the performance of proposed algorithms.

A Study on Pitch Perception of Normal Korean (한국 성인 음성의 음도인식에 관한 연구)

  • Jeong, Ok-Ran;Kim, Hyung-Soon;Kim, Young-Tae;Sub, Jang-Su
    • Speech Sciences
    • /
    • v.1
    • /
    • pp.315-323
    • /
    • 1997
  • This study attempts to determine the fundamental frequency level of male and female voices that Koreans perceive as normal. Seventy-three college students majoring in Speech Pathology participated in the study on a voluntary basis. The subjects listened to a male voice with fundamental frequency of 60 Hz, 80 Hz, 100 Hz, 120 Hz, 140 Hz, 160 Hz, 180 Hz, and 200 Hz, and a female voice with fundamental frequency of 140 Hz, 160 Hz, 180 Hz, 200 Hz, 220 Hz, 240 Hz, 260 Hz, and 280 Hz. The PSOLA (Pitch Synchronous Overlap). method and harmonic modeling method of speech signal were used to change pitch in the 20 Hz interval. The voices were presented in a random order to prevent listener bias. The results were as follows; Firstly, $46.6\%$ judged male voice with 120 Hz as normal, and $19.2\%$ judged 140 Hz as normal, and another $19.2\%$ judged 160 Hz as normal. Secondly, $50.7\%$ perceived female voice with 220 Hz as normal, and $32.9\%\;and\;30.1\%$ responded to 200 Hz and 240 Hz, respectively. The problems and recommendations for a future investigation are discussed.

  • PDF

Shimmer Change According to Fundamental Frequency Variation of Korean Normal Adults

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.143-152
    • /
    • 2003
  • The present study was performed to investigate change in shimmer according to $F_{0}$ variation precisely, and to offer suggestions for a clinical application. The analysis for the present study was done by the fundamental frequency ($F_{0}$) and shimmer measurement results of the previous 120 Korean normal adults' voice study of Pyo et al. (2002), used three vowels, /i/, /a/, /and /u/. Through the analysis of 60 female samples from the previous study, we found that $F_{0}$ of the vowels was the highest in /u/, and the lowest in /a/, but, on the contrary, shimmer was highest in /a/and lowest in /u/. Thirty of 60 subjects showed such an inverse relationship between $F_{0}$ and shimmer, as a whole. In the vowel /a/, 47 of 60 subjects showed the increased $F_{0}$ and decreased shimmer, in /i/, 32 subjects, and in /u/, 33 subjects showed the same results. The decrease in shimmer means the improvement of voice quality, so by these results, we expect to answer the question why the patients with spasmodic dysphonia can improve their voice quality with increased pitched voice production.

  • PDF

The Analysis of Eletroglottographic Measures of Vowel and Sentence in Korean Healthy Adults (한국 정상 성인의 모음과 문단 산출 시 전기성문파형 측정)

  • Kim, Jae-Ock
    • Phonetics and Speech Sciences
    • /
    • v.2 no.4
    • /
    • pp.223-228
    • /
    • 2010
  • This study investigated the closed quotient and other voice quality parameters using electroglottography (EGG) in sustaining the vowel /a/ and reading a sentence at the comfortable pitch and loudness in healthy Korean adults. Seventy two healthy adults (36 men, 36 women) aged 20~40 years were included in the study. The tasks were recorded and analyzed using Lx Speech Studio. In vowel sustaining task, closed quotient (Qx), fundamental frequency (Fx), sound pressure level (SPL), Jitter, and Shimmer were measured. In sentence reading task, closed quotient (DQx), fundamental frequency (DFx), and sound pressure level (DAx) were measured. The sex effects were observed on Qx, Fx, Shimmer, DQx, and DFx. Men had significantly higher Qx and DQx than women, but had significantly lower Shimmer than women. However, there was no sex effect on Jitter. The task effects on Qx and SPL as well as DQx and DAx were also assessed. Qx and SPL were significantly higher than DQx and DAx in both gender. This study showed that the closed quotients in both vowel sustaining and sentence reading tasks were significantly related to other voice quality parameters. Therefore, clinicians and researchers should describe the voice quality parameters like fundamental frequency, sound pressure level, Jitter, Shimmer, and so on when reporting closed quotients using EGG.

  • PDF

The fundamental frequency (f0) distribution of Korean speakers in a dialogue corpus using Praat and R (Praat과 R로 분석한 한국인 대화 음성 말뭉치의 fundamental frequency(f0)값 분포)

  • Byunggon Yang
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.17-25
    • /
    • 2023
  • This study examines the fundamental frequency(f0) distribution of 2,740 Korean speakers in a dialogue speech corpus. Praat and R were used for the collection and analysis of acoustical f0 data after removing extreme values considering the interquartile f0 range of the intonational phrases produced by each individual speaker. Results showed that the average f0 value of all speakers was 185 Hz and the median value was 187 Hz. The f0 data showed a positively skewed distribution of 0.11, and the kurtosis was -0.09, which is close to the normal distribution. The pitch values of daily conversations varied in the range of 238 Hz. Further examination of the male and female groups showed distinct median f0 values: 114 Hz for males and 199 Hz for females. A t-test between the two groups yielded a significant difference. The skewness representing the distribution shape was 1.24 for the male group and 0.58 for the female group. The kurtosis was 5.21 and 3.88 for the male and female groups, and the male group values appeared leptokurtic. A regression analysis between the median f0 and age yielded a slope of 0.15 for the male group and -0.586 for the female group, which indicated a divergent relationship. In conclusion, a normative f0 distribution of different Korean age and sex groups can be examined in the conversational speech corpus recorded by a massive number of participants. However, more rigorous data might be required to define a relation between age and f0 values.

Theoretical Study of the Circuits for Device of the High Voltage Pulse Generator (고전압 펄스 발생 장치의 회로에 관한 이론적 연구)

  • Kim, Young-Ju
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.27 no.1
    • /
    • pp.99-108
    • /
    • 2013
  • The high-voltage pulse generator is consist of transformers of fundamental wave and harmonic waves, and shunt capacitances. The pulse has the fundamental wave and the harmonic waves that have been increased as a series circuit by the transformers to make high voltage pulse. This paper shows that pulse generator circuit is analyzed using Miller's theorem and network theory(ABCD Matrix) and simulated in frequency and time domain using Matlab program. The output voltage of pulse were obtained to 2.5kHz, 1.8kV. Output pulse voltage increases as $L_m$ increases in low voltage circuit. In high voltage circuit, outer capacitors are related to frequency band pass characteristics.

PSO algorithm for fundamental frequency optimization of fiber metal laminated panels

  • Ghashochi-Bargh, H.;Sadr, M.H.
    • Structural Engineering and Mechanics
    • /
    • v.47 no.5
    • /
    • pp.713-727
    • /
    • 2013
  • In current study, natural frequency response of fiber metal laminated (FML) fibrous composite panels is optimized under different combination of the three classical boundary conditions using particle swarm optimization (PSO) algorithm and finite strip method (FSM). The ply angles, numbers of layers, panel length/width ratios, edge conditions and thickness of metal sheets are chosen as design variables. The formulation of the panel is based on the classical laminated plate theory (CLPT), and numerical results are obtained by the semi-analytical finite strip method. The superiority of the PSO algorithm is demonstrated by comparing with the simple genetic algorithm.

Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP (성별에 따른 한국 정상 성인 음성의 음향학적 평가 기준치)

  • Kim, Jae-Ock
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.147-157
    • /
    • 2009
  • The purpose of the study is to develop the normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories in the 34 parameters of MDVP were analyzed in the voices of 170 Korean normal adults taken from /a/ vowel. Among them, Fundamental Frequency Parameters and Frequency Perturbation Parameters were significantly different by gender. In addition, Fundamental Frequency Parameters of our data were remarkably different from the data suggested in the MDVP program which currently used in clinics. Therefore, the data obtained from the current study can be effectively used for the diagnosis of voice disorders of Korean adults as the standard parameter values of MDVP.

  • PDF