• Title/Summary/Keyword: formant parameters

Comparative Analysis for General and Estrus-related Vocalizations in Sows (모돈의 일반 발성음과 발정기 특이음의 비교분석)

  • Jeon, J.H.;Yeon, S.C.;Chang, H.H.
    • Journal of Animal Science and Technology / v.47 no.1 / pp.133-140 / 2005
  • The aim of this study was to divide the vocalizations of sows into general vocalizations (GVs) and estrus-related vocalizations (EVs) and to determine their phonetic characteristics. Ten sows (Landrace) were recorded using digital video recorders twice daily (06:00-08:00 h and 17:00-19:00 h) during the anestrus and estrus periods. The GVs and EVs were divided based on the shapes of the spectrum and spectrogram, and were identified as 5 and 3 types, respectively. Pitch, formant 1, formant 2, and formant 3 were not significantly different between GVs and EVs (P > 0.05), whereas intensity (P < 0.001), duration (P < 0.05), and formant 4 (P < 0.01) were significantly different. Three parameter groups (Group I: formant vector alone; Group II: formant vector + parameters from the time signal; Group III: formant vector + parameters from the time signal - parameters eliminated by backward stepwise discriminant analysis) were compared by discriminant function analysis. The classification system adopted in Group II showed a higher discrimination rate than those in the other groups (Group I: 76.1%, Group II: 88.1%, Group III: 87.3%). These results suggest that EVs are present and that intensity, formant 2, and formant 4 are available parameters for discriminating EVs in sows.

Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants (한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석)

  • Choi, Soon-Ai;Seong, Cheol-Jae
    • MALSORI / no.60 / pp.67-84 / 2006
  • The purpose of this study is to find acoustic parameters in the frequency domain that distinguish the Korean nasals /m, n, ŋ/ from each other. The new parameters are devised on the basis of the LTAS (Long Term Average Spectrum). The maximum peak amplitude and the corresponding formant frequency are measured in the low and high frequency ranges, respectively. The frequency of the spectral valley and its energy level are also obtained in a specific frequency range of the spectrum. Spectral slope, the total energy value in a specific frequency range, and statistics of the spectral energy distribution such as centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summarized as follows. 1) In syllable-initial position: the total energy value from 1,500 to 2,200 Hz (zeroENG). 2) In syllable-final position: the peak amplitude of the first formant (peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz (peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz (peak2_a), and the total energy value from 1,500 to 2,200 Hz (zeroENG).
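
The distribution statistics named in this abstract (centroid, skewness, kurtosis) and the band energy can be sketched as below. This is a minimal illustration, not the authors' procedure: it assumes the LTAS is already available as a list of bin frequencies with linear (not dB) energies, and the function names `spectral_moments` and `band_energy` are hypothetical. Kurtosis here is the plain fourth standardized moment (not excess kurtosis); the paper's exact definitions are not stated in the abstract.

```python
import math


def spectral_moments(freqs_hz, energies):
    """Return (centroid, spread, skewness, kurtosis) of a spectrum.

    The spectrum is treated as a distribution over frequency, with each
    bin's share of the total energy used as its weight.
    """
    total = sum(energies)
    if total <= 0:
        raise ValueError("spectrum has no energy")
    weights = [e / total for e in energies]
    centroid = sum(f * w for f, w in zip(freqs_hz, weights))
    variance = sum(((f - centroid) ** 2) * w for f, w in zip(freqs_hz, weights))
    spread = math.sqrt(variance)
    if spread == 0:
        raise ValueError("all energy lies in a single bin")
    skewness = sum(((f - centroid) ** 3) * w for f, w in zip(freqs_hz, weights)) / spread ** 3
    kurtosis = sum(((f - centroid) ** 4) * w for f, w in zip(freqs_hz, weights)) / spread ** 4
    return centroid, spread, skewness, kurtosis


def band_energy(freqs_hz, energies, low_hz, high_hz):
    """Sum the energy of all bins whose frequency lies in [low_hz, high_hz]."""
    return sum(e for f, e in zip(freqs_hz, energies) if low_hz <= f <= high_hz)
```

With a real LTAS, a value analogous to zeroENG could be obtained as `band_energy(freqs, energies, 1500, 2200)`, under the assumption that the bins are in linear energy units.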

Implementation of Formant Speech Analysis/Synthesis System (포만트 분석/합성 시스템 구현)

  • Lee, Joon-Woo;Son, Ill-Kwon;Bae, Keuo-Sung
    • Speech Sciences / v.1 / pp.295-314 / 1997
  • In this study, we implement a flexible formant analysis and synthesis system. In the analysis part, a two-channel approach (i.e., speech and EGG signals) is investigated for accurate estimation of formant information. The EGG signal is used to extract the exact pitch information needed for pitch-synchronous LPC analysis and closed-phase LPC analysis. In the synthesis part, the Klatt formant synthesizer is modified so that the user can change synthesis parameters arbitrarily. Experimental results demonstrate the superiority of the two-channel analysis method over the one-channel (speech signal only) method in analysis as well as in synthesis. The implemented system is expected to be very helpful for studying the effects of synthesis parameters on the quality of synthetic speech and for developing a Korean text-to-speech (TTS) system based on formant synthesis.

Measuring Acoustical Parameters of English Words by the Position in the Phrases (영어어구의 위치에 따른 단어의 음향 변수 측정)

  • Yang, Byung-Gon
    • Speech Sciences / v.14 no.4 / pp.115-128 / 2007
  • The purposes of this paper were to develop an automatic script that collects acoustic parameters (duration, intensity, pitch, and the first two formant values) of English words produced by two native Canadian speakers, either alone or in a two-word phrase at a normal speed, and to compare those values by position in the phrase. A Praat script was proposed to obtain comparable parameters at evenly divided time points of the target word. Results showed that the total duration of the word in the phrase was shorter than that of the word produced alone. This was attributed to the pronunciation style of the native speakers, who generally place the primary word stress in the first word position. Also, the reduction ratio of the male speaker depended on the word position in the phrase, while that of the female speaker did not. Moreover, the contours of intensity and pitch differed by the position of the target word in the phrase, while almost the same formant patterns were observed. Further studies would be desirable to examine these parameters for words in authentic speech materials.

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

  • Park, Sung-Jin;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine / v.16 no.1 / pp.61-73 / 2004
  • This study investigated the correlation between Sasang constitutional groups and voice characteristics using a voice analysis system (CSL in this study). I focused on voice characteristics in terms of harmonics, formant frequency, and formant bandwidth. The subjects were 71 males, classified into three groups: the Soeumin group, the Soyangin group, and the Taeumin group. Constitution was classified in two ways: QSCCII (Questionnaire for the Sasang Constitution Classification II) and an interview with a specialist in Sasang constitution. The 71 subjects were thus categorized into 31 Soeumin, 18 Soyangin, and 22 Taeumin. Pitch is approximately the fundamental frequency (F0) of the voice. Shimmer in dB evaluates the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. The FFT (Fast Fourier Transform) method in CSL can display sampled voices as harmonics; h1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference between h1 and h2 (h1-h2) reflects the speaker's phonation type, while formant frequency and bandwidth reflect the speaker's vocal tract. I therefore used the harmonics and the formant frequency and bandwidth as the voice parameters. First, I recorded /e/ from all subjects using a microphone, and then analyzed the recordings with CSL. Power Spectrum and Formant History are the CSL menus that display harmonics and formant frequency and bandwidth. The results on the correlation between Sasang constitutional groups and voice parameters are as follows: 1. There is no significant difference in the harmonic amplitude difference (h1-h2) among the three groups. 2. There is a significant difference between the Soeumin group and the Soyangin group in formant frequency 1 and formant bandwidth 1 (p<0.05). No other parameter showed a significant difference.
I assume that the Soyangin group has a clearer and brighter voice than the Soeumin group, based on the formant bandwidth difference, and I think this result agrees with the content of "Dongyi Suse Bowon" and "Sasangimhejinam".

Gender difference in speech intelligibility using speech intelligibility tests and acoustic analyses

  • Kwon, Ho-Beom
    • The Journal of Advanced Prosthodontics / v.2 no.3 / pp.71-76 / 2010
  • PURPOSE. The purpose of this study was to compare men and women in terms of speech intelligibility, to investigate the validity of objective acoustic parameters related to speech intelligibility, and to establish standard data for future studies in various fields of prosthodontics. MATERIALS AND METHODS. Twenty men and women served as subjects in the present study. After sample sounds were recorded, speech intelligibility tests by three speech pathologists and acoustic analyses were performed. Speech intelligibility test scores and acoustic parameters such as fundamental frequency, fundamental frequency range, formant frequency, formant ranges, vowel working space area, and vowel dispersion were compared between men and women. In addition, the correlations between the speech intelligibility values and acoustic variables were analyzed. RESULTS. Women showed significantly higher speech intelligibility scores than men, and there were significant differences between men and women in most of the acoustic parameters used in the present study. However, the correlations between the speech intelligibility scores and acoustic parameters were low. CONCLUSION. The speech intelligibility test and acoustic parameters used in the present study were effective in differentiating male voices from female voices, and their values might be used in future studies of patients involved with maxillofacial prosthodontics. However, further studies are needed on the correlation between speech intelligibility tests and objective acoustic parameters.
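
As a rough illustration of two of the parameters listed above, the sketch below computes a vowel working space area and a vowel dispersion from (F1, F2) pairs. The abstract does not give the formulas, so this assumes one common convention: the area is that of the polygon whose vertices are the corner vowels taken in order (shoelace formula), and dispersion is the mean Euclidean distance of the vowels from their centroid. The function names are hypothetical.

```python
import math


def vowel_space_area(corner_vowels):
    """Area of the polygon spanned by (F1, F2) points given in boundary order."""
    n = len(corner_vowels)
    if n < 3:
        raise ValueError("need at least three corner vowels")
    twice_area = 0.0
    for i in range(n):
        f1_a, f2_a = corner_vowels[i]
        f1_b, f2_b = corner_vowels[(i + 1) % n]  # wrap around to close the polygon
        twice_area += f1_a * f2_b - f1_b * f2_a
    return abs(twice_area) / 2.0


def vowel_dispersion(vowels):
    """Mean Euclidean distance of (F1, F2) points from their centroid."""
    n = len(vowels)
    if n == 0:
        raise ValueError("need at least one vowel")
    mean_f1 = sum(f1 for f1, _ in vowels) / n
    mean_f2 = sum(f2 for _, f2 in vowels) / n
    return sum(math.hypot(f1 - mean_f1, f2 - mean_f2) for f1, f2 in vowels) / n
```

The vertices must be supplied in order around the boundary of the vowel space; an arbitrary order would make the shoelace sum meaningless.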

Sound parameters for classifying individual sows(Landrace×Yorkshire) during nursing behavior (수유행동시 모돈(랜드레이스×요크셔) 발성음의 개체 판별을 위한 음성 파라미터)

  • Jeon, Jung-Hwan;Chang, Hong-Hee;Ha, Jeung-Key;Kim, Hyeon-Hui;Koo, Ja-Min;Lee, Hyo-Jong;Yeon, Seong-Chan
    • Korean Journal of Veterinary Research / v.43 no.1 / pp.165-169 / 2003
  • The aim of the present study was to analyse the grunts of sows and to extract parameters from the time and frequency signals during nursing behavior. Five crossbred Landrace×Yorkshire sows were used on day 5 or 6 postpartum. The grunts and behaviors of the five sows were recorded with five digital camcorders. Three parameter groups [Group I: formant vector alone; Group II: formant vector + parameters from the time signal; Group III: formant vector + parameters from the time signal - parameters eliminated by backward stepwise discriminant analysis (SDAB)], with parameter vectors extracted from single grunts in the maximum grunting rate period, were used to identify individual sows. The parameter groups were compared by discriminant function analysis. The classification system adopted in Group II showed a higher discrimination rate than those in the other groups (Group I: 63.3%, Group II: 83.0%, Group III: 80.0%). This study demonstrated that formant, intensity, and pitch are available sound parameters for identifying individual sows during nursing behavior.

Perceptual Experiment on Number Production for Speaker Identification

  • Yang, Byung-Gon
    • Speech Sciences / v.8 no.1 / pp.7-19 / 2001
  • The acoustic parameters of nine Korean numbers were analyzed by Praat, a speech analysis software, and synthesized by SenSynPPC, a Klatt formant synthesizer. The overall intensity, pitch and formant values of the numbers were modified dynamically by a step of 1 dB, 1 Hz and 2.5% respectively. The study explored the sensitivity of listeners to changes in the three acoustic parameters. Twelve subjects (male and female) listened to 390 pairs of synthesized numbers and judged whether the given pair sounded the same or different. Results showed that subjects perceived the same sound quality within the range of 6.6 dB of intensity variation, 10.5 Hz of pitch variation and 5.9% of the first three formant variations. The male and female groups showed almost the same perceptual ranges. Also, an asymmetrical structure of high and low boundary was observed. The ranges may be applicable to the development of a speaker identification system while the method of synthesis modification may apply to its evaluation data.

The implementation of Korean adult's optimal formant setting by Praat scripting (성인 포먼트 측정에서의 최적 세팅 구현: Praat software와 관련하여)

  • Park, Jiyeon;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.11 no.4 / pp.97-108 / 2019
  • An automated Praat script was implemented to measure optimal formant frequencies for adults. Optimal formant analysis was taken to mean that the deviation of formant frequency resulting from various combinations of two setting parameters (maximum formant and number of formants) was minimal. To increase the reliability of formant analysis, the LPC order should be set differently depending on gender or vowel type. Praat recommends 5,000 Hz and 5,500 Hz as the maximum formant settings for males and females, respectively, and 5 as the number of formants for both. However, verification is needed to determine whether these recommended settings are valid for Korean vowels. Statistical analysis showed that formant frequencies varied significantly across the adapted scripts, especially for the data on females. Among the four kinds of scripts, formant plots and statistical results showed that linear_script and qtone_script were more stable and reliable in formant measurements. While linear_script was designed to increase the formant step linearly in a for-loop, the increment of the formant step in qtone_script followed a quarter-tone scale (base frequency × common ratio ($\sqrt[24]{2}$)). Looking at the tendency of the formant settings produced by these two algorithms, for the front vowels [i, e] the maximum formant was set higher and the number of formants lower than recommended by Praat. The back vowels [o, u], on the contrary, had a lower maximum formant and a higher number of formants than the standard setting.
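
The difference between the two stepping schemes described in this abstract can be sketched as follows. This is an illustration in Python rather than the authors' Praat scripts: the function names and the start, stop, and step values are hypothetical, and only the common ratio $\sqrt[24]{2}$ comes from the abstract.

```python
QUARTER_TONE_RATIO = 2 ** (1 / 24)  # common ratio of a quarter-tone scale


def linear_steps(start_hz, stop_hz, step_hz):
    """Candidate settings spaced by a constant number of Hz."""
    if step_hz <= 0:
        raise ValueError("step_hz must be positive")
    count = int((stop_hz - start_hz) // step_hz) + 1
    return [start_hz + k * step_hz for k in range(count)]


def quarter_tone_steps(start_hz, stop_hz):
    """Candidate settings where each value is 2**(1/24) times the previous one."""
    if start_hz <= 0:
        raise ValueError("start_hz must be positive")
    values = []
    k = 0
    while True:
        # Compute each value from the base frequency to avoid accumulating
        # rounding error from repeated multiplication.
        value = start_hz * 2 ** (k / 24)
        if value > stop_hz:
            break
        values.append(value)
        k += 1
    return values
```

On a quarter-tone scale, 24 steps double the frequency, so the spacing in Hz grows with frequency, whereas the linear scheme keeps the spacing in Hz constant.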

Pitch and Formant Trajectories of English Vowels by American Males with Different Speaking Styles (발화방식에 따른 미국인 남성 영어모음의 피치와 포먼트 궤적)

  • Yang, Byung-Gon
    • Phonetics and Speech Sciences / v.4 no.1 / pp.21-28 / 2012
  • Many previous studies reported acoustic parameters of English vowels produced by a clear speaking style. In everyday usage, we actually produce speech sounds with various speaking styles. Different styles may yield different acoustic measurements. This study attempts to examine pitch and formant trajectories of eleven English vowels produced by nine American males in order to understand acoustic variations depending on clear and conversational speaking styles. The author used Praat to obtain trajectories systematically at seven equidistant time points over the vowel segment while checking measurement validity. Results showed that pitch trajectories indicated distinct patterns depending on four speaking styles. Generally, higher pitch values were observed in the higher vowels and the pitch was higher in the clear speaking styles than that in the conversational styles. The same trend was observed in the three formant trajectories of front vowels and the first formant trajectories of back vowels. The second and third trajectories of back vowels revealed an opposite or inconsistent trend, which might be attributable to the coarticulation of the following consonant or lip rounding gestures. The author made a tentative conclusion that people tend to produce vowels to enhance pitch and formant differences to transmit their information clearly. Further perceptual studies on synthesized vowels with varying pitch and formant values are desirable to address the conclusion.
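
Sampling a trajectory at equidistant time points, as described in this abstract, can be sketched as below. The abstract does not say whether the segment boundaries themselves are sampled; this sketch assumes they are not, so that seven points divide the vowel into eight equal parts. The function name is hypothetical, and the study itself used Praat rather than Python.

```python
def equidistant_time_points(start_s, end_s, count=7):
    """Return `count` interior time points that split [start_s, end_s] evenly."""
    if count < 1:
        raise ValueError("count must be at least 1")
    if end_s <= start_s:
        raise ValueError("end_s must be later than start_s")
    step = (end_s - start_s) / (count + 1)
    return [start_s + step * k for k in range(1, count + 1)]
```

Pitch and formant values would then be queried at each returned time, for example a vowel from 0.0 s to 0.8 s yields points spaced 0.1 s apart under this assumption.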