• Title/Summary/Keyword: Formant

Search Result 414, Processing Time 0.021 seconds

Analyzing the element of emotion recognition from speech (음성으로부터 감성인식 요소분석)

  • 심귀보;박창현
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.6
    • /
    • pp.510-515
    • /
    • 2001
  • Generally, there are (1)Words for conversation (2)Tone (3)Pitch (4)Formant frequency (5)Speech speed, etc as the element for emotional recognition from speech signal. For human being, it is natural that the tone, vice quality, speed words are easier elements rather than frequency to perceive other s feeling. Therefore, the former things are important elements fro classifying feelings. And, previous methods have mainly used the former thins but using formant is good for implementing as machine. Thus. our final goal of this research is to implement an emotional recognition system based on pitch, formant, speech speed, etc. from speech signal. In this paper, as first stage we foun specific features of feeling angry from his words when a man got angry.

  • PDF

Formant Trajectories of English Vowels Produced by American Females (미국인 여성이 발음한 영어모음의 포먼트 궤적)

  • Yang, Byung-Gon
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.3-9
    • /
    • 2009
  • Acoustically English vowels are defined primarily by formant values. The measurements of the values have been usually made at a few time points of the vowel segment despite the fact that the majority of English vowel formants vary dynamically throughout the segment. This study attempts to collect acoustic data of the nine English vowels published by Hillenbrand et al. (1995) online and to examine the acoustic features of the English vowels for phoneticians and English teachers. The author used Praat to obtain the data systematically at six equidistant timepoints over the vowel segment. Obvious errors were corrected based on the spectrographic display of each vowel. Results show that the first two formant trajectories are important to separate the nine vowels within the front- or back-vowel groups. The third formant trajectories appear comparable except those of the high vowels. Second, the back vowels leave longer traces on the vowel space toward the locus of the following consonant /d/. Third, each vowel has inherent duration, pitch, and intensity patterns. The results match the findings of Yang (2009). From the results, the author concludes that dynamic spectral changes are important in specifying acoustic characteristics of English vowels. Further studies on the application of the vowel trajectories to English pronunciation lessons or on perceptual experiment of synthesized vowels are desirable.

  • PDF

A Study on the Pitch Detection of Speech Harmonics by the Peak-Fitting (음성 하모닉스 스펙트럼의 피크-피팅을 이용한 피치검출에 관한 연구)

  • Kim, Jong-Kuk;Jo, Wang-Rae;Bae, Myung-Jin
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.85-95
    • /
    • 2003
  • In speech signal processing, it is very important to detect the pitch exactly in speech recognition, synthesis and analysis. If we exactly pitch detect in speech signal, in the analysis, we can use the pitch to obtain properly the vocal tract parameter. It can be used to easily change or to maintain the naturalness and intelligibility of quality in speech synthesis and to eliminate the personality for speaker-independence in speech recognition. In this paper, we proposed a new pitch detection algorithm. First, positive center clipping is process by using the incline of speech in order to emphasize pitch period with a glottal component of removed vocal tract characteristic in time domain. And rough formant envelope is computed through peak-fitting spectrum of original speech signal infrequence domain. Using the roughed formant envelope, obtain the smoothed formant envelope through calculate the linear interpolation. As well get the flattened harmonics waveform with the algebra difference between spectrum of original speech signal and smoothed formant envelope. Inverse fast fourier transform (IFFT) compute this flattened harmonics. After all, we obtain Residual signal which is removed vocal tract element. The performance was compared with LPC and Cepstrum, ACF. Owing to this algorithm, we have obtained the pitch information improved the accuracy of pitch detection and gross error rate is reduced in voice speech region and in transition region of changing the phoneme.

  • PDF

Efficient Tracking of Speech Formant Using Closed Phase WRLS-VFF-VT Algorithm

  • Lee, Kyo-Sik;Park, Kyu-Sik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.2E
    • /
    • pp.8-13
    • /
    • 2000
  • In this paper, we present an adaptive formant tracking algorithm for speech using closed phase WRLS-VFF-VT method. The pitch synchronous closed phase methods is known to give more accurate estimates of the vocal tract parameters than the pitch asynchronous method. However the use of a pitch-synchronous closed phase analysis method has been limited due to difficulties associated with the task of accurately isolating the closed phase region in successive periods of speech. Therefore we have implemented the pitch synchronous closed phase WRLS-VFF-VT algorithm for speech analysis, especially for formant tracking. The proposed algorithm with the variable threshold(VT) can provide a superior performance in the boundary of phone and voiced/unvoiced sound. The proposed method is experimentally compared with the other method such as two channel CPC method by using synthetic waveform and real speech data. From the experimental results, we found that the block data processing techniques, such as the two-channel CPC, gave reasonable estimates of the formant/antiformant. However, the data windows used by these methods included the effects of the periodic excitation pulses, which affected the accuracy of the estimated formants. On the other hand the proposed WRLS-VFF-VT method, which eliminated the influence of the pulse excitation by using an input estimation as part of the algorithm, gave very accurate formant/bandwidth estimates and good spectral matching.

  • PDF

Nursing and Suckling Behaviour in Domestic Pigs 1. Characteristics of the Grunting Sound of the Sow(Landrace $\times$ Yorkshire) during Nursing Behaviour (돼지의 수.포유 행동 I. 수유 행동에서 모돈(랜드레이스$\times$요크셔) 발성음의 특성)

  • 장홍희;연성찬
    • Journal of Veterinary Clinics
    • /
    • v.19 no.2
    • /
    • pp.191-194
    • /
    • 2002
  • The nursing vocalization of domestic pigs(Landrace$\times$Yorkshire) was investigated with respect to common features. All vocalizations uttered during nursings in 5 sows at 5 days after farrowing were recorded and 305 grunts were processed in a spectrograph. The sow's repeated grunting during nursing can be regarded as a contact call and a signal of the mother to start and synchronize the suckling behavior of the piglets. Analysis in the time domain revealed the gross structure of the call, whereas in the frequency domain the fine structure of single grunts was investigated. Nursing interval, duration of nursing behavior, duration of grunt, grunt rate per 10 seconds, fundamental frequency, 1 formant, 2 formant, 3 formant, 4 formant and spectrum were investigated. The results showed that mean interval between the nursing following one another was 25, 4.6 min and duration of nursing behavior was 3.2 $\pm$ 0.7 min. Average duration of grunt was 203.9 $\pm$ 63.6 ms. The formant contours could be identified. The nursing behavior might be disturbed by the grunts of alien sow.

Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants (한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석)

  • Choi, Soon-Ai;Seong, Cheol-Jae
    • MALSORI
    • /
    • no.60
    • /
    • pp.67-84
    • /
    • 2006
  • The purpose of this study is to find some acoustic parameters on frequency domain to distinguish the Korean nasals, $/m,\;n,\;{\eta}/$ from each other. The new parameters are devised on the basis of LTAS (Long Term Average Spectrum). The maximum peak amplitude and the relevant formant frequency are measured in low and high frequency range, respectively. The frequency of spectral valley and its energy level are also obtained in the specific frequency range of the spectrum. Spectral slope, total energy value in specific frequency range, statistical distribution of spectral energy like centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summerized as follows. 1) in syllable initial positions: the total energy value from 1,500 to 2,200 Hz(zeroENG); 2) in syllable final positions: the peak amplitude of the first formant(peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz(peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz(peak2_a), and the total energy value from 1,500 to 2,200 Hz(zeroENG).

  • PDF

A Study on the Formant Comparison of Korean Monophthongs according to Age and Gender -A Survey on Patients in Oriental Hospitals- (연령 및 성별에 따른 한국인 단모음 포먼트 비교에 관한 연구 -한방병원 내원환자를 중심으로-)

  • Kim, Young-Su;Kim, Keun Ho;Kim, Jong Yeol;Jang, Jun-Su
    • Phonetics and Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.73-80
    • /
    • 2013
  • Formant is one of the essential vocal features for research of voice production, recognition and synthesis. Numerous studies were established on foreign languages including English vowels. However, studies related to Korean were done with a limited number of voice data. In this study, we compare four formants according to age and gender using a large number of Korean monophthongs. A total of 2614 Korean speakers participated in our experiments. We summarize statistical results by mean and standard deviation for each formant of five monophthongs. The results show a notable difference in each age and gender group. A quantitative study based on a large dataset is suggested for future studies on Korean speech sounds.

A study on the automatic recognition of Korean vowel (한국어 단모음 자동 인식에 관한 연구)

  • 안동순
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1984.12a
    • /
    • pp.57-61
    • /
    • 1984
  • In this study, the system is proposed which can be used for recognition of Koean single vowles "ㅏ, ㅓ, ㅗ, ㅜ, ㅡ, ㅣ, ㅐ, ㅔ, ㅚ,", and automatic recognition is processed using $\mu$-computer. 3 men of not-being-studied are participated in this experiment. Using the period of vowels, one part of the steady state is selected for high speed recognition, and amplitude comparison method, LPC, PARCOR, and Formant are used for parameter of recognition. Formant is obtained by peak picking method using LPC, and then vowels are recognized by amplitude comparison method, LPC, PARCOR, and Formant. As a result, Recognition rates are 90.1% for amplitude comparison method, 93.1% for LPC, 100% for PARCOR, 88.8% for using formant.

  • PDF

A Study on the Length and Formant Structures of the Korean Liquid 'ㄹ' Pronounced by Chinese Learners and Koreans (중국인 한국어 학습자와 한국인의 'ㄹ' 발음의 길이와 포먼트에 대한 연구)

  • Fan Liu
    • MALSORI
    • /
    • no.57
    • /
    • pp.43-58
    • /
    • 2006
  • This study aims to investigate whether Chinese learning Korean and Korean native speakers show any difference in length and formant structures of the Korean liquid 'ㄹ' in the environments of v_v and v_# through the acoustic analysis of 10 Chinese learners' and 10 Koreans' utterances. The acoustic analysis of L2KSC DB shows that the length and formant structures of 'ㄹ' produced by Chinese learners are significantly different from the ones by Koreans. I explain these differences by contrasting the liquids and syllable structure constraints of the two languages, Chinese and Korean. In addition, I relate the F1 and F2's values to the tongue's movement when making a constriction, and conclude that Chinese learners pronounce the 'ㄹ' in the v_# environment with the tongue lower and backer than Koreans do.

  • PDF

Sound Analysis of Cleft Platate Patinents Using Formant Position (포르만트 위치비교를 이용한 구개열 환자의 발음분석)

  • 김덕원;송철규
    • Journal of Biomedical Engineering Research
    • /
    • v.11 no.2
    • /
    • pp.283-288
    • /
    • 1990
  • As one of the main purpose of the physical management of cleft palate is to provide for the anatomic and physiologic requisites for speech, the speech must be as one of the criteria for determining when physical management has been achieved. But there is no objective methods to evaluate the speech of cleft palate patients. The authors tried to analyze the speech of adult cleft palate patients using sound spectrog raphy and compared with normal adults. The results were obtained as follows ; 1. In Vowels, cleft palate patients of both sexes showed reduction of frequency of the first and second formant as compared to normal. There was minimal difference in front vowels (i, e, ae) 2. In consonants, cleft palate patients showed reduction of frequency of the first formant in both sexes but reduction of frequency of the second formant was noticed only in fe- male patients. 3. There was no statistical difference in sound spectrograph between plosive, fricative, africative, nasal, and glide consonants.

  • PDF