Search | Korea Science

Emotion Recognition Based on Frequency Analysis of Speech Signal

Sim, Kwee-Bo;Park, Chang-Hyun;Lee, Dong-Wook;Joo, Young-Hoon
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.2 no.2
- /
- pp.122-126
- /
- 2002
In this study, we find features of 3 emotions (Happiness, Angry, Surprise) as the fundamental research of emotion recognition. Speech signal with emotion has several elements. That is, voice quality, pitch, formant, speech speed, etc. Until now, most researchers have used the change of pitch or Short-time average power envelope or Mel based speech power coefficients. Of course, pitch is very efficient and informative feature. Thus we used it in this study. As pitch is very sensitive to a delicate emotion, it changes easily whenever a man is at different emotional state. Therefore, we can find the pitch is changed steeply or changed with gentle slope or not changed. And, this paper extracts formant features from speech signal with emotion. Each vowels show that each formant has similar position without big difference. Based on this fact, in the pleasure case, we extract features of laughter. And, with that, we separate laughing for easy work. Also, we find those far the angry and surprise.
https://doi.org/10.5391/IJFIS.2002.2.2.122 인용 PDF KSCI

Formant Trajectories of English Vowels Produced by American Children (미국인 아동이 발음한 영어모음의 포먼트 궤적)

Yang, Byung-Gon
- Phonetics and Speech Sciences
- /
- v.3 no.1
- /
- pp.23-34
- /
- 2011
Many Korean children have difficulty learning English vowels. The gestures inside the oral and pharyngeal cavities are hard to control when they cannot see the gestures and the target vowel system is quite different from that of Korean. This study attempts to collect children's acoustic data of twelve English vowels published by Hillenbrand et al. (1995) online and to examine the acoustic features of English vowels for phoneticians and English teachers. The author used Praat to obtain the data systematically at six equidistant timepoints over the vowel segment avoiding any obvious errors. Results show inherent acoustic properties for vowels from the children's distribution of vowel duration, f0 and intensity values. Second, children's gestures for each vowel coincide with the regression analysis of all formant values at different timepoints regardless of the vocal fold and tract difference. Third, locus points appear higher than those of American males and females. Their gestures along the timepoints display almost similar patterns. From the results the author concludes that vowel formant trajectories provide useful and important information on dynamic articulatory gestures, which may be applicable to Korean children's education and correction of English vowels. Further studies on the developmental study of vowel formants and pitch values are desirable.
PDF

Speech recognition rates and acoustic analyses of English vowels produced by Korean students

Yang, Byunggon
- Phonetics and Speech Sciences
- /
- v.14 no.2
- /
- pp.11-17
- /
- 2022
English vowels play an important role in verbal communication. However, Korean students tend to experience difficulty pronouncing a certain set of vowels despite extensive education in English. The aim of this study is to apply speech recognition software to evaluate Korean students' pronunciation of English vowels in minimal pair words and then to examine acoustic characteristics of the pairs in order to check their pronunciation problems. Thirty female Korean college students participated in the recording. Speech recognition rates were obtained to examine which English vowels were correctly pronounced. To compare and verify the recognition results, such acoustic analyses as the first and second formant trajectories and durations were also collected using Praat. The results showed an overall recognition rate of 54.7%. Some students incorrectly switched the tense and lax counterparts and produced the same vowel sounds for qualitatively different English vowels. From the acoustic analyses of the vowel formant trajectories, some of these vowel pairs were almost overlapped or exhibited slight acoustic differences at the majority of the measurement points. On the other hand, statistical analyses on the first formant trajectories of the three vowel pairs revealed significant differences throughout the measurement points, a finding that requires further investigation. Durational comparisons revealed a consistent pattern among the vowel pairs. The author concludes that speech recognition and analysis software can be useful to diagnose pronunciation problems of English-language learners.
https://doi.org/10.13064/KSSS.2022.14.2.011 인용 PDF KSCI

Common Calls of Poodle (Poodle의 발성음)

연성찬;서강문;권오경;남치주
- Journal of Veterinary Clinics
- /
- v.13 no.2
- /
- pp.163-170
- /
- 1996
This study was performed to analyse the common calls of poddle spectrographically : bark, growl, howl, snore, yelp and whine. The sonograms of 6 common calls were shown their own specific features. There were significant differences among each types of common callsin the parceter of minimun frequency of call (MIFC), maximun frequency of call (MAFC), duration of call (DC), interval between call (IBC), dominant frequency (DF), F1 formant, F2 formant and F3 formant (P<0.01). It was considered that it was possible to record the main common calls dogs by sonograms and it sould be applied to objective basic data for understanding the psychological stats of dogs, the social relationship among them and the relationship sith human being.
PDF

A Study on the Text-to-Speech Conversion Using the Formant Synthesis Method (포만트 합성방식을 이용한 문자-음성 변환에 관한 연구)

Choi, Jin-San;Kim, Yin-Nyun;See, Jeong-Wook;Bae, Geun-Sune
- Speech Sciences
- /
- v.2
- /
- pp.9-23
- /
- 1997
Through iterative analysis and synthesis experiments on Korean monosyllables, the Korean text-to-speech system was implemented using the phoneme-based formant synthesis method. Since the formants of initial and final consonants in this system showed many variations depending on the medial vowels, the database for each phoneme was made up of formants depending on the medial vowels as well as duration information of transition region. These techniques were needed to improve the intelligibility of synthetic speech. This paper investigates also methods of concatenating the synthesis units to improve the quality of synthetic speech.
PDF

Perceptual Experiment on Number Production for Speaker Identification

Yang, Byung-Gon
- Speech Sciences
- /
- v.8 no.1
- /
- pp.7-19
- /
- 2001
The acoustic parameters of nine Korean numbers were analyzed by Praat, a speech analysis software, and synthesized by SenSynPPC, a Klatt formant synthesizer. The overall intensity, pitch and formant values of the numbers were modified dynamically by a step of 1 dB, 1 Hz and 2.5% respectively. The study explored the sensitivity of listeners to changes in the three acoustic parameters. Twelve subjects (male and female) listened to 390 pairs of synthesized numbers and judged whether the given pair sounded the same or different. Results showed that subjects perceived the same sound quality within the range of 6.6 dB of intensity variation, 10.5 Hz of pitch variation and 5.9% of the first three formant variations. The male and female groups showed almost the same perceptual ranges. Also, an asymmetrical structure of high and low boundary was observed. The ranges may be applicable to the development of a speaker identification system while the method of synthesis modification may apply to its evaluation data.
PDF

A Comparative Analysis on English Vowels of Korean Students by Formant Frequencies (포먼트에 의한 영어모음 비교 분석)

Hwang, Young-Soon
- Speech Sciences
- /
- v.8 no.4
- /
- pp.221-228
- /
- 2001
The purpose of this study is to analyze the problems Korean students, having acoustic structure of Korean vowels, have when they pronounce English vowels by measuring formant frequencies. The experimental results show that the pronunciation of English vowels by Korean students is partially influenced by their Korean vowels. There is little distinction between /i/ and /I/, /U/ and /u/ due to the absence of short and long vowels in Korean pronunciation. Also, as observed in typical Korean vowel pronunciation, there is little difference between the F1 values of /$\varepsilon$/ and /$\{\ae}$/ by Korean speakers, resulting in inaccurate English pronunciation. In addition, compared to English native speakers, Korean speakers show the biggest difference in F1 value of /c/. The fact that they make pronunciation of /c/ covering /e/, /$\Lambda$/ and /c/ positions probably accounts for such phenomenon. The results of this experiment show the interference of Korean that occurred in some English vowels by native Korean speakers.
PDF

On Formant Extraction Based on Transfer Function

Jiang, Gang-Yi;Park, Tae-Young;Mei Yu
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.2E
- /
- pp.31-38
- /
- 1999
This paper focuses on extracting formants from transfer function, derived from linear prediction analysis of speech signal. The second derivative of the log magnitude spectrum of the transfer function, the first and third derivatives of the phase spectrum of the transfer function in the z-plane are discussed. Their resolutions of detecting formants are analyzed and some comparisons are given. Theoretical analyses and experimental results show that the third derivative of the phase spectrum decays more rapidly around the formant locations than the first derivative of the phase spectrum and the second derivative of the log magnitude spectrum. Compared with the second derivative of the log spectrum and the first derivative of the phase spectrum, the third derivative of the phase spectrum has higher resolution in frequency domain and provides more accurate formant extraction.
PDF

Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms (스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화)

Byeon, Haewon
- Journal of the Korea Convergence Society
- /
- v.10 no.9
- /
- pp.193-198
- /
- 2019
This study analyzed the changes of vowel formant, voice intensity, and fundamental frequency of vowels for 11 months using acoustochemical spectrogram analysis of women diagnosed with amyotrophic lateral sclerosis (ALS). The test word was a vowel /a, i, u/ and a diphthong /h + ja + da/, /h + wi + da/, and /h +ɰi+ da/. Speech data were collected through the word reading task presented on the monitor using 'Alvin' program, and the recording environment was set to 5,500 Hz for the nyquist frequency and 11,000 Hz for the sampling rate. The records were analyzed by using spectrograms to vowel formants, voice intensity, and fundamental frequency. As a result of analysis, the fundamental frequency and intensity of the ALS process were decreased and the formant slope of the diphthong was decreased rather than the formant change in the vowel. This result suggests that the vowel distortion of ALS due to disease progression is due to the decrease of tongue and jaw co morbidity.
https://doi.org/10.15207/JKCS.2019.10.9.193 인용 PDF KSCI

Effects of Tonsillectomy on Oral and Nasal Spectral Outputs for Sustained Vowel (편도적출술이 구강 및 비강 음향스팩트럼에 미치는 영향)

Choi, Dong-Il;Kong, Il-Seung;Lee, Eun-Jung;So, Sang-Soo;Yang, Yoon-Soo;Hong, Ki-Hwan
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.18 no.1
- /
- pp.33-38
- /
- 2007
Background and Objectives: It has been suggested that tonsillectomy possibly causes changes of voice because the morphology of the vocal tract is altered. This may cause serious problems for professional voice users. Materials and Method: Subjects were 26 patients. The oral and nasal sound spectrum of oral vowel /a/, /e/ and /i/ were measured before and after tonsillectomy. The formant frequencies and intensities for oral and nasal spectra were compared. The nasality and fundamental frequencies for oral vowel were measured. Results: The first formant frequencies for oral spectra of all vowels were not changed after surgery, but the second formant frequencies were increased significantly after surgery in the vowel /e/ and /i/. The first and second formant intensities for oral spectra were increased significantly after surgery in the all vowels. The first and second formant frequencies for nasal spectra of all vowels were not changed after surgery, but their intensities for nasal spectra were increased after surgery. The nasalities for oral vowel were not changed after surgery. Conclusion : Tonsillectomy appeared to change the spectral features of oral and nasal components of oral vowel, especially spectral intensities.
PDF

Search Result 191, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)