통합 검색 | Korea Science

Emotion Recognition Based on Frequency Analysis of Speech Signal

Sim, Kwee-Bo;Park, Chang-Hyun;Lee, Dong-Wook;Joo, Young-Hoon
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- 제2권2호
- /
- pp.122-126
- /
- 2002
In this study, we find features of 3 emotions (Happiness, Angry, Surprise) as the fundamental research of emotion recognition. Speech signal with emotion has several elements. That is, voice quality, pitch, formant, speech speed, etc. Until now, most researchers have used the change of pitch or Short-time average power envelope or Mel based speech power coefficients. Of course, pitch is very efficient and informative feature. Thus we used it in this study. As pitch is very sensitive to a delicate emotion, it changes easily whenever a man is at different emotional state. Therefore, we can find the pitch is changed steeply or changed with gentle slope or not changed. And, this paper extracts formant features from speech signal with emotion. Each vowels show that each formant has similar position without big difference. Based on this fact, in the pleasure case, we extract features of laughter. And, with that, we separate laughing for easy work. Also, we find those far the angry and surprise.
https://doi.org/10.5391/IJFIS.2002.2.2.122 인용 PDF KSCI

미국인 아동이 발음한 영어모음의 포먼트 궤적 (Formant Trajectories of English Vowels Produced by American Children)

양병곤
- 말소리와 음성과학
- /
- 제3권1호
- /
- pp.23-34
- /
- 2011
Many Korean children have difficulty learning English vowels. The gestures inside the oral and pharyngeal cavities are hard to control when they cannot see the gestures and the target vowel system is quite different from that of Korean. This study attempts to collect children's acoustic data of twelve English vowels published by Hillenbrand et al. (1995) online and to examine the acoustic features of English vowels for phoneticians and English teachers. The author used Praat to obtain the data systematically at six equidistant timepoints over the vowel segment avoiding any obvious errors. Results show inherent acoustic properties for vowels from the children's distribution of vowel duration, f0 and intensity values. Second, children's gestures for each vowel coincide with the regression analysis of all formant values at different timepoints regardless of the vocal fold and tract difference. Third, locus points appear higher than those of American males and females. Their gestures along the timepoints display almost similar patterns. From the results the author concludes that vowel formant trajectories provide useful and important information on dynamic articulatory gestures, which may be applicable to Korean children's education and correction of English vowels. Further studies on the developmental study of vowel formants and pitch values are desirable.
PDF

Speech recognition rates and acoustic analyses of English vowels produced by Korean students

Yang, Byunggon
- 말소리와 음성과학
- /
- 제14권2호
- /
- pp.11-17
- /
- 2022
English vowels play an important role in verbal communication. However, Korean students tend to experience difficulty pronouncing a certain set of vowels despite extensive education in English. The aim of this study is to apply speech recognition software to evaluate Korean students' pronunciation of English vowels in minimal pair words and then to examine acoustic characteristics of the pairs in order to check their pronunciation problems. Thirty female Korean college students participated in the recording. Speech recognition rates were obtained to examine which English vowels were correctly pronounced. To compare and verify the recognition results, such acoustic analyses as the first and second formant trajectories and durations were also collected using Praat. The results showed an overall recognition rate of 54.7%. Some students incorrectly switched the tense and lax counterparts and produced the same vowel sounds for qualitatively different English vowels. From the acoustic analyses of the vowel formant trajectories, some of these vowel pairs were almost overlapped or exhibited slight acoustic differences at the majority of the measurement points. On the other hand, statistical analyses on the first formant trajectories of the three vowel pairs revealed significant differences throughout the measurement points, a finding that requires further investigation. Durational comparisons revealed a consistent pattern among the vowel pairs. The author concludes that speech recognition and analysis software can be useful to diagnose pronunciation problems of English-language learners.
https://doi.org/10.13064/KSSS.2022.14.2.011 인용 PDF KSCI

Poodle의 발성음 (Common Calls of Poodle)

연성찬;서강문;권오경;남치주
- 한국임상수의학회지
- /
- 제13권2호
- /
- pp.163-170
- /
- 1996
This study was performed to analyse the common calls of poddle spectrographically : bark, growl, howl, snore, yelp and whine. The sonograms of 6 common calls were shown their own specific features. There were significant differences among each types of common callsin the parceter of minimun frequency of call (MIFC), maximun frequency of call (MAFC), duration of call (DC), interval between call (IBC), dominant frequency (DF), F1 formant, F2 formant and F3 formant (P<0.01). It was considered that it was possible to record the main common calls dogs by sonograms and it sould be applied to objective basic data for understanding the psychological stats of dogs, the social relationship among them and the relationship sith human being.
PDF

포만트 합성방식을 이용한 문자-음성 변환에 관한 연구 (A Study on the Text-to-Speech Conversion Using the Formant Synthesis Method)

최진산;김민년;서정욱;배건성
- 음성과학
- /
- 제2권
- /
- pp.9-23
- /
- 1997
Through iterative analysis and synthesis experiments on Korean monosyllables, the Korean text-to-speech system was implemented using the phoneme-based formant synthesis method. Since the formants of initial and final consonants in this system showed many variations depending on the medial vowels, the database for each phoneme was made up of formants depending on the medial vowels as well as duration information of transition region. These techniques were needed to improve the intelligibility of synthetic speech. This paper investigates also methods of concatenating the synthesis units to improve the quality of synthetic speech.
PDF

Perceptual Experiment on Number Production for Speaker Identification

Yang, Byung-Gon
- 음성과학
- /
- 제8권1호
- /
- pp.7-19
- /
- 2001
The acoustic parameters of nine Korean numbers were analyzed by Praat, a speech analysis software, and synthesized by SenSynPPC, a Klatt formant synthesizer. The overall intensity, pitch and formant values of the numbers were modified dynamically by a step of 1 dB, 1 Hz and 2.5% respectively. The study explored the sensitivity of listeners to changes in the three acoustic parameters. Twelve subjects (male and female) listened to 390 pairs of synthesized numbers and judged whether the given pair sounded the same or different. Results showed that subjects perceived the same sound quality within the range of 6.6 dB of intensity variation, 10.5 Hz of pitch variation and 5.9% of the first three formant variations. The male and female groups showed almost the same perceptual ranges. Also, an asymmetrical structure of high and low boundary was observed. The ranges may be applicable to the development of a speaker identification system while the method of synthesis modification may apply to its evaluation data.
PDF

포먼트에 의한 영어모음 비교 분석 (A Comparative Analysis on English Vowels of Korean Students by Formant Frequencies)

황영순
- 음성과학
- /
- 제8권4호
- /
- pp.221-228
- /
- 2001
The purpose of this study is to analyze the problems Korean students, having acoustic structure of Korean vowels, have when they pronounce English vowels by measuring formant frequencies. The experimental results show that the pronunciation of English vowels by Korean students is partially influenced by their Korean vowels. There is little distinction between /i/ and /I/, /U/ and /u/ due to the absence of short and long vowels in Korean pronunciation. Also, as observed in typical Korean vowel pronunciation, there is little difference between the F1 values of /$\varepsilon$/ and /$\{\ae}$/ by Korean speakers, resulting in inaccurate English pronunciation. In addition, compared to English native speakers, Korean speakers show the biggest difference in F1 value of /c/. The fact that they make pronunciation of /c/ covering /e/, /$\Lambda$/ and /c/ positions probably accounts for such phenomenon. The results of this experiment show the interference of Korean that occurred in some English vowels by native Korean speakers.
PDF

On Formant Extraction Based on Transfer Function

Jiang, Gang-Yi;Park, Tae-Young;Mei Yu
- The Journal of the Acoustical Society of Korea
- /
- 제18권2E호
- /
- pp.31-38
- /
- 1999
This paper focuses on extracting formants from transfer function, derived from linear prediction analysis of speech signal. The second derivative of the log magnitude spectrum of the transfer function, the first and third derivatives of the phase spectrum of the transfer function in the z-plane are discussed. Their resolutions of detecting formants are analyzed and some comparisons are given. Theoretical analyses and experimental results show that the third derivative of the phase spectrum decays more rapidly around the formant locations than the first derivative of the phase spectrum and the second derivative of the log magnitude spectrum. Compared with the second derivative of the log spectrum and the first derivative of the phase spectrum, the third derivative of the phase spectrum has higher resolution in frequency domain and provides more accurate formant extraction.
PDF

스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화 (Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms)

변해원
- 한국융합학회논문지
- /
- 제10권9호
- /
- pp.193-198
- /
- 2019
본 연구는 근위축성측삭경화증(amyotrophic lateral sclerosis, ALS)으로 진단된 여성을 대상으로 음향음성학적 스펙트로그램 분석을 이용하여 11개월 동안 모음과 이중모음의 포먼트 변화(vowel formant variation)를 분석하였다. 검사어는 단모음 /a, i, u/와 이중모음 /h + ja + da/, /h + wi + da/, /h +ɰi+ da/를 이용하였다. 발화자료는 'Alvin' 프로그램을 이용하여 모니터에 제시된 단어읽기과제를 통해 수집되었고, 녹음환경은 nyquist frequency는 5,500Hz, sampling rate는 11,000Hz으로 설정하였다. 녹음자료는 스펙트로그램을 이용하여 강도, 음도와 이중모음의 포먼트를 분석하였다. 분석결과, ALS의 진행과정에서 기본주파수와 강도가 저하되었고, 단모음에서의 포먼트 변화보다는 이중모음의 포먼트 기울기의 감소가 특징으로 확인되었다. 이 결과는 병의 진행에 따른 ALS의 모음왜곡이 혀와 턱의 협응력 감소에 기인함을 시사한다.
https://doi.org/10.15207/JKCS.2019.10.9.193 인용 PDF KSCI

편도적출술이 구강 및 비강 음향스팩트럼에 미치는 영향 (Effects of Tonsillectomy on Oral and Nasal Spectral Outputs for Sustained Vowel)

최동일;공일승;이은정;소상수;양윤수;홍기환
- 대한후두음성언어의학회지
- /
- 제18권1호
- /
- pp.33-38
- /
- 2007
Background and Objectives: It has been suggested that tonsillectomy possibly causes changes of voice because the morphology of the vocal tract is altered. This may cause serious problems for professional voice users. Materials and Method: Subjects were 26 patients. The oral and nasal sound spectrum of oral vowel /a/, /e/ and /i/ were measured before and after tonsillectomy. The formant frequencies and intensities for oral and nasal spectra were compared. The nasality and fundamental frequencies for oral vowel were measured. Results: The first formant frequencies for oral spectra of all vowels were not changed after surgery, but the second formant frequencies were increased significantly after surgery in the vowel /e/ and /i/. The first and second formant intensities for oral spectra were increased significantly after surgery in the all vowels. The first and second formant frequencies for nasal spectra of all vowels were not changed after surgery, but their intensities for nasal spectra were increased after surgery. The nasalities for oral vowel were not changed after surgery. Conclusion : Tonsillectomy appeared to change the spectral features of oral and nasal components of oral vowel, especially spectral intensities.
PDF

검색결과 191건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)