• 제목/요약/키워드: Formant Analysis

검색결과 191건 처리시간 0.022초

벅아이 코퍼스에서의 연령별 모음 포먼트 분석 (An Analysis of the Vowel Formants of the Young versus Old Speakers in the Buckeye Corpus)

  • 김지은;윤규철
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.29-35
    • /
    • 2012
  • The purpose of this study was to measure the first two vowel formants of the forty male and female speakers (twenty young vs. old male speakers and twenty young vs. old female speakers) from the Buckeye Corpus of Conversational Speech and to examine the vowel formant changes across two generations (younger vs. older). The results indicated that the vowel space of the younger generation (in their thirties or less) shifted to the lower left position compared to those of the older generation (in their forties or more) in both male and female speakers. When the results were compared to those of Peterson & Barney (1952), it appears that differences can be found in the size of the vowel spaces through time.

CHARACTERISTICS OF COW′S VOICES IN TIME AND FREQUENCY DOMAINS FOR RECOGNITION

  • Ikeda, Y.;Ishii, Y.
    • 한국농업기계학회:학술대회논문집
    • /
    • 한국농업기계학회 2000년도 THE THIRD INTERNATIONAL CONFERENCE ON AGRICULTURAL MACHINERY ENGINEERING. V.II
    • /
    • pp.196-203
    • /
    • 2000
  • On the assumption that the voices of the cows are produced by the linear prediction filter, we characterized the cows' voices. The order of this filter is determined by examining the voices characteristics both in time and frequency domains. The proposed order of the linear prediction filter is 15 for modeling voice production of the cow. The combination of the two parameters of the fundamental frequency, the slope of the straight line regressed from the log-log spectra of the amplitude-envelope and the only one coefficient involved in the linear prediction filter can differentiate the two cows.

  • PDF

Characteristics of Cow´s Voices in Time and Frequency domains for Recognition

  • Ikeda, Yoshio;Ishii, Y.
    • Agricultural and Biosystems Engineering
    • /
    • 제2권1호
    • /
    • pp.15-23
    • /
    • 2001
  • On the assumption that the voices of the cows are produced by the linear prediction filter, we characterized the cows’voices. The order of this filter was determined by examining the voice characteristics both in time and frequency domains. The proposed order of the linear prediction filter is 15 for modeling voice production of the cow. The characteristics of the amplitude envelope of the voice signal was investigated by analyzing the sequence of the short time variance both in time and frequency domains, and the new parameters were defined. One of the coefficients o the linear prediction filter generating the voice signal, the fundamental frequency, the slope of the straight line regressed from the log-log spectra of the short time variance and the coefficients of the linear prediction filter generating the sequence of the short time variance of the voice signal can differentiate the two cows.

  • PDF

Glottal Parameters Contributing to the Perception of Loud Voices

  • Yi, So-Pae;Lee, One-Good;Kim, Hyung-Soon
    • 음성과학
    • /
    • 제8권1호
    • /
    • pp.143-157
    • /
    • 2001
  • This paper focused on glottal parameters contributing to the perception of loud voices because energy of a voice is not the only effective factor. We used a formant synthesizer to synthesize loud voices. We divided F0 tilt (the tilt of F0 contour), SQ (Speed Quotient), OQ (Open Quotient) and TL (spectral Tilt Level) into three levels to get different combinations with default values for the other synthesizer parameters. Analysis of listening tests indicated that F0 tilt, SQ, OQ and TL in descending order had significant influence on the perception of loud voices. F0 tilt had a far more significant effect than the others. The influence of SQ increased greatly with the exclusion of F0 tilt as a factor. The interaction between parameters was not significant.

  • PDF

정신피로와 음성특징과의 상관관계 측정 (Measuring Correlation between Mental Fatigues and Speech Features)

  • 김정인;권철홍
    • 말소리와 음성과학
    • /
    • 제6권2호
    • /
    • pp.3-8
    • /
    • 2014
  • This paper deals with how mental fatigue has an effect on human voice. For this a monotonous task to increase the feeling of the fatigue and a set of subjective questionnaire for rating the fatigue were designed. From the experiments the designed task was proven to be monotonous based on the results of the questionnaire responses. To investigate a statistical relationship between speech features extracted from the collected speech data and fatigue, the T test for two-related-samples was used. Statistical analysis shows that speech parameters deeply related to the fatigue are the first formant bandwidth, Jitter, H1-H2, cepstral peak prominence, and harmonics-to-noise ratio. According to the experimental results, it can be seen that voice is changed to be breathy as mental fatigue proceeds.

음성발생 모델로부터의 G-peak를 이용한 음성에너지 추출에 관한 연구 (A Study on the Energy Extraction Using G-peak from the Speech Production Model)

  • 배명진;임재열;안수길
    • 대한전자공학회논문지
    • /
    • 제24권3호
    • /
    • pp.381-386
    • /
    • 1987
  • By the speech production model, the first positive peak in a pitch interval of the voiced speech is mainly affected by the glottis and the first formant component, known as a typical energy source of the voiced speech. From these characteristics, the energy parameter can be replaced by the area of the area of the positve peak in a pitch interval, which parameter is generally used for classification of speech signals. In this method, the changed energy parameter is independent of window length applied for analysis, and the pitch can be extracted smultaneously. Furthermore, the energy can be extracted in the pitch period unit.

  • PDF

포먼트 주파수 특성에 근거한 신장 질환과 순음(層音)간의 비교·분석 (A Comparison and Analysis of Kidney Diseases and a Labial Sound Based on Formant Frequency Extraction)

  • 김봉현;가민경;이세환;곽지현;조동욱
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2008년도 추계학술발표대회
    • /
    • pp.137-140
    • /
    • 2008
  • 현대 사회는 육체적·정신적 활동을 많이 요구하게 되며 이러한 현상으로 스트레스의 증가와 이유없는 증후군의 발병이 점차 확대되고 있다. 특히 스트레스로 인한 피로의 누적으로 인체의 혈액 농도 및 순환에 영향을 끼치게 되며 이로 인해 신장의 상태가 악화될 수 있다. 따라서 신장의 이상 유무를 조기에 판단하여 적절한 조치를 취하는 것이 무엇보다 중요하다. 이를 위해 본 논문에서는 신장 질환을 앓고 있는 환자와 정상인을 대상으로 피실험자 집단군을 각각 구성하고 음성 정보의 형태학적 분석과 수치학적 분석을 실험으로 출력하고 결과값에 대해 비교·분석을 행하고자 한다. 최종적으로 실험을 통해 신장과 음성과의 관계를 정립하고자 한다.

Angle씨 II급 1류 부정교합아동의 발음에 관한 음향학적 연구 (AN ACOUSTIC ANALYSIS OF PRONUNCIATION IN CHILDREN WITH ANGLE'S CLASS II DIV. 1 MALOCCLUSION)

  • 박윤정;이상훈;손동수
    • 대한소아치과학회지
    • /
    • 제24권1호
    • /
    • pp.95-111
    • /
    • 1997
  • The human speech organ consists of respiration system (lung, larynx), phonation system (vocal cord), articulation system (esophagus, pharynx, uvula, teeth, gingiva, palate, tongue, lip) and resonating system(oral cavity, nasal cavity, paranasal sinus). Because teeth are components of the articulation system, it has been reported that the persons with abnormally positioned teeth generally have abnormal occlusion and pronunciation. In this study, using /ㅅ(s)/, the most commonly mispronunced consonant in children with malocclusion, and the seven single vowels, /사(sa), 서($s\delta$), 소(so), 수(su), 스($s\omega$), 시(si), 세(se)/ and / ㅏ(a), ㅓ($\delta$), ㅗ(o), ㅜ(u), ㅡ($\omega$), 1(i), ㅔ(e)/ were recorded and analyzed using speech analysis program on computer by measuring formants and compared them for investigating the differences in pronunciation in children with Angle's class I occlusions and those with Angle's class II div.1 malocclusion. The result were as follows: 1. In the Angle's Class II div.1 group, there were no significant differences in F1 of all recorded sounds as compared with Angle's Class I group(p>0.05). 2. In the consonants, there were significant differences in F2 of /스($s\omega$)/ and F2/F1 ratio of /사(sa), 서($s\delta$), 시(si)/ between the two group(p<0.05). 3. In the vowels, there were significant differences F2/F1 ratio of /ㅓ($\delta$)/(p<0.05) and no significant differences in F2/F1 ratio between two group(p>0.05). 4. In the consonants, there were significant differences in F2 and F2/F1 ratio when succeeding vowels were high or low, and F2/F1 ratio when front in accordance with tongue position (p<0.05). 5. In the vowels, there were no significant differences in formant in accordance with tongue position(p>0.05)

  • PDF

진동 데이터 기반 설비고장예지를 위한 신호처리기법 (A Signal Processing Technique for Predictive Fault Detection based on Vibration Data)

  • 송예원;이홍성;박훈석;김영진;정재윤
    • 한국전자거래학회지
    • /
    • 제23권2호
    • /
    • pp.111-121
    • /
    • 2018
  • 항공기 엔진, 풍력발전기, 모터 등 회전기기에서 발생하는 많은 문제들은 진동이나 소음과 같은 신호 데이터를 측정하여 이상감지를 할 수 있으며, 주파수 분석 등 여러 가지 신호처리가 데이터 전처리 단계에서 필요하다. 본 논문에서는 진동 데이터를 분석하여 설비 이상상태를 감지하는 기법을 소개한다. 정상상태 데이터를 기반으로 마할라노비스 거리를 측정하여 이상상태 유무를 모니터링 하는 방식을 사용한다. 특히 신호 데이터의 전처리 기법들을 도입하여 이상상태 감지의 성능을 개선할 수 있음을 보여준다. 전처리 단계에서 신호 데이터 수집 과정에서 발생한 누설오차(leakage)를 없애기 위해 해밍 윈도우(Hamming window)를 적용하고, 신호 데이터의 원신호인 포먼트(formant)를 분리하기 위하여 켑스트럼(cepstrum) 분석을 실시한다. IMS 베어링 진동 공개데이터를 대상으로 시간 구간별로 6가지 통계지표를 추출한 후 마할라노비스 거리 분류기를 적용하여 성능을 검증하였다. 제시된 신호처리 전처리 기법을 적용함으로써 성능이 획기적으로 향상되는 것을 실험에서 보여주었다.

언어습득기 이전 청각장애인의 후두소견 및 음성학적 특성 (Laryngeal Findings and Phonetic Characteristics in Prelingually Deaf Patients)

  • 김성태;윤태현;김상윤;최승호;남순열
    • 대한후두음성언어의학회지
    • /
    • 제20권1호
    • /
    • pp.57-62
    • /
    • 2009
  • Background and Objectives : There are few studies reported that specifically examine the laryngeal function in patients with profound hearing loss or deafness, This study was designed to examine videostroboscopic findings and phonetic characteristics in adult patients with prelingually deaf. Materials and Method: Sixteen patients (seven males, nine females) diagnosed as prelingually deaf aged from 19 to 54 years, and were compared with a 20 normal control group with no laryngeal pathology and normal hearing group, Videostroboscopic evaluations were rated by experienced judges on various parameters describing the structure and function of the laryngeal mechanism during comfortable pitch and loudness phonations. Acoustic analysis test were done, and a nasalance test performed to measure rabbit, baby, and mother passage. CSL were measured to determine the first and two formant frequencies of vowels /a/, /i/, /u/, Statistical analysis was done using Mann-Whitney U or Wilcoxon signed ranks test. Results: Videostroboscopic findings showed phase symmetry but significantly more occurrences decrement in the amplitude of vibration, mucosal wave, irregularity of the vibration and increased glottal gap size during the closed phase of phonation, In addition, group of prelingually deaf patients were observed to have significantly more occurrences of abnormal supraglottic activities during phonation. The percentage of shimmer in the group of prelingually deaf patients were higher than in the control group. Characteristics of vowels were lower of the second formant of the vowel /i/. Nasalance in prelingually deaf patients showed normal nasality for all passages, Conclusion: Prelingually deaf patients show stroboscopic abnormal findings without any mucosal lesion, suggesting that they have considerable functional voice disorder. We suggest that prelingually deaf adults should perform vocal training for normalized laryngeal function after cochlear implantation.

  • PDF