• 제목/요약/키워드: Formant Analysis

검색결과 191건 처리시간 0.023초

노인성 난청인의 음성특성에 관한 연구 (A study on speech analysis of person with presbycusis)

  • 이상민;송철규;우효창;이영묵;김원기
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1997년도 추계학술대회
    • /
    • pp.67-70
    • /
    • 1997
  • In this paper, we evaluated the character of speech of hearing impaired person (HIP) who acquire his hearing loss after the youth. It is usually observed that severe HIP decreased not only speech perception but also vocalization. so there is a need for sensitive and quantitative measures or the assesment of the speech of the HIP to serve both diagnostic and prognosic purposes, 7 HIP and 12 normal hearing person(NHP) were studied with pure tone test and speaking test using word/sentence table which consists of vowel(a:), mono and two syllables and a sentence. we analyzed formant frequency, pitch, sound intensity, speech duration of HIP and NHP speech. According to the results, in the HIP's speech we find that formant frequency was shifted, first-formant prominence was reduced, the dynamic range of sound intensity was decreased, speech duration was prolonged. In the next, we expect the correlation between hearing and speech character of HIP is cleared through analysis of more acoustic parameters and precise selection of HIP group.

  • PDF

수유행동시 모돈(랜드레이스×요크셔) 발성음의 개체 판별을 위한 음성 파라미터 (Sound parameters for classifying individual sows(Landrace×Yorkshire) during nursing behavior)

  • 전중환;장홍희;하정기;김현희;구자민;이효종;연성찬
    • 대한수의학회지
    • /
    • 제43권1호
    • /
    • pp.165-169
    • /
    • 2003
  • The aim of the present study was to analyse grunts of the sows and to extract parameters from the time and frequency signals in nursing behavior. Five crossbred $Landrace{\times}Yorkshire$ sows were used on day 5 or 6 postpartum. The grunts and the behaviors of the five sows were recorded with five digital camcorders. Three parameter groups [Group I: Formant vector alone, Group II: Formant vector+parameters from time signal, Group III: Formant vector+parameters from time signal-parameters eliminated by stepwise discriminant analysis backward (SDAB)] with parameter vectors extracted from single grunts in the maximum grunting rate period were used for individuality of the sows. The parameter groups were compared by a discriminant function analysis. The classification system adopted in the Group II represented the higher discriniation rate than those in other groups (Group I: 63.3%, Group II: 83.0%, Group III: 80.0%). This study demonstrated that formant, intensity, and pitch were available sound parameters for individuality of the sows during nursing behavior.

성전환자와 정상인이 발성한 모음의 음향분석과 지각실험 (An Acoustic Analysis and Perceptual Study of Korean Vowels Produced by Transgenders and Noraml Adults)

  • 조성미;정옥란
    • 음성과학
    • /
    • 제10권3호
    • /
    • pp.145-155
    • /
    • 2003
  • This study compared $F_{0}$ and the first three formants of eight Korean monophthongs produced by nine transgenders (male to female) to those of eighteen normal adults. Voice analysis was done by Praat (version 4.049). A one-way ANOVA with Tukey HSD post hoc tests were performed to determine statistical differences in $F_{0}$ and formant values obtained from transgenders, and normal male and female subjects. Results indicated that there was no significant difference in $F_{1}$ of /u/, /$\Lambda$/, and /o/, $F_{2}$ of /u/, /$\Lambda$/, and /i/ and $F_{3}$ of /u/ among the 3 groups (transgenders, normal males and normal females). However, in the comparison of transgenders vs. males, a significant difference was observed in $F_{0}$ of /o/, and $F_{2}$ of /i/, /a/, /e/, and /${\ae}$/ and $F_{3}$ of /e/. Furthermore, in the comparison of transgenders vs. females, a significant difference was also observed in $F_{0}$ of all vowels, $F_{1}$ of /i/, /$\alpha$/, /e/, /${\ae}$/, and /i/. $F_{2}$ of /i/, and /${\ae}$/, and $F_{3}$ of /i/, /$\alpha$/, /$\Lambda$/, /e/, /${\ae}$/, /i/, and /o/. Also, perceptual judgment of the transgenders' voice came out somewhat correlated strongly with their $F_{0}$ values but not much with the formant values. It was concluded that the transgenders' acoustic parameters are placed in between those of the normal males and females in. terms of fundamental and formant frequency analyses of vowels. Thus, it was assumed that those differences might stem from the transgenders' original big resonating cavities.

  • PDF

직.간접흡연 환경에서의 성대 및 음형대 변화에 대한 음성 분석학적 연구 (A Study on Voice Analytical the Vocal Cord and Formant Change in the Smoking and Secondhand Smoking Environments)

  • 김봉현;조동욱
    • 한국통신학회논문지
    • /
    • 제36권6B호
    • /
    • pp.720-727
    • /
    • 2011
  • 웰빙이 새로운 미래 사회적 이슈로 부각되면서 건강관리 및 유지에 대한 현대인들의 관심이 증대되고 있다. 특히, 흡연에 대한 좋지 않은 인식이 높아지면서 대대적인 금연 운동이 확산되고 있는 실정이다. 흡연은 인체의 호흡기와 순환기 등에 많은 악영향을 미치며 직접적인 흡연뿐만 아니라 간접흡연도 동일한 증상이 유발되는 치명적인 행위로 인식되고 있다. 따라서 본 논문에서는 직접흡연과 간접흡연 환경에서 성대 및 음형대에 미치는 영향을 음성 분석학적 요소 기술의 적용을 통해 비교, 분석하는 연구를 수행하였다. 이를 위해 20대 남성을 대상으로 흡연자와 비흡연자로 피실험자 집단을 구성하고 직 간접흡연 전과 후의 음성을 수집하여 Pitch, Jitter, Shimmer 및 5~8 Formant Frequency를 적용한 실험 결과를 추출, 분석하는 연구를 수행하였다.

음성 신호의 다구간 에너지 차를 이용한 새로운 프리엠퍼시스 방법에 관한 연구 (A Study on a New Pre-emphasis Method Using the Short-Term Energy Difference of Speech Signal)

  • 김동준;김주리
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제50권12호
    • /
    • pp.590-596
    • /
    • 2001
  • The pre-emphasis is an essential process for speech signal processing. Widely used two methods are the typical method using a fixed value near unity and te optimal method using the autocorrelation ratio of the signal. This study proposes a new pre-emphasis method using the short-term energy difference of speech signal, which can effectively compensate the glottal source characteristics and lip radiation characteristics. Using the proposed pre-emphasis, speech analysis, such as spectrum estimation, formant detection, is performed and the results are compared with those of the conventional two pre-emphasis methods. The speech analysis with 5 single vowels showed that the proposed method enhanced the spectral shapes and gave nearly constant formant frequencies and could escape the overlapping of adjacent two formants. comparison with FFT spectra had verified the above results and showed the accuracy of the proposed method. The computational complexity of the proposed method reduced to about 50% of the optimal method.

  • PDF

개별 음향 정보를 이용한 화자 확인 알고리즘 성능향상 연구 (The Study for Advancing the Performance of Speaker Verification Algorithm Using Individual Voice Information)

  • 이재형;강선미
    • 음성과학
    • /
    • 제9권4호
    • /
    • pp.253-263
    • /
    • 2002
  • In this paper, we propose new algorithm of speaker recognition which identifies the speaker using the information obtained by the intensive speech feature analysis such as pitch, intensity, duration, and formant, which are crucial parameters of individual voice, for candidates of high percentage of wrong recognition in the existing speaker recognition algorithm. For testing the power of discrimination of individual parameter, DTW (Dynamic Time Warping) is used. We newly set the range of threshold which affects the power of discrimination in speech verification such that the candidates in the new range of threshold are finally discriminated in the next stage of sound parameter analysis. In the speaker verification test by using voice DB which consists of secret words of 25 males and 25 females of 8 kHz 16 bit, the algorithm we propose shows about 1% of performance improvement to the existing algorithm.

  • PDF

성악다들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여- (Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study)

  • 반재호;권영경;진성민
    • 대한후두음성언어의학회지
    • /
    • 제15권1호
    • /
    • pp.31-36
    • /
    • 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In previous study, authors showed that in trained tenors, besides the conventional singer's formant in the region of ,5500Hz, another energy peak was observed in the region of 8,000Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have as yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos and five mezzo sopranos attending the music college, department of vocal music who could reliably produce the head and chest registers were chosen for this study. Each subject was asked to produce an/ah/sound for at least three seconds for the head register sound(tenors ; G4, barions ; E4 sopranos ; F5 and mezzosopranos ; C5) and for the chest register sound (tenors ; C3, baritones ; D3, sopranos ; D4 and Mezzosoprano ; A3). The sound data was analyzed using the Fast Fourier Transform (FFT)-based power spectrum, Long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab (CSL, Kay elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social sciences(SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400Hz region(p<0.05) and the Similar to the head register sounds, there was a significant increase in energy in the four trained singer group compared with the untrained group in the 2,200-3,100Hz region(p<0.05), the 7,800-8,400Hz region(p<0.05) for the chest register sounds. Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500Hz, a frequency already known as the "singer's formant', in all subjects in the study group. Another region of increased energy was observed around 8,000Hz that had not been noticed previously. The authors believe this region to be the second singer's formant.

  • PDF

구개상의 두께에 따른 한국어 자음의 발음 변화에 관한 컴퓨터 분석 - 치조음, 경구개음- (A COMPUTER ANALYSIS ON THE KOREAN CONSONANT SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -Dentoalveolar and hard palatal consonant-)

  • 우이형;최대균;최부병;박남수
    • 대한치과보철학회지
    • /
    • 제25권1호
    • /
    • pp.71-94
    • /
    • 1987
  • This study was carried out to investigate the sound distortion following the alternation of the palatal plate thickness. For this study, 2 healthy male subjects (24-year-old) were selected. Born in Seoul, they both spoke Seoul dialect. First, their sounds of /na(나)/, /da(다)/, /1a(라)/, /ja(자)/, /cha(차)/, /ta(타)/, without inserting plates were recorded, and then the sounds with palatal plates of different thickness were recorded, successively. The plate was fabricated in 3 types, each palatal thickness being 1.0mm, 2.5mm, dentoalveolar portion 2.5mm, other residual portion was 1.0mm, successively. Each type plates named B, C, D-type, in succession. Series of analysis were administered through Computer(16 bit) to analyze the sound distortions. These experiments were analyzed by the LPC (without weighting, pre-weighting, post-weighting) of the consonants, vowels portion, formant frequency of the vowels and word duration of the consonants. The findings led to the following conclusions: 1. There was no correlation of the distortion rate on the 2 informants. 2. Generally, vowels were not affected by the palatal plate thickness in the formant analysis, however, more distortion was detected in the LPC analysis, especially C, D-type plates. 3. Consonants distortion was more evident in the C, D-type plate. 4. The second formant was most disturbed and reduced in the all consonants with insertion of the palatal plate, especially C, D-type plate. 5. Word duration was shortened in the plate inserted(except /ja/, /cha/), especially C, D-type. 6. It was found that dentoalveolar, hard palatal sounds were severely distorted in plate inserted, and they were mainly affected by the dentoalveolar portion thickness. 7. There was correlation between palatal thickness and consonants quality.

  • PDF

성대형태 및 음향발현에서 성악 발성 및 판소리 발성의 비교 연구 (A Comparative Study of Western Singer's Voice and a Pansori Singer's Voice Based on Glottal Image and Acoustic Characteristics)

  • 김선숙
    • 음성과학
    • /
    • 제11권2호
    • /
    • pp.165-177
    • /
    • 2004
  • Western singers voice have been studied in music science since the early 20th century. However, Korean traditional singers voice have not yet been studied scientifically. This study is to find the physiological and acoustic characteristics of Pansori singers voices. Western singers participated for comparative purposes. Ten western singers and ten Pansori singers participated in this study. The subjects spoke and sung seven simple vowels /a, e, i, o, u, c, w/. An analysis of Glottal image was done by Scope View and acoustic characteristics of speech and singing voice were analyzed by CSL. The results are as follows: (1) Glottal gestures of Pansori singers showed asymmetric vocal folds. (2) Singing vowel formants of Pansori singers showed breathiness based on Spectrogram. (3) Music formant of western singers appeared in around 3kHz area, however, Pansori singers formant appeared in low frequency area. Modulation of vibrato showed 6 frequency per sec in case of western singers. Pansori singers showed no deep modulation of vibrato on spectrogram.

  • PDF

한국인남성과 미국인남성이 발음한 영어 긴장.이완모음의 음향적 비교 (An Acoustical Comparison of English Tense and Lax Vowels Produced by Korean and American Males)

  • 양병곤
    • 음성과학
    • /
    • 제15권4호
    • /
    • pp.19-27
    • /
    • 2008
  • Several studies on the pronunciation of English vowels point out that Korean learners have difficulty distinguishing English tense and lax vowel pairs. The acoustic comparisons of those studies are mostly based on the formant measurement at one time point of a given vowel section. However, the English lax vowels usually show dynamic changes across their syllable peaks and subjects' English levels account for various conflicting results. The purposes of this paper are to compare the temporal duration and dynamic formant tracks of English tense and lax vowel pairs produced by five Korean and five American males. The subjects were graduate students of an American state university. Results showed that both the Korean and American males produced the vowels with comparable durations. The duration of the front tense-lax vowel pair was longer than that of the back vowel pair. From the formant track comparisons, the American males produced the tense and lax pairs much more distinctly than the Korean male speakers. The results suggest that the Korean males should pay attention to the F1 and F2 movements, i.e., the jaw and tongue movements, in order to match those of the American males. Further studies are recommended on the auditorily acceptable ranges of F2 variation for the lax vowels.

  • PDF