• Title/Abstract/Keywords: Formant Analysis

191 search results (processing time: 0.035 s)

An Objective Study of Sasang Constitution Diagnosis by Voiceprint (Sound) Analysis (I)

  • 김달래;박성식;권기록
    • 사상체질의학회지 / Vol. 10, No. 1 / pp. 65-80 / 1998
  • As part of an objective study of Sasang constitution diagnosis by sound analysis using the Computerized Speech Lab (CSL), we verified the confidence level of the Questionnaire of Sasang Constitution Classification II (QSCC II) and obtained first results from a sound analysis of the correlation between physical and sound characteristics. The results are as follows. 1. The stated confidence level of QSCC II is 70.8% for Soeumin, 60.8% for Soyangin, 74.5% for Taeumin, and 70.08% in total. However, when verified on 100 subjects, the confidence level was 55.10% for Soeumin, 30.77% for Soyangin, 80.00% for Taeumin, and 55.29% in total, which does not agree with the stated QSCC II confidence level of 70.8%. 2. For another 134 subjects who received a thorough explanation before constitutional diagnosis by QSCC II, the confidence level was 71.08% for Soeumin, 54.76% for Soyangin, 81.82% for Taeumin, and 69.22% in total. 3. Body mass index (B.M.I.) differed significantly (P<0.001) between Taeumin and Soeumin, and between Taeumin and Soyangin. 4. Height and weight influence the fundamental frequency and the formant frequency. 5. Amplitude in the sound analysis differed among the constitutions. These results suggest that differences among the constitutional groups can be found by analyzing their sound characteristics.


Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers

  • 이경철;홍석진;진성민
    • 대한후두음성언어의학회지 / Vol. 15, No. 2 / pp. 122-127 / 2004
  • Background and Objectives: Many studies have described and analyzed the singer's formant, and it has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods: Ten tenors, fifteen baritones, fifteen sopranos, and ten mezzo-sopranos attending the department of vocal music of a music college were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social Sciences (SPSS). Results: In the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500 Hz region (p<0.01) in all four trained singer groups compared with the untrained speaker group, and a significant increase in the 9,000-10,000 Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', energy in the 2,500-3,500 Hz region increased significantly (p<0.01) in the tenor, baritone, and mezzo-soprano groups compared with the untrained speaker group, and the soprano group showed a significant increase in all frequency regions (p<0.01). Conclusions: The LTA power spectrum suggests that trained singers show more energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.
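As a rough illustration of the LTA power spectrum described in this abstract, the sketch below averages FFT power spectra over overlapping Hann-windowed frames and compares the mean level in the 2,500-3,500 Hz band with a reference band. It is not the CSL algorithm: the function names (`ltas`, `band_level_db`), the frame length, the hop size, the reference band, and the synthetic two-tone signal are all assumptions made for the example.

```python
import cmath
import math

def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def ltas(signal, frame_len=256, hop=128):
    """Long-term average power spectrum: mean power per bin over all frames."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
              for i in range(frame_len)]
    n_bins = frame_len // 2 + 1
    acc = [0.0] * n_bins
    n_frames = 0
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        spec = fft(frame)
        for k in range(n_bins):
            acc[k] += abs(spec[k]) ** 2
        n_frames += 1
    if n_frames == 0:
        raise ValueError("signal is shorter than one frame")
    return [a / n_frames for a in acc]

def band_level_db(power, fs, frame_len, f_lo, f_hi):
    """Mean power in dB over bins whose centre frequency lies in [f_lo, f_hi]."""
    band = [p for k, p in enumerate(power) if f_lo <= k * fs / frame_len <= f_hi]
    return 10 * math.log10(sum(band) / len(band) + 1e-20)

# Synthetic stand-in for a voice sample (assumed): 200 Hz plus a 3,000 Hz component.
fs = 20000
sig = [math.sin(2 * math.pi * 200 * n / fs)
       + 0.5 * math.sin(2 * math.pi * 3000 * n / fs) for n in range(4096)]
power = ltas(sig)
ring_db = band_level_db(power, fs, 256, 2500, 3500)  # band examined in the abstract
ref_db = band_level_db(power, fs, 256, 5000, 6000)   # arbitrary comparison band
```

With real recordings, `ring_db` would be computed per subject and the groups then compared statistically, as the abstract describes.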


A Proposal for Effect Analysis Techniques of Kidney Hand Acupuncture through Face Image and Voice Signal Measurement

  • 김봉현;조동욱
    • 한국통신학회논문지 / Vol. 37, No. 3C / pp. 217-223 / 2012
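  • This paper proposes a technique for analyzing the effect of hand acupuncture stimulation at the point corresponding to the kidney by measuring changes in face images and voice signals. Face images and voice recordings were collected before and after kidney hand acupuncture stimulation. In the image analysis experiment, the color change of the chin (jigak) region, the facial area associated with the kidney, was measured. In the voice analysis experiment, changes in the first formant frequency bandwidth and the shimmer value, the voice features associated with the kidney, were measured. The experiments showed that the blackness of the chin region, the first formant frequency bandwidth, and the shimmer measurement all decreased after kidney hand acupuncture stimulation. Finally, the authors aim to demonstrate the effect of kidney hand acupuncture objectively through a statistical significance analysis of the face image and voice signal measurements.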

Text-Independent Speaker Identification System Based On Vowel And Incremental Learning Neural Networks

  • Heo, Kwang-Seung;Lee, Dong-Wook;Sim, Kwee-Bo
    • 제어로봇시스템학회: Conference Proceedings / 제어로봇시스템학회 ICCAS 2003 / pp. 1042-1045 / 2003
  • In this paper, we propose a speaker identification system that uses vowels, which carry speaker-specific characteristics. The system is divided into a speech feature extraction part and a speaker identification part. The feature extraction part extracts the speaker's features. Voiced speech has characteristics that distinguish speakers. For vowel extraction, formants obtained from voiced speech through frequency analysis are used. The vowel /a/, which has distinct formants, is extracted from the text. Pitch, formants, intensity, log area ratios, LP coefficients, and cepstral coefficients are candidate features. The cepstral coefficients, which show the best speaker identification performance among these, are used. The speaker identification part distinguishes speakers using a neural network. Twelfth-order cepstral coefficients are used as the training input. The network is a multilayer perceptron (MLP) trained with backpropagation (BP). Hidden nodes and output nodes are added incrementally. The nodes in the incremental learning neural network are interconnected via weighted links, and each node in a layer is generally connected to each node in the succeeding layer, leaving the output node to provide the network output. Through vowel extraction and incremental learning, the proposed system needs little training data, reduces training time, and improves the identification rate.


A Study on the Technique of Spectrum Flattening for Improved Pitch Detection

  • 강은영;배명진;민소연
    • 한국음향학회지 / Vol. 21, No. 3 / pp. 310-314 / 2002
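  • In speech signal processing fields such as speech recognition, synthesis, and analysis, it is important to detect the fundamental frequency, that is, the pitch, accurately. However, accurate pitch detection from speech signals is very difficult because of the influence of formants and of transition amplitudes. This paper therefore detects pitch after removing the influence of formants by flattening the spectrum in the frequency domain, where phoneme transitions and variations have little effect. A new spectrum flattening technique is proposed and compared with the existing LPC and cepstrum methods to evaluate how much it improves on them. The pitch detection results obtained with each method show that the proposed method is superior.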
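For orientation, here is a minimal sketch of the conventional cepstrum baseline that the abstract compares against. It is not the paper's proposed flattening technique, whose details the abstract does not give. Taking the log of the magnitude spectrum separates the slowly varying formant envelope (low quefrency) from the harmonic ripple caused by pitch (a peak at the pitch period). The function name `cepstral_pitch`, the 60-400 Hz search range, and the synthetic pulse train are assumptions for illustration.

```python
import cmath
import math

def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even, odd = fft(x[0::2]), fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out

def cepstral_pitch(frame, fs, f_min=60.0, f_max=400.0):
    """Estimate the pitch (Hz) of one frame from the peak of its real cepstrum."""
    n = len(frame)
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]
    spec = fft([s * w for s, w in zip(frame, window)])
    log_mag = [math.log(abs(c) + 1e-12) for c in spec]
    # log_mag is real and even, so its inverse DFT equals its forward DFT / n.
    cep = [c.real / n for c in fft(log_mag)]
    q_lo = int(fs / f_max)                  # shortest pitch period, in samples
    q_hi = min(int(fs / f_min), n // 2)     # longest pitch period, in samples
    q_peak = max(range(q_lo, q_hi + 1), key=lambda q: cep[q])
    return fs / q_peak

# Synthetic glottal-like pulse train (assumed): one pulse every 80 samples at 8 kHz.
fs = 8000
frame = [1.0 if i % 80 == 0 else 0.0 for i in range(1024)]
f0_est = cepstral_pitch(frame, fs)
```

On real speech the cepstral peak is less clean than on this pulse train, which is the kind of difficulty the abstract attributes to formants and transitions.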

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers - Possibility of a Second Singer's Formant-

  • Jin, Sung-Min;Kwon, Young-Kyung;Song, Yun-Kyung
    • 음성과학 / Vol. 10, No. 2 / pp. 99-109 / 2003
  • The purpose of this study was to analyze and compare the head and chest registers of singers acoustically. Fifteen healthy students majoring in tenor voice participated. Fifteen healthy untrained adults were chosen as the control group. The long term average (LTA) power spectrum, computed with the Fast Fourier Transform (FFT) algorithm, and the linear predictive coding (LPC) filter response were obtained for a sustained /a/ in both the head register (G4, 392 Hz) and the chest register (C3, 131 Hz). Statistical analysis was performed using the Mann-Whitney test. In the LTA power spectrum, the singers' head register showed increased energy at 2.2-3.4 kHz (p<0.01) and at 7.5-8.4 kHz (p<0.01, p<0.05). The singers' chest register showed increased energy at 2.2-3.1 kHz (p<0.01), at 7.8-8.4 kHz (p<0.05), and around 9.6 kHz (p<0.01). The LTA power spectrum revealed a peak of acoustic energy around 2,500 Hz, known as the singer's formant, and another peak around 8,000 Hz in the singers' voices.
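The group comparison named in this abstract, the Mann-Whitney test, can be sketched as follows. This is an illustrative implementation only: it computes the U statistic with midranks for ties and a two-sided p-value from the normal approximation, without tie or continuity correction, so statistical packages may report slightly different p-values. The function name `mann_whitney_u` and the band-level figures are made up for the example and are not taken from the study.

```python
import math

def mann_whitney_u(x, y):
    """Return (U, two-sided p) using midranks and the normal approximation."""
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    ranks = [0.0] * len(pooled)
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        midrank = (i + j) / 2 + 1          # average of 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = midrank
        i = j + 1
    n1, n2 = len(x), len(y)
    rank_sum_x = sum(r for r, (_, group) in zip(ranks, pooled) if group == 0)
    u_x = rank_sum_x - n1 * (n1 + 1) / 2
    u = min(u_x, n1 * n2 - u_x)
    mean = n1 * n2 / 2
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean) / sd
    p = math.erfc(abs(z) / math.sqrt(2))   # two-sided tail of the standard normal
    return u, p

# Hypothetical band levels in dB (invented numbers, for illustration only).
singers_db = [-18.2, -17.5, -19.0, -16.8, -17.9]
controls_db = [-24.1, -23.3, -25.0, -22.7, -24.6]
u_stat, p_value = mann_whitney_u(singers_db, controls_db)
```

For samples as small as the fifteen-subject groups here, an exact test is usually preferable to the normal approximation used in this sketch.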


The Effect of Tonsillectomy and Adenoidectomy on Acoustic Factors

  • 임성태;손진호;유정운;강지원;이현석;신승헌;박재율
    • 대한후두음성언어의학회지 / Vol. 9, No. 1 / pp. 38-42 / 1998
  • Tonsillectomy and adenoidectomy (T & A) have been reported to change the voice by directly altering the structure of the vocal tract. We studied the effect of T & A on patients' voices by comparing pre-operative and post-operative recordings, using the Computerized Speech Lab (CSL50), which is currently used for voice analysis. Forty-five patients who underwent T & A, aged 3 to 42 years, took part in the study and were evaluated for voice changes and for the degree of formant change in four basic vowels, /a/, /i/, /o/, and /u/. They were evaluated pre-operatively and one month post-operatively using the MDVP and CSL programs of the CSL50. The results were as follows. With MDVP, shimmer differed somewhat between the pre-operative and post-operative measurements, though within the normal range, while the other acoustic measures (Fo, jitter, NHR) showed no significant differences (p>0.05). On the CSL spectrogram, in patients who had tonsillectomy only, F3 of /a/ and /o/ decreased significantly (p<0.05) and F2 and F3 of /i/ increased (p>0.05). In patients who had T & A, F1 and F3 of /a/, F3 of /i/, and F1, F2, and F3 of /o/ decreased, with a significant increase in F1 and F2 of /i/ (p<0.05).


A Study on Standards for Farm Housing Systems: Acoustic Analysis of Feed Anticipating Calls of Heifers and Cows

  • 천시내;이준엽;양승학;박규현;전중환
    • 한국축산시설환경학회지 / Vol. 20, No. 1 / pp. 21-26 / 2014
  • The goal of this study was to investigate the acoustic characteristics of feed-anticipating calls of heifers and cows. Six cows and six heifers were housed in a pen (6.0 m × 10.0 m) bedded with sawdust and straw. They were fed a standard ration of commercial concentrate, with hay available ad libitum. The calls of heifers and cows were divided into Type 1 and Type 2 based on the shapes of their waveforms and spectrograms. The fundamental frequency (P < 0.0001) and the first formant (P < 0.0077) differed among the calls. Apart from the fundamental frequency and the first formant, the acoustic parameters did not differ between cows' calls and heifers' calls (P > 0.05). The duration of cows' calls was shorter than that of heifers' calls, whereas the intensity of Type 1 calls was higher than that of Type 2 calls (P > 0.05).

Acoustic and Stroboscopic Characteristics of Normal Persons' Voices with Advancing Age

  • 진성민;권기환;강현국
    • 대한후두음성언어의학회지 / Vol. 8, No. 1 / pp. 44-48 / 1997
  • Anatomic and physiological changes of the larynx with advancing age result in morphologic changes of the vocal folds and reduced control of the phonatory mechanism in elderly individuals, and are reflected in increased instability of the fundamental frequency (Fo). The purpose of this study is to improve current understanding of the acoustic and stroboscopic characteristics of normal elderly persons' voices. First, sustained /a/ vowel productions by 40 normal adults (20 to 40 years; 20 men and 20 women) and 40 normal elderly persons (60 to 80 years; 20 men and 20 women) were analyzed using CSL (model 4300B) acoustic analysis software to obtain acoustic measures related to fundamental frequency stability and vocal resonance characteristics. Second, stroboscopic images of vocal fold behavior in all subjects were analyzed by experienced specialists. In the men, fundamental frequency variation (vFo) (p<0.01), jitter (p<0.05), and shimmer (p<0.05) were significantly higher in the older group than in the adult group. In the stroboscopic findings, vocal fold edema was a significant finding in aged men (15%). In the women, vFo (p<0.05), jitter (p<0.05), and noise to harmonic ratio (NHR) (p<0.05) were significantly higher in the older group than in the adult group, and the first formant frequency (F1) (p<0.01) and second formant frequency (F2) (p<0.01) were significantly lower in the older group than in the adult group. In the stroboscopic findings, vocal fold atrophy was a significant finding in aged women (25%). Frequency stability, as reflected by vFo, jitter, shimmer, and NHR, decreases with advancing age in men and women, and spectral analysis of sustained /a/ vowel productions reveals a lowering of F1 and F2 with advancing age, especially in aged women. Change in the mass of the vocal folds, due to atrophy or edema, is considered to be the greatest factor in these acoustic changes.
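To make the stability measures in this abstract concrete, the sketch below computes jitter and shimmer in their common "local" form: the mean absolute difference between consecutive glottal periods (jitter) or peak amplitudes (shimmer), expressed as a percentage of the mean. The CSL software may use different variants, so treat this as an assumption. The function name `local_perturbation` and the period and amplitude values are invented for illustration.

```python
def local_perturbation(values):
    """Mean absolute difference of consecutive values, as a percentage of their mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(a - b) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value

# Hypothetical cycle-by-cycle measurements from a sustained /a/ (made-up numbers).
periods_s = [0.0100, 0.0102, 0.0099, 0.0101, 0.0100]   # glottal period per cycle, seconds
peak_amps = [0.80, 0.78, 0.82, 0.79, 0.81]             # peak amplitude per cycle

jitter_pct = local_perturbation(periods_s)    # cycle-to-cycle period variation
shimmer_pct = local_perturbation(peak_amps)   # cycle-to-cycle amplitude variation
mean_f0_hz = len(periods_s) / sum(periods_s)  # reciprocal of the mean period
```

Higher values of either measure indicate less stable phonation, which is the direction of change the abstract reports for the older groups.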


Analysis on Vowel and Consonant Sounds of Patient's Speech with Velopharyngeal Insufficiency (VPI) and Simulated Speech

  • 성미영;김희진;권택균;성명훈;김우일
    • 한국정보통신학회논문지 / Vol. 18, No. 7 / pp. 1740-1748 / 2014
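  • This paper carries out listening evaluations and acoustic analyses of speech from patients with velopharyngeal insufficiency (VPI) and of simulated VPI speech produced by normal speakers. To collect the speech data, a pronunciation list of 50 words, vowels, and monosyllables was defined. A web-based listening evaluation system was built to facilitate the listening tests. The listening results show that the misrecognition tendencies for actual VPI patients' speech are similar to those for the simulated speech. This similarity is also confirmed by comparing vowel formant positions and consonant spectra. The results indicate that the simulated VPI speech technique used in this study mimics actual patients' speech relatively effectively. The simulated speech data from normal speakers is expected to be useful in areas such as acoustic model adaptation for recognizing VPI patients' speech.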