• Title/Abstract/Keywords: formant bandwidth

35 search results (processing time: 0.018 s)

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth

  • 박성진;김달래
    • 사상체질의학회지 / Vol. 16, No. 1 / pp.61-73 / 2004
  • This study investigated the correlation between Sasang constitutional groups and voice characteristics using a voice analysis system (CSL), focusing on harmonics, formant frequency, and formant bandwidth. The subjects were 71 males, classified into three groups (Soeumin, Soyangin, and Taeumin) by two methods: the QSCCII (Questionnaire for the Sasang Constitution Classification II) and an interview with a specialist in Sasang constitution. This yielded 31 Soeumin, 18 Soyangin, and 22 Taeumin subjects. Pitch is approximately the fundamental frequency (F0) of the voice. Shimmer in dB measures the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. The FFT (Fast Fourier Transform) method in CSL displays a sampled voice as harmonics; h1 is the first harmonic peak and h2 is the second. The amplitude difference between h1 and h2 (h1-h2) reflects the speaker's phonation type, while formant frequency and bandwidth reflect the speaker's vocal tract. Harmonics, formant frequency, and formant bandwidth were therefore used as the voice parameters. The vowel /e/ was recorded from all subjects with a microphone and analyzed with CSL, whose Power Spectrum and Formant History menus display harmonics and formant frequency and bandwidth. The results on the correlation between Sasang constitutional groups and voice parameters are as follows: 1. There was no significant difference in harmonic amplitude (h1-h2) among the three groups. 2. There was a significant difference between the Soeumin and Soyangin groups in Formant Frequency 1 and Formant Bandwidth 1 (p<0.05). No other parameter showed a significant difference.
I infer from the formant bandwidth difference that the Soyangin group has a clearer and brighter voice than the Soeumin group, a result that agrees with "Dongyi Suse Bowon" and "Sasangimhejinam".

  • PDF
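As a rough illustration of the h1-h2 measure described in the abstract above, the following sketch computes the level difference between the first two harmonics of a synthetic signal. It is not the CSL procedure: the function names, the 8 kHz sample rate, the 100 Hz fundamental, and the harmonic amplitudes are all assumptions chosen for the example, and the fundamental is taken as known rather than estimated.

```python
import math


def harmonic_amplitude_db(signal, sample_rate, freq_hz):
    """Amplitude in dB of one sinusoidal component, via a single-frequency DFT."""
    n = len(signal)
    step = 2.0 * math.pi * freq_hz / sample_rate
    real = sum(x * math.cos(step * i) for i, x in enumerate(signal))
    imag = sum(x * math.sin(step * i) for i, x in enumerate(signal))
    amplitude = 2.0 * math.hypot(real, imag) / n
    return 20.0 * math.log10(amplitude)


def h1_minus_h2_db(signal, sample_rate, f0_hz):
    """Level of the first harmonic minus level of the second harmonic, in dB."""
    h1 = harmonic_amplitude_db(signal, sample_rate, f0_hz)
    h2 = harmonic_amplitude_db(signal, sample_rate, 2.0 * f0_hz)
    return h1 - h2


# Synthetic "voice": fundamental with amplitude 1.0, second harmonic with 0.5.
# 800 samples at 8 kHz hold exactly 10 periods of 100 Hz, so there is no leakage.
sample_rate = 8000
f0 = 100.0
signal = [
    1.0 * math.sin(2.0 * math.pi * f0 * i / sample_rate)
    + 0.5 * math.sin(2.0 * math.pi * 2.0 * f0 * i / sample_rate)
    for i in range(800)
]

difference_db = h1_minus_h2_db(signal, sample_rate, f0)
print(round(difference_db, 2))  # an amplitude ratio of 2 corresponds to about 6.02 dB
```

On real recordings the fundamental would first have to be estimated, and the frame would normally be windowed, so this only shows the arithmetic behind the h1-h2 value.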

Speech Recognition Using Formant Bandwidth Normalization

  • 홍종진;강석건;박군작;박규태
    • 한국통신학회논문지 / Vol. 16, No. 5 / pp.458-467 / 1991
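  • This paper analyzes the problems of conventional linear prediction from the viewpoints of the linear prediction coefficients, the pole positions of the AR filter, and the formant bandwidth, and analyzes the influence of the glottal reflection coefficient according to vocal tract estimation theory. Based on this analysis, the formant bandwidth normalization method is refined. Setting the glottal reflection coefficient to 1 normalizes out the influence of the glottis and yields a spectrum in which the formants are optimally emphasized. The resulting linear prediction coefficients are symmetric front to back, and their standard deviation is larger than that of the coefficients obtained without changing the glottal reflection coefficient; this showed that the bit rate in speech coding can be reduced to 50% while the amount of information is preserved. Using this method of normalizing the formant bandwidth to 0, an experiment on recognizing five Korean vowels by their formants in a noisy environment achieved a recognition rate of 96.7%.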

  • PDF
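The link between AR-filter pole positions and formant bandwidth mentioned in the abstract above can be sketched with the standard relations F = θ·fs/(2π) and B = -ln(r)·fs/π for a pole at radius r and angle θ. The code below is only an illustration of those relations: the helper names and the example formant (500 Hz, 80 Hz bandwidth, 8 kHz sampling) are assumptions, and projecting a pole onto the unit circle is a simplified stand-in for zero-bandwidth normalization, not the paper's reflection-coefficient procedure.

```python
import cmath
import math


def formant_to_pole(freq_hz, bandwidth_hz, sample_rate):
    """Place a z-plane pole for a resonance with the given center frequency and bandwidth."""
    radius = math.exp(-math.pi * bandwidth_hz / sample_rate)
    angle = 2.0 * math.pi * freq_hz / sample_rate
    return cmath.rect(radius, angle)


def pole_to_formant(pole, sample_rate):
    """Recover (frequency, bandwidth) in Hz from a z-plane pole."""
    freq_hz = abs(cmath.phase(pole)) * sample_rate / (2.0 * math.pi)
    bandwidth_hz = -math.log(abs(pole)) * sample_rate / math.pi
    return freq_hz, bandwidth_hz


sample_rate = 8000.0
pole = formant_to_pole(500.0, 80.0, sample_rate)
freq_hz, bandwidth_hz = pole_to_formant(pole, sample_rate)
print(round(freq_hz, 3), round(bandwidth_hz, 3))

# Illustrative "zero-bandwidth" version: same angle, radius forced to 1.
sharpened = pole / abs(pole)
sharp_freq_hz, sharp_bandwidth_hz = pole_to_formant(sharpened, sample_rate)
print(round(sharp_freq_hz, 3), round(abs(sharp_bandwidth_hz), 3))
```

Moving the pole toward the unit circle leaves the formant frequency unchanged and shrinks the bandwidth, which is why a bandwidth normalized toward 0 produces sharper spectral peaks.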

The Articulation Characteristics of the Profound Hearing-Impaired Children with Reference to Formant Bandwidth

  • 최은아
    • 말소리와 음성과학 / Vol. 6, No. 2 / pp.55-64 / 2014
  • This study measured the formant bandwidths of children with profound hearing impairment and examined the characteristics of their articulation. Ten children with cochlear implants (CI), 10 children with hearing aids (HA), and 10 children with normal hearing (NH) were asked to read 7 Korean vowels (/ɑ, ʌ, o, u, ɯ, i, ɛ/). The readings were recorded with NasalView and analyzed with Praat. Formant bandwidth analysis explains the degree of vocal fold opening and the characteristics of radiation. The analysis shows that the hearing-impaired maintain vocal fold tension when producing high vowels, and it reveals their radiation characteristics. A narrower B1 indicates better-maintained vocal fold tension, a wider B2 a more fronted articulation, and a wider B3 rounder lips. B1 was widest in the CI group and narrowest in the NH group, and wider in females than in males. Among vowels, B1 was widest for /ɑ/ and narrowest for /i/. B2 was wider in the HA and NH groups than in the CI group, wider in females than in males, widest for /i/, and narrowest for /ʌ/. B3 was widest in the NH group and narrowest in the CI group, wider in males than in females, widest for /o/, and narrowest for /ɛ/. In summary, first, the B1 analysis indicates that the NH group and males maintained vocal fold tension better than the hearing-impaired groups and females, and that all children articulated /i/ with more vocal fold tension than other vowels. Second, the B2 analysis indicates that the NH and HA groups articulated vowels with weaker rounding than the CI group, and females with weaker rounding than males. Third, the B3 analysis indicates that the NH group articulated vowels with rounder lips than the HA or CI groups, and males with rounder lips than females. These results suggest that formant bandwidth analysis can be applied to articulation therapy for hearing-impaired speakers with hearing aids or cochlear implants.

A study on the correlation between sound characteristic and sasang constitution by pitch range and bandwidth

  • 양상묵;김선형;유준상;김형석;이영훈;김달래
    • 사상체질의학회지 / Vol. 13, No. 3 / pp.31-39 / 2001
  • Bandwidth and pitch range are important in phonetics both for distinguishing speech sounds and for distinguishing an individual's manner of speaking. If each constitution has a characteristic voice, these measures are therefore useful for judging constitution. In this report we examine the relationship between constitution and formant bandwidth, pitch range, and the number of syllables per minute, with the aim of making constitution judgment objective. 1. Formant bandwidth differed somewhat between constitutions, but the differences were not statistically significant. 2. Pitch range differed somewhat between constitutions, but the differences were not statistically significant. 3. The number of syllables per minute differed somewhat between constitutions, but the differences were not statistically significant. In summary, the constitutions differed in formant bandwidth, pitch range, and syllables per minute, but none of the differences reached statistical significance. With a larger sample and less noise, however, significant differences may well be found.

  • PDF

Acoustic Characteristics of Nasal Consonants and the Change of Nasalance according to the Sites of Nasal Obstruction

  • 손영익;정유석;이은경;정원호
    • 대한후두음성언어의학회지 / Vol. 9, No. 1 / pp.27-31 / 1998
  • Nasal sounds include nasalized vowels and consonants, and the nasal cavity is important for their acoustics. Evaluating the effects of site-specific nasal obstruction on nasal sounds helps us understand the importance of nasal geometry for these sounds and predict voice change after nasal surgery. This study was designed to analyze the change in nasality and in the formant characteristics of nasal sounds when different sites around the ostiomeatal unit (OMU) are obstructed. Ten adult male and female volunteers participated. The nasal formants and bandwidths of the nasal consonant /n/ were measured under various conditions of nasal obstruction, and the nasalance of the rabbit, baby, and mama passages was compared across conditions. Nasalance of all passages decreased when the anterior portion of the OMU was obstructed. The center frequency of the first nasal formant (NF1) of /n/ decreased in the order of anterior, then inferior obstruction. The bandwidth of NF1 decreased in females with anterior obstruction. The anterior portion of the OMU is the most critical site for changes in nasality and in the acoustics of the nasal consonant. When it is obstructed, the major acoustic changes in the nasal consonant /n/ are a shift of NF1 to a lower frequency and a narrowing of the NF1 bandwidth.

  • PDF

On a Pitch Detection using Low Pass Filter with Variable Bandwidth Preprocessed

  • 한진희
    • 한국음향학회: Conference Proceedings / 한국음향학회, Proceedings of the 12th Workshop on Speech Communication and Signal Processing, 1995 (SCAS Vol. 12, No. 1) / pp.221-224 / 1995
  • In speech signal processing, the pitch must be detected accurately. The pitch extraction algorithms proposed so far have difficulty detecting pitch across a wide range of speech signals. In this paper we therefore propose a new pitch detection algorithm that uses a low-pass filter with variable bandwidth. As preprocessing, the method finds the first formant of the speech signal by FFT in each frame, low-pass filters the signal with a cutoff frequency set according to that first formant, and then detects the pitch of the filtered signal. Applying the method, we obtained pitch contours with improved pitch detection accuracy in some noisy environments.

  • PDF
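The preprocessing idea in the abstract above (low-pass filter each frame with a cutoff tied to the first formant, then detect pitch) can be sketched roughly as follows. This is a minimal sketch under several assumptions, not the paper's algorithm: the first formant is taken as already known instead of being found by FFT, a first-order filter stands in for the variable-bandwidth LPF, and an autocorrelation peak search is used as the pitch detector because the abstract does not say which detector the author used. All names and signal values are hypothetical.

```python
import math


def one_pole_lowpass(signal, sample_rate, cutoff_hz):
    """First-order IIR low-pass filter (a deliberately simple stand-in)."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate)
    filtered, state = [], 0.0
    for sample in signal:
        state += alpha * (sample - state)
        filtered.append(state)
    return filtered


def autocorrelation_pitch(signal, sample_rate, min_hz=60.0, max_hz=400.0):
    """Return the pitch (Hz) whose lag maximizes the raw autocorrelation."""
    shortest_lag = int(sample_rate / max_hz)
    longest_lag = int(sample_rate / min_hz)
    best_lag, best_score = shortest_lag, float("-inf")
    for lag in range(shortest_lag, longest_lag + 1):
        score = sum(signal[i] * signal[i + lag] for i in range(len(signal) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic voiced frame: 100 Hz pitch with six harmonics, strongest near 500 Hz.
sample_rate = 8000
true_pitch_hz = 100.0
first_formant_hz = 500.0  # assumed known here; the paper estimates it per frame by FFT
harmonic_amplitudes = [1.0, 0.7, 0.5, 0.6, 0.9, 0.4]
frame = [
    sum(
        amplitude * math.sin(2.0 * math.pi * (k + 1) * true_pitch_hz * n / sample_rate)
        for k, amplitude in enumerate(harmonic_amplitudes)
    )
    for n in range(800)
]

smoothed = one_pole_lowpass(frame, sample_rate, first_formant_hz)
estimated_pitch_hz = autocorrelation_pitch(smoothed, sample_rate)
print(estimated_pitch_hz)  # should be close to the 100 Hz used to build the frame
```

Tying the cutoff to the first formant keeps the fundamental and the lowest harmonics while attenuating higher-frequency structure that can mislead a pitch detector.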

On Speech Digitization and Bandwidth Compression Techniques[II]-Vocoding

  • 은종관
    • 대한전자공학회논문지 / Vol. 15, No. 5 / pp.1-6 / 1978
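  • This paper is Part II, following Part I [1], of a study on the digitization and bandwidth compression of speech signals. It discusses several recently developed vocoding methods: linear predictive coding (LPC), formant vocoding, residual excited linear prediction (RELP) vocoding, and adaptive predictive coding (APC). Among the band-limiting methods for speech transmission, the paper concentrates on LPC, currently the most effective. Current problems and their solutions are also discussed.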

  • PDF

Efficient Tracking of Speech Formant Using Closed Phase WRLS-VFF-VT Algorithm

  • Lee, Kyo-Sik;Park, Kyu-Sik
    • The Journal of the Acoustical Society of Korea / Vol. 19, No. 2E / pp.8-13 / 2000
  • In this paper, we present an adaptive formant tracking algorithm for speech using the closed phase WRLS-VFF-VT method. The pitch-synchronous closed phase method is known to give more accurate estimates of the vocal tract parameters than the pitch-asynchronous method. However, its use has been limited by the difficulty of accurately isolating the closed phase region in successive periods of speech. We have therefore implemented the pitch-synchronous closed phase WRLS-VFF-VT algorithm for speech analysis, especially for formant tracking. The proposed algorithm, with its variable threshold (VT), provides superior performance at phone boundaries and at voiced/unvoiced boundaries. The proposed method is compared experimentally with other methods, such as the two-channel CPC method, using synthetic waveforms and real speech data. The experimental results show that block data processing techniques such as the two-channel CPC gave reasonable estimates of the formants/antiformants. However, the data windows used by these methods included the effects of the periodic excitation pulses, which reduced the accuracy of the estimated formants. In contrast, the proposed WRLS-VFF-VT method, which eliminates the influence of the pulse excitation by using input estimation as part of the algorithm, gave very accurate formant/bandwidth estimates and good spectral matching.

  • PDF

Analysis of Association Relationship Between A16 Acupuncture Point and Heart Function Using Voice Signals

  • 김봉현;조동욱
    • 한국통신학회논문지 / Vol. 35, No. 11B / pp.1651-1658 / 2010
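  • As living standards have improved in recent years, a preventive, health-oriented pattern of diagnosing disease early, before it develops, has taken hold. Hand acupuncture therapy is widely used as an alternative medicine that reflects this preventive health field. This paper therefore uses voice signal processing technology: the A16 acupuncture point, the corresponding point for the heart, is stimulated, the changes in heart-related voice parameters are measured, and the improvement in heart function is assessed by comparing and analyzing them. To this end, voices were first collected before and after stimulating the A16 point, and experiments were carried out using the second formant bandwidth and jitter, two voice signal analysis parameters associated with the heart. As a result, stimulation of the A16 point lowered the second formant bandwidth and jitter, and through this the improvement in heart function could be demonstrated using IT voice signal processing technology.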

NOISE ROBUST FORMANT FREQUENCY ESTIMATION BASED ON COMPLEX AUTOCORRELATION FUNCTION

  • Diankha, Ousmane;Shimamura, Tetsuya
    • 대한전자공학회: Conference Proceedings / 대한전자공학회, 2002 ITC-CSCC -3 / pp.1799-1802 / 2002
  • This paper proposes an improved method for formant frequency estimation based on the complex autocorrelation function of the speech signal. Instead of using the incoming signal as the input for the LPC analysis, the analytic signal of the autocorrelation function of the speech signal is computed and used as the input. Because the analytic signal occupies half the bandwidth of the original signal, the model order required for the LPC analysis is halved. The accuracy of the proposed method in noisy environments is examined on five natural vowels. Its effectiveness is shown by the estimated spectral shapes and the estimation errors of the formant frequencies.

  • PDF