• 제목/요약/키워드: voice parameter

검색결과 179건 처리시간 0.025초

음성분석에 의한 체질진단에 관한 연구 (Pilot Study on the Classification for Sasangin by the Voice Analysis)

  • 이의주;송광빈;최환수;유정희;곽창규;손은혜;고병희
    • 대한한의학회지
    • /
    • 제26권1호
    • /
    • pp.93-102
    • /
    • 2005
  • Objective : This research was conducted to evaluate the method of sasangin classification by voice analysis, The 2 pilot tests were thus designed to solve the following problems: 'What are the conditions at classification for sasangin by the voice analysis?' and 'What are the important variances of /a/ parameter?'. Methods: 122 volunteers Were examined to make a diagnosis of sasangin by QSCC II and they were disease-free and healthy, First, they said /a/ three times for 2 seconds in their usual voice, Second, they said /a/ for 2 seconds by the different ways of high tone, mid tone, and low tone. The sounds were collected by a recording program (cooledit 2000) through a Sony microphone (ecm-26l). We analyzed the voices by maltlab, the simulation tool. Results: There were no differences and were correlations when one said /a/ three times for 2 seconds in the usual voice. There were some things to correlate when one said /a/ three times for 2 seconds by the different ways of high speech, usual speech, and low speech. Others were nothing to correlate. We evaluated the value of sasangin classification method by only /a/ voice analysis. The hit ratio was average $66.3\%\;:\;soyangin\;67.9\%,\;taeumin\;68.0\%,\;soeumin\;63.9\%$. Conclusion: We must set up the conditions to use the method of sasangin classification by voice analysis. The value of sasangin classification method by only fa! voice analysis was a hit ratio of $66.3\%$.

  • PDF

고음질을 갖는 음색변경에 관한 연구 (A Study on the Voice Conversion Algorithm with High Quality)

  • 박형빈;배명진
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
    • /
    • pp.157-160
    • /
    • 2000
  • In the generally a voice conversion has used VQ(Vector Quantization) for partitioning the spectral feature and has performed by adding an appropriate offset vector to the source speaker's spectral vector. But there is not represented the target speaker's various characteristics because of discrete characteristics of transformed parameter. In this paper, these problems are solved by using the LMR(Linear Multivariate Regression) instead of the mapping codebook which is determined to the relationship of source and target speaker vocal tract characteristics. Also we propose the method for solved the discontinuity which is caused by applying to time aligned parameters using Dynamic Time Warping the time or pitch-scale modified speech. In our proposed algorithm for overcoming the transitional discontinuities, first of all, we don't change time or pitch scale and by using the LMR change a speaker's vocal tract characteristics in speech with non-modified time or pitch. Compared to existed methods based on VQ and LMR, we have much better voice quality in the result of the proposed algorithm.

  • PDF

스펙트럼의 변동계수를 이용한 잡음에 강인한 음성 구간 검출 (Noise-Robust Speech Detection Using The Coefficient of Variation of Spectrum)

  • 김영민;한민수
    • 대한음성학회지:말소리
    • /
    • 제48호
    • /
    • pp.107-116
    • /
    • 2003
  • This paper deals with a new parameter for voice detection which is used for many areas of speech engineering such as speech synthesis, speech recognition and speech coding. CV (Coefficient of Variation) of speech spectrum as well as other feature parameters is used for the detection of speech. CV is calculated only in the specific range of speech spectrum. Average magnitude and spectral magnitude are also employed to improve the performance of detector. From the experimental results the proposed voice detector outperformed the conventional energy-based detector in the sense of error measurements.

  • PDF

후두미세수술 전후 /아/의 음향적 특성 비교 (Comparative Study on the Acoustic Characteristics of the Korean Vowel /a/ before and after LMS)

  • 황연시;성철재
    • 대한음성학회지:말소리
    • /
    • 제67호
    • /
    • pp.33-60
    • /
    • 2008
  • The aim of this study is to show the differences in acoustic parameters between a pathological voice /a/ caused by vocal polyp and a normal voice /a/ produced after LMS (Laryngeal Microscopic Surgery). It was expected that voices of two kinds could be analyzed effectively in terms of HNR in specific frequency bands than in all frequency bands. For this study, 10 patients' voice were recorded before and after LMS and then were manipulated in terms of four acoustic parameter. It was found out that (a) frequency bands of 500Hz in the range of 1,000Hz to 4,000Hz were very useful to obtain HNR values; (b) frequency bands in the range of 1,248Hz to 5,500Hz on a log scale were very useful to obtain HNR values; (c) F0 dropped after LMS but not significantly; (d) the bandwidth of the second formant (B2) decreased significantly after LMS, while that of the first formant (B1) decreased after LMS but not significantly.

  • PDF

MDVP와 Praat, Dr. Speech간의 음향학적 측정치에 관한 상관연구 (A Correlation Study among Acoustic Parameters of MDVP, Praat, and Dr. Speech)

  • 유재연;정옥란;장태엽;고도흥
    • 음성과학
    • /
    • 제10권3호
    • /
    • pp.29-36
    • /
    • 2003
  • The purposes of this study was to conduct a correlational analysis among $F_^{0}$, Jitter, Shimmer, and NHR (HNR), and NNE estimated by three speech analysis softwares, MDVP, Praat and Dr. Speech. Thirty females and 15 males with normal voice participated in the study. We used Sound Forge 6.0 to record their voice. MDVP, Praat and Dr. Speech were used to measure the acoustic parameters. The Pearson correlation coefficient was determined through a statistical analysis. The results came out as follows: Firstly, there was a strong correlation between $F_^{0}$ and Shimmer of both instruments. However, there was no correlation between Jitter of both instruments. Secondly, Shimmer showed a stronger correlation with HNR, NHR, and NNE than Jitter. Therefore, Shimmer was considered to be more useful and sensitive parameter to identify dysphonic voice compared to jitter.

  • PDF

Sample selection approach using moving window for acoustic analysis of pathological sustained vowels according to signal typing

  • 이지연
    • 말소리와 음성과학
    • /
    • 제3권3호
    • /
    • pp.99-108
    • /
    • 2011
  • The perturbation parameters like jitter, shimmer, and signal-to-noise ratio (SNR) are largely estimated in the particular segment from the subjective or whole portion of the given pathological voice signal although there are many possible regions to be able to analyze the voice signals. In this paper, the pathological voice signals were classified as type 1, 2, 3, or 4 according to narrow band spectrogram and the value differences of the perturbation parameters extracted in the subjective and entire portion tended to be getting bigger as from type 1 to type 4 signals. Therefore, sample selection method based on moving window to analyze type 2 and 3 signals as well as type 1 signals is proposed. Although type 3 signals cannot be analyzed using the perturbation analysis, the type 3 signals by selecting out the samples in which error count is less than 10 through moving window were analyzed. At present, there is no method to be able to analyze the type 4 signals. Future research will endeavor to determine the best way to evaluate such voices.

  • PDF

보이스코일형 LOA의 제어정수 산정을 위한 특성 해석 및 시험 (Characteristic Analysis and Test of a Voice-Coil-Type LOA for Determination of Control Parameters)

  • 장석명;정상섭;박희창;문석준;박찬일;정태영
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1998년도 하계학술대회 논문집 A
    • /
    • pp.278-280
    • /
    • 1998
  • A voice-coil-type LOA consists of the NdFeB permanent magnets with high specific energy as the stator, a coil-wrapped nonmagnetic hollow rectangular structure, and an iron core as a pathway for magnetic flux. When applying a voice-coil-type LOA to the control system, we have to obtain the control parameters and circuit parameters, such as mass, coil inductance, coil resistance, thrust voltage & stroke, frequency & stroke and so on. Therefore, these parameter were determined from the analytical and experimental method.

  • PDF

u-Health 시스템을 위한 음성신호 분석 기반의 간 기능 모니터링에 관한 연구 (A Study on Monitoring of Liver Function Based on Voice Signal Analysis for u-Health System)

  • 김봉현;조동욱
    • 정보처리학회논문지B
    • /
    • 제18B권6호
    • /
    • pp.389-396
    • /
    • 2011
  • 현대 사회에서 식습관의 변화, 스트레스, 음주 등으로 인해 다양한 간 질환이 발생되거나 악화되어 가고 있다. 따라서 본 논문에서는 간 질환이 음성에 미치는 영향을 연구하여 간 질환을 조기에 진단할 수 있는 방법론을 제안하였다. 이를 위해 간 질환자를 대상으로 입원했을 때와 치료로 인해 정상적으로 퇴원했을 때의 음성을 각각 수집하여 음성 분석 요소를 적용한 실험을 수행하였다. 특히, 한의학적으로 간(肝)과 관련 있는 발음인 아음(牙音)에 대한 분석 실험으로 제3포먼트 주파수 대역폭과 발음 요소값을 적용한 실험을 수행하였으며 이를 통해 간 질환이 공명강과 발성에 미치는 영향을 객관적 지표로 출력하는 연구를 행하였다. 또한 실험 결과를 기반으로 u-Health 환경에서 간 기능을 모니터링하는 시스템 설계에 관한 연구를 수행하였다.

후두질환에 따른 자음의 음성발현시간의 특성 (The Characteristics of Voice Onset Time of the Korean Stops in the Benign Laryngeal Disorders)

  • 홍기환;이화욱;김진성;이은정;소상수;최동일;양윤수
    • 대한후두음성언어의학회지
    • /
    • 제17권2호
    • /
    • pp.98-102
    • /
    • 2006
  • Background and Objectives : Voice onset time(VOT) is defined as the time interval from oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristics of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. Stop consonants are characterized by creation of a pressure difference across a complete occlusion in the vocal tract, followed by a sudden release 'burst' due to opening that occlusion. The objects of this study is to evaluate a usefulness of voice onset time in the assessment of voice disorderd patients. Subjects : Subjects were 20 adults with normal voice and with benign laryngeal disorders. Subjects with voice disorders represented the following vocal pathologies : vocal polyp, vocal nodule, Reinke's edema and unilateral vocal fold paralysis(UVFP). Control subjects were matched for age (21-40 yews old) and sex(male) with the voice disorders subjects and had normal vocal qualities with no history of voice disorders. Methods : Each voice-disordered and matched control subject read the test passages containing three types of Korean bilabial consonants. VOT measures were made for the initial $/p/p^h/\;and\;/p'/$. VOT was measured using acoustic waveform or wide band spectrogram. Results : For each voiceless stop consonants, there was a significant difference in VOT between the voice disordered and normal subjects. The mean VOTs of the lax stops in UVFP was significantly shorter than those of control subjects in the UVFP. The mean VOTs of the aspirated stops in the vocal polyp and nodule were longer than those of control subjects, but not significant. The mean VOTs of the glottalized in voice disordered groups were longer than those of control subjects, and significant statistically in the UVFP. Conclusions : VOT may be a clinically useful acoustic parameter in the assessment of voice disordered patients, especially in the unilateral vocal fold paralysis.

  • PDF

연축성 발성장애 환자의 음향학적 및 공기역학적 양상 (The Acoustic and Aerodynamic Aspects of Patients with Spasmodic Dysphonia)

  • 이주환;김인섭;고윤우;오종석;배정호;윤현철;최성희;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.98-103
    • /
    • 2000
  • Background and Objectives : The etiology and pathophysiology of spasmodic dysphonia is yet unknown. This study was performed to determine if any laryngeal aerodynamic parameter distinguish the voice of patient diagnosed as having adductor spasmodic dysphonia from individuals with normal voice production and to investigate the pathophysiology of spasmodic dysphonia. Materials and Methods : fifteen women diagnosed as having adductor spasmodic dysphonia and fifteen normal control women participitated in this study Maximum phonation time, mean air flow rate, subglottic pressure, vocal efficiency, Vfo, NHR, VTI, FTRI, ATRI, Jitter percent, Shimmer percent were obtained from the participants using 'MDVP(multi-dimensional voice program)' of CSL(Computerized Speech lab, Kay Elemetrics, Co., Model No. 4300), and 'maximum sustained phonation' and 'IPIPI test' of AP II(Aerophone II, Kay Elemetrics, Co., Model 6800). Results : T-test statistical analysis revealed statistically different values for vocal efficiency, Vfo, NHR, MPT, litter percent, Shimmer percent between the spasmodic dysphonia group and the control group. Conclusions : Spasmodic dysphonia affects the ability of the laryngeal mechanism to function effectively. Results from our study demonstrate that certain aerodynamic and acoustic parameters distinguish adductor spasmodic dysphonia from normal voice.

  • PDF