• Title/Abstract/Keywords: sustained vowels

40 search results (processing time: 0.023 s)

Perturbation and Nonlinear Dynamic Analysis of Sustained Vowels in Normal and Pathological Voices

  • 이지연;최성희;;한민수;최홍식
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • /
    • Vol. 2, No. 1
    • /
    • pp.113-120
    • /
    • 2010
  • In this paper, we investigate the acoustic characteristics of sustained voices from normal subjects and patients with laryngeal pathologies. Perturbation methods (including jitter and shimmer), signal-to-noise ratio (SNR), and nonlinear dynamic methods (such as correlation dimension) are used to analyze normal and pathological voices. We find that jitter does not statistically discriminate between normal and pathological voices, but a significant difference is found for shimmer, SNR, and correlation dimension. The results suggest that nonlinear dynamic analysis may be valuable for the analysis of normal and pathological voices but perturbation analysis should be applied with caution for pathological voice analysis.

  • PDF
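
The abstract above names jitter and shimmer but does not give formulas. As a minimal sketch, assuming the common "local" definitions (mean absolute difference between consecutive cycles, divided by the mean), both can be computed with one helper. The function name `relative_perturbation` and the sample values are hypothetical and are not taken from the paper.

```python
def relative_perturbation(values):
    """Mean absolute difference between consecutive values, as a percentage of the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


# Hypothetical per-cycle measurements, invented for illustration only.
periods_ms = [10.0, 10.1, 9.9, 10.0, 10.2]        # glottal cycle lengths
peak_amplitudes = [0.80, 0.78, 0.82, 0.79, 0.81]  # peak amplitude of each cycle

jitter_percent = relative_perturbation(periods_ms)        # local jitter
shimmer_percent = relative_perturbation(peak_amplitudes)  # local shimmer
print(f"jitter  = {jitter_percent:.2f} %")
print(f"shimmer = {shimmer_percent:.2f} %")
```

Both measures depend on correctly detected cycle boundaries, which is one reason perturbation analysis can be unreliable for strongly aperiodic voices.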

Statistical analysis on long-term change of jitter component on continuous speech signal (음성신호의 Jitter 성분의 장시간 변화에 관한 통계적 분석)

  • 조철우
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • /
    • Vol. 12, No. 4
    • /
    • pp.73-80
    • /
    • 2020
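  • This study examines a method for measuring the long-term jitter component in continuous speech. Conventional jitter measurement has mainly quantified variability in sustained vowels. In continuous speech such as sentences, the prosody of the sentence distorts conventional measurements. We therefore propose a method that cancels the prosody-driven pitch variation in continuous speech. To remove it, a curve representing the trend of the pitch variation within the analysis interval is obtained by polynomial interpolation, and the variation is removed relative to that curve. Jitter is then measured from the detrended pitch contour to quantify the variability of the pitch frequency, and the result is compared with the conventional measurement on sustained vowels. Speech samples from the Kay Pentax MEEI database were used to evaluate the proposed method. Statistical analysis showed that jitter values measured from continuous speech with the proposed method exhibited variability similar to that of the parameter values measured from sustained vowels of the same speaker.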
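
The detrending idea in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the author's implementation: it assumes one F0 value per glottal cycle over a voiced stretch, uses a least-squares polynomial fit (`numpy.polyfit`) as the trend curve, and picks degree 3 arbitrarily. The function names and the synthetic contour are hypothetical.

```python
import numpy as np


def jitter_percent(periods):
    """Local jitter: mean absolute consecutive period difference over the mean period (%)."""
    periods = np.asarray(periods, dtype=float)
    return 100.0 * np.mean(np.abs(np.diff(periods))) / np.mean(periods)


def detrended_jitter(f0_hz, degree=3):
    """Remove a polynomial pitch trend, then measure jitter on the flattened contour."""
    f0_hz = np.asarray(f0_hz, dtype=float)
    # Scale the time axis to [-1, 1] to keep the polynomial fit well conditioned.
    x = np.linspace(-1.0, 1.0, len(f0_hz))
    trend = np.polyval(np.polyfit(x, f0_hz, degree), x)
    # Subtract the trend but keep the mean F0 so the periods stay physically meaningful.
    flattened = f0_hz - trend + f0_hz.mean()
    return jitter_percent(1.0 / flattened)


# Synthetic contour (assumption): falling intonation plus a small cycle-to-cycle wobble.
cycles = np.arange(60)
contour = 220.0 - 0.9 * cycles + 0.5 * (-1.0) ** cycles
print("raw jitter (%):      ", jitter_percent(1.0 / contour))
print("detrended jitter (%):", detrended_jitter(contour))
```

The polynomial degree controls how much slow variation is treated as prosody, so it would need tuning against real sentence material.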

Fundamental Frequencies of Normal Children's Voice in Mutational Period (변성기 일반 아동 음성의 기본주파수 연구)

  • 김선해
    • Speech Sciences (음성과학)
    • /
    • Vol. 14, No. 4
    • /
    • pp.251-260
    • /
    • 2007
  • Structural changes of the vocal folds are related to the fundamental frequency (F0). In other words, the increase in vocal fold length and thickness lowers F0 during the mutational period. The purpose of this study was to investigate the F0 of normal children's voices during the mutational period. A total of 360 children (180 boys and 180 girls), aged 11 to 16 years, participated in the experiment. The subjects were asked to produce the sustained corner vowels (/a/, /i/, /u/) five times each, and the data were analyzed using the MDVP of CSL. The results show that F0 decreases considerably with age and reaches adult F0 by 16 years in most cases. In particular, the F0 of male subjects decreased rapidly between the ages of 12 ($226.98 \pm 19$ Hz) and 13 years ($169.3 \pm 25$ Hz), while the F0 of female subjects changed slowly from the later part of age 12 to 16 years. These results may serve as a guideline and provide basic data for differentiating between normal voices and voice disorders.

  • PDF

The Effect of Noise on the Normal and Pathological Voice (소음환경이 정상 및 병적음성에 미치는 영향)

  • 홍기환;양윤수;김현기
    • Speech Sciences (음성과학)
    • /
    • Vol. 9, No. 4
    • /
    • pp.27-38
    • /
    • 2002
  • The purpose of this article is to present acoustic parameters (VOT, jitter, shimmer, vF0, vAm, NHR, SPI, VTI, DVB, DSH) for consonants (/pipi/, /$p^{h}ip^{h}i$/, /p'ip'i/) and sustained vowels (/a/, /e/, /i/) produced by normal subjects and dysphonia patients at two levels of vocal effort (normal and high), with the Lombard effect elicited using 60 dB white noise. The Lombard effect refers to the increase in vocal effort in noisy situations. At normal vocal effort, the acoustic parameter values of patients are in general greater than those of normal subjects. In noise, a significant decrease in acoustic values is seen in normal subjects compared with dysphonia patients. The clinical implication of this finding is that vocal quality in dysphonia is not compensated by vocal effort as well as it is in normal subjects, because of the inefficiency caused by abnormal vocal fold appearance and function. Based on this result, patients can be counseled that voice quality may not improve as much as they expect.

  • PDF

Sample selection approach using moving window for acoustic analysis of pathological sustained vowels according to signal typing

  • 이지연
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • /
    • Vol. 3, No. 3
    • /
    • pp.99-108
    • /
    • 2011
  • Perturbation parameters such as jitter, shimmer, and signal-to-noise ratio (SNR) are usually estimated either from a subjectively chosen segment or from the whole of a given pathological voice signal, although many other regions of the signal could be analyzed. In this paper, pathological voice signals were classified as type 1, 2, 3, or 4 according to the narrow-band spectrogram, and the differences between perturbation parameter values extracted from the subjectively chosen portion and from the entire signal tended to grow from type 1 to type 4. Therefore, a sample selection method based on a moving window is proposed for analyzing type 2 and 3 signals as well as type 1 signals. Although type 3 signals cannot ordinarily be analyzed with perturbation analysis, they were analyzed by using the moving window to select samples in which the error count is less than 10. At present, there is no method for analyzing type 4 signals. Future research will endeavor to determine the best way to evaluate such voices.

  • PDF
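
The moving-window selection described in this abstract might look like the sketch below. The abstract does not define "error count", so this example assumes a per-frame flag marking frames where pitch tracking failed. The function name, the window and hop sizes, and the flag sequence are all hypothetical.

```python
def select_windows(frame_errors, window, hop=1, max_errors=10):
    """Return (start_index, error_count) for each window with fewer than max_errors errors."""
    if window <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    selected = []
    for start in range(0, len(frame_errors) - window + 1, hop):
        count = sum(frame_errors[start:start + window])
        if count < max_errors:
            selected.append((start, count))
    return selected


# Hypothetical flags: 1 = pitch-tracking error in that frame, 0 = frame analyzed cleanly.
flags = [1] * 20 + [0] * 30 + [1] * 20
candidates = select_windows(flags, window=30, hop=10)
# Prefer the cleanest window when several qualify.
best = min(candidates, key=lambda item: item[1]) if candidates else None
print(candidates, best)
```

Perturbation parameters would then be computed only on the selected window rather than on the whole recording.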

Automated Speech Analysis Applied to Sasang Constitution Classification (음성을 이용한 사상체질 분류 알고리즘)

  • 강재환;유종향;이혜정;김종열
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • /
    • Vol. 1, No. 3
    • /
    • pp.155-163
    • /
    • 2009
  • This paper introduces an automatic voice classification system for diagnosing individual constitution based on Sasang Constitutional Medicine (SCM) in Traditional Korean Medicine (TKM). To develop the algorithm, we used the voices of 473 speakers and extracted a total of 144 speech features from speech data consisting of five sustained vowels and one sentence. The classification system, based on a rule-based algorithm derived from a nonparametric statistical method, presents binary negative decisions. In conclusion, 55.7% of the speech data were diagnosed by this system, of which 72.8% were correct negative decisions.

  • PDF

Age and Sex Differences in Acoustic Parameters of Adult Voice (성인기 이후 연령과 성에 따른 음향음성학적 특성)

  • 이효진;김수진
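    • The Korean Society of Phonetic Sciences and Speech Technology (대한음성학회): Conference Proceedings
    • /
    • Proceedings of the 2005 Fall Conference of the Korean Society of Phonetic Sciences and Speech Technology (대한음성학회)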
    • /
    • pp.141-144
    • /
    • 2005
  • The purpose of this study is to identify acoustic changes with age and to provide evaluation criteria for the elderly voice. A total of 120 Korean adults (three age groups × two sex groups) produced three sustained vowels, read a part of 'Taking a walk', and described a picture. The data were analyzed acoustically with the MDVP of CSL. The results showed that: 1) among the parameters, F0 showed the most statistically significant changes with sex and age, whereas shimmer showed no significant change; 2) acoustic parameters changed from young adulthood to old age. Different patterns of change with aging were observed in men and women.

  • PDF

$F_2$ Formant Frequency Characteristics of the Aging Male and Female Speakers (한국어 모음에서 연령증가에 따른 제2음형대의 변화양상)

  • 김찬우;차흥억;장일환;김선태;오승철;석윤식;이영숙
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics (대한후두음성언어의학회지)
    • /
    • Vol. 10, No. 2
    • /
    • pp.119-123
    • /
    • 1999
  • Background and Objectives: Conditions such as muscle atrophy, stretching of the strap muscles, and continued craniofacial growth have been cited as contributing to the changes observed in vocal tract structure and function in elderly speakers. The purpose of the present study is to compare $F_1$ and $F_2$ frequency levels in elderly and young adult male and female speakers producing a series of vowels ranging from high-front to low-back placement. Material and Methods: The subjects were two groups of young adults (10 males, 10 females; mean age 21 years, range 19-24 years) and two groups of elderly speakers (10 males, 10 females; mean age 67 years, range 60-84 years). Each subject was judged by a speech pathologist to be a speaker of unimpaired standard Korean. The headphone was positioned 2 cm from the speaker's lips. Each speaker sustained the five vowels for 5 s. Formant frequency measures were obtained by linear predictive coding analysis in CSL model 4300B (Kay co). Results: Repeated-measures ANOVA procedures were completed on the $F_1$ and $F_2$ data for the male and female speakers. $F_2$ formant frequency levels proved to be significantly lower for elderly speakers. Conclusions: We presume that the $F_2$ vocal cavity (from the point of tongue constriction to the lips) lengthens in elderly speakers. Research designed to observe dynamic speech production more directly will be needed.

  • PDF

A Study of Acoustic Measurement in Connected Speech with Dysphonia (음성장애 연속구어의 음향학적 분석)

  • 이명순
    • Phonetics and Speech Sciences (말소리와 음성과학)
    • /
    • Vol. 3, No. 4
    • /
    • pp.109-115
    • /
    • 2011
  • The purposes of this study were to identify acoustic parameters of connected speech and to contribute to the acoustic analysis of dysphonic voice in patients' natural speech as well as in sustained vowel phonation. Acoustic parameters of the sentences included the LTAS (long-term average spectrum) mean and spectral slope over frequency ranges of 0-4 kHz, 0-6 kHz, 0-8 kHz, and 0-12.5 kHz, as well as HNR. Acoustic parameters of the vowel 'a' included jitter, RAP, shimmer, NHR, and HNR. Based on the 'G' of GRBAS for the severity of dysphonia, two experienced raters judged the voices and classified them into four groups: control, mild, moderate, and severe dysphonia. The connected speech consisted of two sentences extracted from the 'stroll' passage. Parameters of the vowel and the LTAS mean of the sentences were measured with CSL. The spectral slope of the sentences and the HNR of the vowel and the sentences were measured with Praat. Data were statistically analyzed with Spearman correlation and the Kruskal-Wallis test using SPSS 12.0. The results are as follows. First, jitter, RAP, shimmer, and NHR were significantly different between the groups. Second, for several frequency ranges, the LTAS mean and spectral slope of the sentences were significantly different between the groups. Third, the HNR of the sentences was significantly different between the groups. Fourth, the HNR and NHR of the vowel were correlated with the HNR of the sentences. Accordingly, this study concluded that LTAS, spectral slope, and HNR were predictive parameters of connected speech for dysphonic voice.

  • PDF

Comparative Analysis of Performance of Established Pitch Estimation Methods in Sustained Vowel of Benign Vocal Fold Lesions (양성후두 질환의 지속모음을 대상으로 한 기존 피치 추정 방법들의 성능 비교 분석)

  • 장승진;김효민;최성희;박영철;최홍식;윤영로
    • Speech Sciences (음성과학)
    • /
    • Vol. 14, No. 4
    • /
    • pp.179-200
    • /
    • 2007
  • In voice pathology, various measurements calculated from pitch values have been proposed to describe voice quality. However, those measurements are frequently inaccurate and unreliable because they are based on incorrect pitch values estimated from pathological voice data. To address this problem, we compared several pitch estimation methods in order to propose a better one for pathological voices. Using a database of 99 pathological and 30 normal voice samples, errors arising from pitch estimation were analyzed and compared between pathological and normal voice data and among the vowels produced by patients with benign vocal fold lesions. The results showed that gross pitch errors occurred in the pathological voice data. Across the types of pathological voices classified by the degree of aperiodicity in the speech signal, pitch errors were closely related to the number of aperiodic segments. The autocorrelation approach was also found to be the most robust pitch estimation method for the pathological voice data. Further research on more severely pathological voice data is desirable in order to reduce pitch estimation errors.

  • PDF
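
For readers unfamiliar with the autocorrelation approach mentioned in this abstract, a bare-bones version is sketched below. It is a generic textbook form, not the specific implementation compared in the paper. The function name, the 60-500 Hz search range, and the synthetic test tone are assumptions.

```python
import numpy as np


def estimate_f0_autocorr(frame, sample_rate, f0_min=60.0, f0_max=500.0):
    """Estimate F0 (Hz) of one frame from the highest autocorrelation peak in the lag range."""
    frame = np.asarray(frame, dtype=float)
    frame = frame - frame.mean()  # remove DC offset before correlating
    # Keep non-negative lags only: ac[k] is the correlation at a lag of k samples.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lag_min = max(1, int(sample_rate / f0_max))
    lag_max = min(int(sample_rate / f0_min), len(ac) - 1)
    if lag_max <= lag_min:
        raise ValueError("frame too short for the requested F0 range")
    best_lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    return sample_rate / best_lag


# Synthetic check (assumption): a clean 150 Hz tone sampled at 16 kHz for 40 ms.
sr = 16000
t = np.arange(int(0.04 * sr)) / sr
tone = np.sin(2 * np.pi * 150.0 * t)
print(estimate_f0_autocorr(tone, sr))
```

On strongly aperiodic frames the highest peak can fall on a multiple of the true period, which produces gross pitch errors of the kind the abstract reports.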