• Title/Abstract/Keywords: vowel quality

81 search results (processing time: 0.023 s)

Change in acoustic characteristics of voice quality and speech fluency with aging

  • 박희준;박진
    • Phonetics and Speech Sciences / Vol. 15, No. 4 / pp. 45-51 / 2023
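  • Voice problems that arise with age can have social and emotional effects and may lead to feelings of isolation and depression. This study therefore examines aging-related changes in acoustic characteristics in terms of voice quality and speech fluency. Sustained phonation and passage-reading tasks produced by 20 elderly men and 20 young men were recorded and analyzed. Voice quality was analyzed using fundamental frequency (F0), jitter (period perturbation), shimmer (amplitude perturbation), and cepstral peak prominence (CPP); speech fluency was analyzed using average syllable duration (ASD), articulation rate (AR), and speech rate (SR). In the voice quality measures, the elderly group showed a higher F0, and their jitter, shimmer, and CPP values indicated degraded voice quality. In the fluency analysis, the ASD, AR, and SR values showed that the elderly group spoke more slowly. Correlation analysis between voice quality and speech fluency showed high correlations of shimmer and CPP with ASD and SR, respectively. These results are expected to support early detection of aging-related changes in voice and speech fluency and the provision of appropriate training.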
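The abstract above names jitter and shimmer without defining them. As a minimal sketch, assuming the widely used "local" definitions (mean absolute difference between consecutive cycles divided by the mean), the two measures could be computed as follows. The abstract does not say which variant the authors used, and the cycle measurements below are hypothetical values chosen for illustration, not data from the study.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


# Hypothetical glottal-cycle measurements (illustrative only).
periods_s = [0.0100, 0.0102, 0.0099, 0.0101, 0.0100]  # cycle durations in seconds
amplitudes = [0.80, 0.78, 0.82, 0.79, 0.81]           # peak amplitude of each cycle

jitter_pct = 100 * local_perturbation(periods_s)    # period perturbation, in percent
shimmer_pct = 100 * local_perturbation(amplitudes)  # amplitude perturbation, in percent
mean_f0_hz = len(periods_s) / sum(periods_s)        # reciprocal of the mean period

print(f"jitter = {jitter_pct:.2f} %, shimmer = {shimmer_pct:.2f} %, F0 = {mean_f0_hz:.1f} Hz")
```

The same function serves both measures because local jitter and local shimmer differ only in the per-cycle quantity they are applied to.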

Comparative Analysis of Performance of Established Pitch Estimation Methods in Sustained Vowel of Benign Vocal Fold Lesions

  • 장승진;김효민;최성희;박영철;최홍식;윤영로
    • Speech Sciences / Vol. 14, No. 4 / pp. 179-200 / 2007
  • In voice pathology, various measurements calculated from pitch values have been proposed as indicators of voice quality. However, these measurements are often inaccurate and unreliable because they are based on incorrect pitch values extracted from pathological voice data. To address this problem, we compared several pitch estimation methods in order to propose a better one for pathological voices. Using a database of 99 pathological and 30 normal voice samples, pitch estimation errors were analyzed and compared between pathological and normal voices and among the vowels produced by patients with benign vocal fold lesions. Gross pitch errors were observed in the pathological voice data. When the pathological voices were classified by the degree of aperiodicity in the speech signal, pitch errors were closely related to the number of aperiodic segments. The autocorrelation approach was found to be the most robust pitch estimation method for the pathological voice data. Further research on more severely pathological voices is desirable in order to reduce pitch estimation errors.
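The abstract above reports that the autocorrelation approach was the most robust. The following is a minimal, generic sketch of autocorrelation-based pitch estimation on a single frame, not the implementation evaluated in the paper. The function name, the F0 search range, and the voicing threshold are assumptions made for illustration.

```python
import math


def estimate_f0_autocorr(frame, sample_rate, f0_min=75.0, f0_max=500.0, min_corr=0.3):
    """Return an F0 estimate in Hz for one frame, or None if no clear periodicity is found.

    f0_min, f0_max, and min_corr are illustrative defaults, not values from the paper.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]              # remove the DC offset
    energy = sum(s * s for s in x)
    if energy == 0.0:
        return None                            # silent frame
    lag_min = max(1, int(sample_rate / f0_max))
    lag_max = min(int(sample_rate / f0_min), n - 1)
    best_lag, best_r = None, min_corr
    for lag in range(lag_min, lag_max + 1):
        # Normalized (biased) autocorrelation at this lag.
        r = sum(x[i] * x[i + lag] for i in range(n - lag)) / energy
        if r > best_r:
            best_lag, best_r = lag, r
    return None if best_lag is None else sample_rate / best_lag


# Synthetic 200 Hz sine, 50 ms at 8 kHz (stand-in for a sustained vowel frame).
sr = 8000
frame = [math.sin(2 * math.pi * 200 * i / sr) for i in range(400)]
f0 = estimate_f0_autocorr(frame, sr)
print(f0)  # expected to be close to 200 Hz
```

Restricting the lag search to a plausible F0 range is one simple guard against the gross (octave) errors the abstract describes. Aperiodic segments will still lower the correlation peak, which is why the sketch returns None below a threshold instead of forcing an estimate.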


The Effects of Pitch Increasing Training (PIT) on Voice and Speech of a Patient with Parkinson's Disease: A Pilot Study

  • Lee, Ok-Bun;Jeong, Ok-Ran;Shim, Hong-Im;Jeong, Han-Jin
    • Speech Sciences / Vol. 13, No. 1 / pp. 95-105 / 2006
  • The primary goal of therapeutic intervention in dysarthric speakers is to increase speech intelligibility. Deciding which features are critical for increasing intelligibility is very important in speech therapy. The purpose of this study is to examine the effects of pitch increasing training (PIT) on the speech of a subject with Parkinson's disease (PD). The PIT program focuses on increasing pitch while a vowel is sustained at the same loudness. The loudness level is somewhat higher than the habitual loudness. A 67-year-old female with PD participated in the study. Speech therapy was conducted in 4 sessions (200 minutes) over one week. Before and after the treatment, acoustic, perceptual, and speech naturalness evaluations were performed for data analysis. The speech and voice satisfaction index (SVSI) was obtained after the treatment. Results showed improvements in voice quality and speech naturalness. In addition, the satisfaction ratings (SVSI) indicated a positive relationship between improved speech production and the satisfaction of the patient and caregivers.


Characteristics of Phoniatrics in Patients with Spastic Dysarthria

  • 김숙희;김현기
    • Speech Sciences / Vol. 15, No. 4 / pp. 159-170 / 2008
  • The purpose of this study was to assess, by acoustic analysis, articulatory motor coordination and respiratory and laryngeal control in spastic dysarthria. Sustained phonation of the vowel /a/ and repetition of the syllable /pa/ were measured in 15 normal speakers and 10 speakers with spastic dysarthria. Multi-Speech, MDVP, and MSP were used for data recording and analysis. The mean DDK rate in the spastic group was significantly slower than in the normal group. The maximum phonation time in the spastic group ($4.80 \pm 1.94$) was shorter than in the normal group ($11.20 \pm 3.72$). DDKjit was significantly higher in the spastic group, and DDKsla was reduced. The mean syllable duration in the spastic group (146.2 ms) was significantly longer than in the normal group (75.8 ms). The mean energy was reduced in the spastic group. The Fo range was greater than in the normal group. Frequency perturbation (jitter, vFo) and amplitude perturbation (shimmer, vAm) were higher than in the normal group, as was the NHR. These parameters differed significantly between the spastic dysarthria group and the normal group (p<0.05). In sum, speakers with spastic dysarthria show short respiration, slow speech rate, and voice quality problems. These results will help in planning treatment and intervention.


Acoustic and Stroboscopic Characteristics in Clergies

  • 진성민;박상욱;강현국;이경철;이용배;김보형
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / Vol. 9, No. 1 / pp. 47-52 / 1998
  • Objectives: To compare objective differences in voice quality and voice problems between clergy and a normal male control group. Materials and Methods: The sustained vowel sounds of 46 clergy and 40 normal persons were analyzed using videostroboscopy and an acoustic analyzer. Together with these analyses, a questionnaire on current and past voice problems was given to the participants. Results: The most common symptom in the clergy group was voice fatigue. Stroboscopic findings in the clergy group were as follows: 23 cases (50%) of pachydermia, 17 cases (37%) of phase difference, 12 cases (25%) of anterior-posterior contracture, 6 cases (13%) of vocal polyp, and 3 cases (7%) of vocal nodule. The mean maximal phonation time was 17.8 seconds in the clergy group and 19 seconds in the control group. Jitter, pitch perturbation quotient, and shimmer were significantly higher in the clergy group than in the control group (p<0.05), but there were no significant differences between the two groups in fundamental frequency, vFo, amplitude perturbation quotient, or noise-to-harmonic ratio. Conclusion: In clergy, who use a loud and forceful voice, vocal polyps and functional voice disorder findings were frequently noted on stroboscopic examination. Jitter and shimmer, which reflect roughness of voice, were increased in the acoustic analysis. Therefore, clergy, classified as untrained professional voice users, need professional career guidance and counseling.


Acoustic and Stroboscopic Characteristics in Teachers, Clergies and Telephone Operators

  • 진성민;박상욱;이정우;이경철;이용배
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / Vol. 9, No. 1 / pp. 53-58 / 1998
  • Objectives: To compare the voice quality and voice problems of untrained professional voice user groups with those of a normal control group without voice problems. Materials and Methods: The sustained vowel sounds of 13 male and 36 female teachers, 46 clergy, 15 telephone operators, and 40 normal male and 20 normal female persons were analyzed using videostroboscopy and an acoustic analyzer. Together with these analyses, a questionnaire on risk factors for current and past voice problems was given to the participants. Results: The most common symptom in the subject groups was voice fatigue. On stroboscopic examination, the professional voice user groups showed functional voice disorder findings regardless of the intensity of voice use. In the clergy and teachers, who use a loud voice, vocal polyps, vocal nodules, and hyperfunction of the laryngeal muscles were frequently observed. In the clergy and telephone operators, jitter and shimmer were significantly increased. In the female teachers, jitter, fundamental frequency variation, and fundamental frequency differed to a statistically significant degree. However, the voices of the male teachers showed no significant findings in the acoustic and aerodynamic studies. Conclusion: In managing voice problems in untrained professional voice user groups, it is important to identify the exact causes and patterns of the voice problems and to individualize management according to those causes.


Guidance to the Praat, a Software for Speech and Acoustic Analysis

  • 성철재
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / Vol. 33, No. 2 / pp. 64-76 / 2022
  • Praat is a useful analysis tool for linguists, engineers, doctors, speech-language pathologists, music majors, and natural scientists. Basic parameters such as duration, pitch, and energy, and perturbation parameters such as jitter and shimmer, can be easily measured and manipulated in the sound editor. When a more in-depth analysis is needed, it is recommended to understand the advanced menus of the object window and learn how to use them. Among the object window menus, vowel formant analysis, spectrum analysis, and cepstrum analysis are particularly useful in the clinical field. The spectrum object is useful for voice quality measurement and for diagnosing patients with voice disorders because it shows the energy distribution along the frequency axis. A cepstrum object is useful for speech analysis when the periodicity of the sound object is not measurable. The low-to-high ratio obtained from the spectrum object and the CPPs measured from the cepstrum object have attracted many researchers, and the CPPs measured in Praat have been shown to perform relatively well.

Acoustic Analysis and Auditory-Perceptual Assessment for Diagnosis of Functional Dysphonia

  • 김근효;이연우;배인호;이재석;이창윤;박희준;이병주;권순복
    • Journal of Clinical Otolaryngology Head and Neck Surgery / Vol. 29, No. 2 / pp. 212-222 / 2018
  • Background and Objectives: The purpose of this study was to compare acoustic and auditory-perceptual measures between normal and functional dysphonia (FD) groups. Materials and Methods: 102 subjects with FD and 59 subjects with normal voices participated in this study. The mid-vowel portion of the sustained vowel /a/ and two sentences of 'Sanchaek' were edited, concatenated, and analyzed with a Praat script. Auditory-perceptual (AP) rating was then completed by three listeners. Results: The FD group showed higher acoustic voice quality index version 2.02 and version 3.01 (AVQIv2 and AVQIv3), slope, Hammarberg index (HAM), grade (G), and overall severity (OS) values than the normal group. In addition, smoothed cepstral peak prominence in Praat (PraatCPPS), tilt, low-to-high spectral band energy ratio (L/H ratio), and long-term average spectrum (LTAS) were lower in the FD group than in the normal voice group. The correlations among the measured values ranged from -0.250 to 0.960. In the ROC curve analysis, the cutoff values of AVQIv2, AVQIv3, PraatCPPS, slope, tilt, L/H ratio, HAM, and LTAS were 3.270, 2.013, 13.838, -22.286, -9.754, 369.043, 27.912, and 34.523, respectively, and the AUC was over 0.890 for AVQIv2, AVQIv3, and PraatCPPS, over 0.731 for HAM, tilt, and slope, and over 0.605 for LTAS and L/H ratio. Conclusions: AVQI and CPPS showed the highest predictive power for distinguishing between the normal and FD groups. Acoustic analyses and AP rating, as noninvasive examinations, can reinforce screening for FD and help to establish an efficient diagnosis and treatment plan for FD.
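The abstract above reports ROC cutoff values and AUCs but does not state how the cutoffs were selected. As a sketch under that caveat, one common criterion is Youden's J (sensitivity + specificity - 1), shown here together with a rank-based AUC. The scores and labels are hypothetical, and the code assumes that a higher score means more dysphonic; for measures that are lower in the FD group (for example PraatCPPS), the scores would need to be negated first.

```python
def roc_points(scores, labels):
    """Return (threshold, sensitivity, specificity) for every observed score.

    labels: 1 = disorder, 0 = normal; a case is called positive when score >= threshold.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = []
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        points.append((t, tp / n_pos, tn / n_neg))
    return points


def youden_cutoff(scores, labels):
    """Threshold that maximizes Youden's J = sensitivity + specificity - 1."""
    return max(roc_points(scores, labels), key=lambda p: p[1] + p[2] - 1)[0]


def auc(scores, labels):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical index values (illustrative only, not data from the study).
scores = [1.2, 2.0, 2.8, 3.1, 3.5, 4.0, 4.4, 5.1]
labels = [0, 0, 1, 0, 1, 1, 1, 1]

cutoff = youden_cutoff(scores, labels)
area = auc(scores, labels)
print(cutoff, round(area, 3))
```

Other selection rules (for example the point closest to the top-left corner of the ROC plot) can give different cutoffs on the same data, so reported cutoffs are only comparable when the criterion is known.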

Classification of muscle tension dysphonia (MTD) female speech and normal speech using cepstrum variables and random forest algorithm

  • 윤주원;심희정;성철재
    • Phonetics and Speech Sciences / Vol. 12, No. 4 / pp. 91-98 / 2020
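  • Sustained vowel phonation and sentence-reading tasks of patients with muscle tension dysphonia (MTD) were analyzed using cepstrum-based variables. The correlation between the GRBAS auditory-perceptual characteristics and the acoustic characteristics of the patients was examined, and the feasibility of differential diagnosis of MTD using the random forest machine-learning classification algorithm was discussed. Thirty-six women diagnosed with MTD at their clinic visit and 36 women with normal voices participated in the study, and the collected voice samples were analyzed with ADSV™. Among the acoustic measures, the CSID (cepstral spectral index of dysphonia) of the MTD group was higher than that of the control group, and the CPP (cepstral peak prominence) and CPP_F0 values were significantly lower. This held for both the vowel phonation and reading tasks. As for the voice quality of the MTD patients, overall severity (G) was the most prominent, followed by roughness (R), breathiness (B), and strain (S). As these ratings increased, CPP decreased (a negative correlation) and CSID increased (a positive correlation). Classification of MTD and control voices was attempted using CPP and CPP_F0, the cepstral variables that showed significant group differences in both the vowel and sentence-reading tasks. Modeling with the random forest machine-learning algorithm yielded a slightly higher classification accuracy in the sentence-reading task (83.3%) than in sustained vowel phonation, and the CPP variable played the more central role in both tasks.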

Acoustic analysis of wet voice among patients with swallowing disorders

  • 강영애;구본석;권인선;성철재
    • Phonetics and Speech Sciences / Vol. 10, No. 4 / pp. 147-154 / 2018
  • Wet voice quality (WVQ) is a characteristic that appears after swallowing. Although the concept is accepted by many clinicians worldwide, it remains ambiguous. In this study, we investigated WVQ in patients with swallowing disorders using acoustic analysis. A total of 106 patients diagnosed with penetration-aspiration by the videofluoroscopic swallowing study (VFSS) were recruited. The vowel /a/ was recorded before and after the VFSS, and an acoustic analysis was then performed using Praat. The post-VFSS voice was judged perceptually and used to divide the patients into two groups: the Wet group (48 patients) and the Non-wet group (58 patients). At the post-VFSS stage, the two groups differed significantly in many acoustic parameters, including F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP. Logistic regression identified Jitter and NHR as the parameters affecting the wetness judgment. At the pre-VFSS stage, the two groups differed significantly in many acoustic parameters, including Intensity, Jitter, RAP, Shimmer, NHR, FUF, DVB, and CPP. Both pre- and post-VFSS, the mean values of all significant parameters except Intensity, HNR, and CPP were higher in the Wet group. Between the pre- and post-VFSS stages, the two groups displayed interactions in many parameters (Intensity, F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP). In particular, Intensity increased in both groups after the VFSS, although the increase was greater in the Non-wet group. Based on these results, it was conjectured that WVQ after swallowing results from the secretion effect of the mucous membrane due to the dry laryngeal characteristic of elderly patients, rather than from aspiration leaving food on the vocal cords.