• Title/Abstract/Keywords: voice quality assessment

39 results (processing time: 0.024 s)

The Utility of Perturbation, Non-linear Dynamic, and Cepstrum Measures of Dysphonia According to Signal Typing

  • 최성희;최철희
    • 말소리와 음성과학 / Vol. 6, No. 3 / pp. 63-72 / 2014
  • The current study assessed the utility of the acoustic analyses most commonly used in routine clinical voice assessment, including perturbation analysis, nonlinear dynamic analysis, and spectral/cepstral analysis, based on signal typing of dysphonic voices, and investigated the clinical applicability of these methods. A total of 70 dysphonic voice samples were classified by signal type using narrowband spectrograms. The traditional parameters %jitter, %shimmer, and signal-to-noise ratio (SNR) were calculated for the signals using TF32. The correlation dimension (D2), a nonlinear dynamic parameter, and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV (Analysis of Dysphonia in Speech and Voice™). Auditory-perceptual analysis was performed by two blinded speech-language pathologists using GRBAS. The results showed that nearly periodic Type 1 signals were all from functional dysphonia, whereas Type 4 signals comprised neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant differences related to signal type were found in all acoustic and auditory-perceptual measures. SNR, CPP, and L/H ratio values for Type 4 signals were significantly lower than those of the other signal types, and significantly higher %jitter and %shimmer were observed in Type 4 signals (p<.001). Additionally, D2 values increased significantly with increasing signal type, reflecting more complex and nonlinear patterns. Nevertheless, D2 could not be obtained for voice signals with a high noise component associated with breathiness. In particular, CPP was more sensitive to the voice qualities 'G', 'R', and 'B' than any other acoustic measure. Thus, spectral and cepstral analyses may be applied to more severely dysphonic voices such as Type 4 signals, and CPP may be a more accurate and predictive acoustic marker for measuring voice quality and severity in dysphonia.

A Word List Construction and Measurement Method for Intelligibility Assessment of Synthesized Speech by Rule

  • 김성한;홍진우;김순협
    • 전자공학회논문지B / Vol. 29B, No. 1 / pp. 43-49 / 1992
  • As a result of recent progress in speech synthesis techniques, new services using these techniques are about to be introduced into the telephone communication system. In setting standards, voice quality is obviously an important criterion. It is very important to develop a quality evaluation method for synthesized speech that supports diagnostic assessment of system algorithms and fair comparison of assessment values. This paper describes several basic concepts and criteria for quality assessment (intelligibility) of speech synthesized by rule, and then proposes a word selection method and a word list for use in word intelligibility tests. Finally, a test method for word intelligibility is described.

Change of Voice Parameters After Thyroidectomy Without Apparent Injury to the Recurrent Laryngeal or External Branch of Superior Laryngeal Nerve: A Prospective Cohort Study

  • Lee, Doh Young;Choe, Goun;Park, Hanaro;Han, Sungjun;Park, Sung Joon;Kim, Seong Dong;Kim, Bo Hae;Jin, Young Ju;Lee, Kyu Eun;Park, Young Joo;Kwon, Tack-Kyun
    • 대한후두음성언어의학회지 / Vol. 33, No. 2 / pp. 89-96 / 2022
  • Background and Objectives: Quality of life after thyroidectomy, including voice change, is considered to be as important as control of the disease. In this study, we aimed to evaluate changes in both subjective and objective voice parameters after thyroidectomy in patients with normal mobility of the vocal cords. Materials and Method: In this prospective cohort study, 204 patients who underwent thyroidectomy with or without central neck dissection at a single referral center from Feb 2015 to Aug 2016 were enrolled. All patients underwent prospective voice evaluations, including both subjective and objective assessments, preoperatively and then at 2 weeks and 3, 6, and 12 months postoperatively. Temporal changes in the voice parameters were analyzed. Results: Values of the subjective assessment tool worsened during the early postoperative follow-up period and did not recover to the preoperative values at 12 months postoperatively. The maximal phonation time gradually decreased, whereas most objective parameters, including maximal vocal pitch (MVP), reached preoperative values at 3-6 months postoperatively. The initial decrease in MVP was significantly greater in patients undergoing total thyroidectomy, and their MVP recovery was faster than that of patients undergoing lobectomy (p=0.001). Patients whose external branch of the superior laryngeal nerve was confirmed intact by electroidentification showed no difference in recovery speed compared with patients without electroidentification (p=0.102), although the initial decrease in MVP was smaller with electroidentification. Conclusion: Subjective assessment of voice quality and maximal phonation time after thyroidectomy did not show recovery to preoperative values. Aggravation of MVP was associated with surgical extent and electroidentification.

Prognosis of Patients with Benign Vocal Fold Lesions after Laryngeal Microsurgery

  • 최병길;김병준;최효근;박범정
    • 대한후두음성언어의학회지 / Vol. 29, No. 1 / pp. 37-40 / 2018
  • Background and Objectives: This study aimed to evaluate patients' subjective and objective outcomes after laryngeal microsurgery for benign vocal fold (VF) lesions and to assess the usefulness of surgical treatment. Materials and Methods: The authors retrospectively reviewed the medical records of 102 patients who underwent laryngeal microsurgery for benign VF lesions from January 2013 to August 2017. Subjective voice outcomes were measured using the Voice Handicap Index (VHI). Objective voice outcomes were recorded with the Multi-Dimensional Voice Program (MDVP) just before surgery and at least 3 months after surgery. Results: Benign VF lesions were categorized as VF nodule (n=34, 33%), VF polyp (n=47, 26%), intracordal cyst (n=15, 15%), Reinke's edema (n=6, 6%), and VF papilloma (n=2, 2%). Postoperative VHI scores showed statistically significant reductions in all of the functional, physical, and emotional domains (p<0.001). MDVP showed significant improvement in jitter (p=0.001), shimmer (p<0.001), and noise-to-harmonic ratio (NHR) (p=0.001). Conclusion: Laryngeal microsurgery for benign vocal fold lesions is an effective treatment, with statistically significant improvement in subjective and objective voice quality assessment.

Aerodynamic Analysis of Phonation

  • 권택균;임윤성
    • 대한후두음성언어의학회지 / Vol. 19, No. 2 / pp. 85-88 / 2008
  • Several parameters are used for the assessment of phonatory function and voice quality in clinical settings. Glottic airflow, subglottal pressure, mean phonation time, laryngeal resistance, and voice efficiency are the most commonly used aerodynamic parameters. Aerodynamic analysis was developed to evaluate the phonatory energy source and to estimate laryngeal efficiency. These measurements have also shown good correlation with perceptions of breathiness and findings of glottic competence. Aerodynamic study is important for understanding the relationship between pulmonary and phonatory function.

A Study of Acoustic Characteristics of Two Syllables Words and Sustained Vowel

  • 채윤정;김범규;홍기환
    • 대한후두음성언어의학회지 / Vol. 11, No. 1 / pp. 104-112 / 2000
  • Voice disorders are evaluated by two methods: perceptual analysis and acoustic analysis. Both methods focus only on sustained vowels, and analysis of voice disorders at the level of conversational speech has not been sufficiently carried out. The purpose of the present study is to compare two-syllable words and sustained vowels in patients with vocal polyps and in normal male speakers, and to provide basic data for vocal assessment and voice therapy. Fifteen male patients with vocal polyps formed the subject group, and fifteen healthy males formed the control group. The voices of the subject and control groups, saved in the MDVP of CSL, were analyzed with its own analysis program. As a result, in the subject group, voice quality did not differ between the vowel following a lenis stop and the sustained vowel, but differed significantly between the vowel following a heavily aspirated stop and the sustained vowel. In the control group, the vowels following stops and the sustained vowel also differed in many respects in voice quality, with an especially significant difference between the vowel following a glottal stop and the sustained vowel.

Reliability of OperaVOX™ against Multi-Dimensional Voice Program to Assess Voice Quality before and after Laryngeal Microsurgery in Patient with Vocal Polyp

  • 김선우;김소연;조재경;진성민;이상혁
    • 대한후두음성언어의학회지 / Vol. 31, No. 2 / pp. 71-77 / 2020
  • Background and Objectives: OperaVOX™ (Oxford Wave Research Ltd.) is a portable voice analysis software package designed for use with iOS devices. As a relatively cheap, portable, and easily accessible form of acoustic analysis, OperaVOX™ may be more clinically useful than laboratory-based software in many situations. The aim of this study was to evaluate the agreement between OperaVOX™ and the Multi-Dimensional Voice Program (MDVP; Computerized Speech Lab) in assessing voice quality before and after laryngeal microsurgery in patients with vocal polyps. Materials and Method: Twenty patients who had undergone laryngeal microsurgery for vocal polyps were enrolled in this study. Preoperative and postoperative voices were assessed by acoustic analysis using MDVP and OperaVOX™. A five-second recording of the vowel /a/ was used to measure fundamental frequency (F0), jitter, shimmer, and noise-to-harmonic ratio (NHR). Results: Several acoustic parameters of MDVP and OperaVOX™ related to short-term variability showed significant improvement. In MDVP, the preoperative values of F0, jitter, shimmer, and NHR were 155.75 Hz (male: 125.37 Hz, female: 183.37 Hz), 2.20%, 6.28%, and 0.16, and the postoperative values were 164.34 Hz (male: 129.42 Hz, female: 199.26 Hz), 2.15%, 5.18%, and 0.14. In OperaVOX™, the preoperative values of F0, jitter, shimmer, and NHR were 168.26 Hz (male: 135.16 Hz, female: 201.37 Hz), 2.27%, 6.95%, and 0.26, and the postoperative values were 162.72 Hz (male: 128.267 Hz, female: 197.18 Hz), 1.71%, 5.36%, and 0.20. The intraclass correlation coefficient showed high intersoftware agreement for F0, jitter, and shimmer. Conclusion: Our results showed that the acoustic parameters of short-term variability in both MDVP and OperaVOX™ were useful for the objective assessment of voice quality in patients who received laryngeal microsurgery. OperaVOX™ is comparable to MDVP and has high intersoftware reliability with MDVP in measuring F0, jitter, and shimmer.

The Effects of Voice Therapy in Vocal Process Granuloma

  • 김성태;최승호;남순열
    • 말소리와 음성과학 / Vol. 2, No. 4 / pp. 165-171 / 2010
  • Vocal process granuloma is commonly caused by laryngopharyngeal reflux (LPR) or by vocal abuse or misuse. Voice therapy has been reported to be used together with medication for patients with vocal process granuloma, but research on the effect of voice therapy is scarce. The primary aim of this study was therefore to evaluate the effect of the therapeutic method we implemented. Thirty-one patients diagnosed with vocal process granuloma from January 2007 to June 2009 participated in this study. Of these, 19 patients received voice therapy and medication, and 12 patients received medication only. Voice therapy ranged from 5 to 19 sessions (mean: 8.6 sessions). Sessions included an explanation of each patient's problem, voice rest, SKMVTT®, abdominal breathing, and relaxation. All subjects were examined by videostroboscopy, perceptual assessment, and acoustic and aerodynamic measures. As a result, most of the patients treated with voice therapy and medication (78.9%) showed disappearance or reduction of the granuloma, a better result than in the medication-only group (66.7%). Notably, the period of drug administration was 3.7 months in the group that also received voice therapy, compared with 7.8 months in the other group. Acoustic and aerodynamic measures after treatment showed significant decreases in jitter, shimmer, and NHR, and increases in MPT and Psub (p<.05). However, although voice quality improved after therapy, the difference was not statistically large. In conclusion, voice therapy was confirmed to be an effective method for patients with vocal process granuloma who are taking medication, and we recommend combining voice therapy with medication when treatment is needed for these patients.

Effects of vowel types and sentence positions in standard passage on auditory and cepstral and spectral measures in patients with voice disorders

  • 최미현;최성희
    • 말소리와 음성과학 / Vol. 15, No. 4 / pp. 81-90 / 2023
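  • Auditory-perceptual assessment and acoustic analysis are commonly used in clinical practice for voice evaluation. This study investigates the effects of speech task on auditory-perceptual and acoustic measures in patients with voice disorders. Sustained vowel phonations (/a/, /e/, /i/, /o/, /u/, /ɯ/, /ʌ/) and connected speech (the nine sub-sentences of the standard passage '가을' ("Autumn")) were recorded from a total of 22 patients diagnosed with voice disorders. Two speech-language pathologists with experience in assessing and treating voice disorders performed auditory-perceptual evaluation of blinded, randomized voice samples using the GRBAS scale ('G', 'R', 'B', 'A', 'S') and the CAPE-V ('OS', 'R', 'B', 'S', 'P', 'L'). In addition, cepstral and spectral measures were obtained using ADSV (Analysis of Dysphonia in Speech and Voice model). Vowel type did not affect auditory-perceptual ratings on the GRBAS scale except for 'B', but it affected 'OS', 'R', and 'B' on the CAPE-V (p<.05). CPP and L/H ratio were affected by vowel type and sentence position. The CPP value of the standard passage showed significant negative correlations with 'G' and the nine sub-sentences for all vowels, and the correlation was highest for the vowel /e/ (r=-.739). The CPP of the second sentence was highly correlated with all vowels. Auditory-perceptual ratings on the CAPE-V may be more affected by speech stimuli than those on the GRBAS, and the 'B' scale, CPP, and L/H ratio were affected by vowel type and by sentence position, including consonants. Therefore, when vowels are used for voice evaluation in patients with voice disorders, it may be useful to use not only /a/ but also the vowel /i/, which is acoustically highly correlated with 'breathy' voice quality. In addition, because the vowel /e/ was acoustically highly correlated with the Korean standard passage '가을' and its sub-sentences, it could be used in place of the passage. Furthermore, given that most disordered voice signals are aperiodic, the second sentence, which was the most acoustically correlated sentence in the standard passage, could be used together with CPP. These results provide clinical evidence of the effects of speech task on auditory-perceptual assessment and acoustic measures, which may help in providing guidelines for voice evaluation in patients with voice disorders.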

Performance Assessment of Several Established Pitch Detection Algorithms in Voices of Benign Vocal Fold Lesions

  • 장승진;최성희;김효민;최홍식;윤영로
    • 대한전자공학회: Conference Proceedings / Proceedings of the 대한전자공학회 2007 Summer Conference / pp. 407-408 / 2007
  • Robust pitch estimation is an important topic in many areas of speech processing. In voice pathology, diverse statistics extracted from pitch are commonly used to test voice quality. In this study, we compared several established pitch detection algorithms (PDAs) to verify their adequacy. Using a database of 99 pathological voices and 30 normal voices, pitch detection errors were analyzed between pathological and normal voices, and among types of pathological voices with benign vocal fold lesions: polyps, nodules, and cysts. Consequently, the severity of the tested voice must be assessed in order to obtain accurate pitch estimates.
