• 제목/요약/키워드: sustained vowel phonation

검색결과 32건 처리시간 0.024초

모음연장 음성 샘플의 분석 구간에 따른 음향학적 파라미터 비교 (Comparison of Acoustic Parameters According to the Section of Analysis in Sustained Vowel Phonation)

  • 신유정
    • 한국산학기술학회논문지
    • /
    • 제18권7호
    • /
    • pp.269-274
    • /
    • 2017
  • 본 논문은 임상에서 음성장애 환자의 객관적 음성 분석 대상으로 주로 쓰이는 모음연장 발성이 분석하는 구간에 따라 어떠한 음향학적 차이를 보이는지 밝히고자 하였다. 본 연구에서는 성대결절 환자 17명의 /아/ 모음연장 발성을 시작, 중간, 끝 구간으로 편집하여 MDVP를 통해 각 구간의 jitter, shimmer, NHR을 산출하였고, 비교를 위하여 정상 음성 집단 12명의 음성도 분석하였다. 산출 결과는 R 통계프로그램을 활용하여 Fridman test와 사후 검정을 실시하였다. 음성장애 환자집단은 모음연장 발성의 끝 구간이 중간 구간에 비해 jitter, shimmer, NHR 값이 모두 유의하게 높은 것으로 나타났다. 또한, 발성의 시작 구간은 중간 구간에 비해 세 파라미터 모두에서 높게 산출됐지만 유의한 차이는 없었다. 반면, 정상 집단은 발성의 시작, 중간, 끝 모든 구간에서 유의한 차이가 없었다. 모음연장 발성은 구간에 따라 음향학적 파라미터의 분석 결과가 다르고 발성 끝 구간에서 중간 구간보다 유의하게 음성이 불안정해지는 것으로 나타났다. 이러한 결론은 임상 현장에서 모음연장 발성의 분석 구간 선택과 결과 해석에 유용하게 활용될 수 있을 것이다.

발성장애 평가 시 /a/ 모음연장발성 및 문장검사의 켑스트럼 분석 비교 (Comparison of Vowel and Text-Based Cepstral Analysis in Dysphonia Evaluation)

  • 김태환;최정임;이상혁;진성민
    • 대한후두음성언어의학회지
    • /
    • 제26권2호
    • /
    • pp.117-121
    • /
    • 2015
  • Background : Cepstral analysis which is obtained from Fourier transformation of spectrum has been known to be effective indicator to analyze the voice disorder. To evaluate the voice disorder, phonation of sustained vowel /a/ sound or continuous speech have been used but the former was limited to capture hoarseness properly. This study is aimed to compare the effectiveness in analysis of cepstrum between the sustained vowel /a/ sound and continuous speech. Methods : From March 2012 to December 2014, total 72 patients was enrolled in this study, including 24 unilateral vocal cord palsy, vocal nodule and vocal polyp patients, respectively. The entire patient evaluated their voice quality by VHI (Voice Handicap Index) before and after treatment. Phonation of sustained vowel /a/ sample and continuous speech using the first sentence of autumn paragraph was subjected by cepstral analysis and compare the pre-treatment group and post-treatment group. Results : The measured values of pre and post treatment in CPP-a (cepstral peak prominence in /a/ vowel sound) was 13.80, 13.91 in vocal cord palsy, 16.62, 17.99 in vocal cord nodule, 14.19, 18.50 in vocal cord polyp respectively. Values of CPP-s (cepstral peak prominence in text-based speech) in pre and post treatment was 11.11, 12.09 in vocal cord palsy, 12.11, 14.09 in vocal cord nodule, 12.63, 14.17 in vocal cord polyp. All 72 patients showed subjective improvement in VHI after treatment. CPP-a showed statistical improvement only in vocal polyp group, but CPP-s showed statistical improvement in all three groups (p<0.05). Conclusion : In analysis of cepstrum, text-based analysis is more representative in voice disorder than vowel sound speech. So when the acoustic analysis of voice by cepstrum, both phonation of sustained vowel /a/ sound and text based speech should be performed to obtain more accurate result.

  • PDF

인공와우이식 아동과 건청 아동의 음성 분석 비교 (A Comparison of Voice Analysis of Children with Cochlear Implant and with Normal Hearing)

  • 윤미선;최은아;성영주
    • 말소리와 음성과학
    • /
    • 제5권4호
    • /
    • pp.71-78
    • /
    • 2013
  • The purpose of this study was to compare the acoustic voice outcomes of children with cochlear implant to those of children with normal hearing. Participants were 41 children using unilateral cochlear implant (18 males and 23 females), and children with normal hearing from the same age and sex. Mean age of implantation was approximately 3 years old, mean duration of implant use was 4 years in CI group. Acoustic analyses were performed using MDVP of CSL. Speech samples were 3 sustained vowels, /a, i, u/. 9 parameters (F0, Fhi, Flo, Jitter, Shimmer, vF0, vAm, NHR, and SPI) were analyzed. Children with CI did not show the significant differences in those parameters after the vowel /a/ phonation. Meanwhile, there were significantly different results in F0, Fhi, vF0, and SPI after /i, u/ phonation. These results revealed that differences of voice characteristics in children with CI compare to children with NH persist regarding vowel context. It suggests that high vowels would recommend as speech samples for acoustic evaluation. Futhermore perceptual analysis and speech therapy for phonation control would be necessary for children with CI.

지속적으로 발성한 모음에 의한 화자인식 (Automatic Speaker Identification by Sustained Vowel Phonation)

  • 배건성
    • 한국음향학회지
    • /
    • 제11권1호
    • /
    • pp.35-41
    • /
    • 1992
  • 지속적으로 발성한 모음에 대해 각 화자의 특징을 나타내는 벡터양자화 코드북을 만들고 이를 이용해 화자를 인식하는 방법을 제안하고 실험하였다. 특히 벡터로는 모음 /이/로 부터 각각의 피치 주기에 대해 얻어진 선형예측계수를 사용하였으며, 코드북의 크기는 4가 적절함을 실험적으로 보였다. 인식실험에서, 학습에 사용된 데이타를 이용했을 경우에는 99.4%의 인식율을 보였으며, 학습에 사용되지 않은 50개의 피치 주기를 포함하는 음성신호로 부터는 89.4%의 인식율을 보였다.

  • PDF

최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법을 이용한 지속 모음 모델링 (Sustained Vowel Modeling using Nonlinear Autoregressive Method based on Least Squares-Support Vector Regression)

  • 장승진;김효민;박영철;최홍식;윤영로
    • 한국지능시스템학회논문지
    • /
    • 제17권7호
    • /
    • pp.957-963
    • /
    • 2007
  • 본 연구에서는 비선형 지속 모음 모델링을 위한 최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법을 소개하고 분석하였다. 비주기적인 파형 특성을 갖는 양성 후두 질환자 43명의 지속 모음을 대상으로 한 실험에서 제안된 비선형 합성기는 거의 완벽하게 혼란한 지속 모음을 생성하고 선형 예측 코딩은 할 수 없는 주파수 변동과 같은 자연스러운 음의 특성 또한 보존할 수 있었다. 하지만 일부 모음의 합성 결과 실제 원음과 다른 차이점을 보였다. 이러한 결과들은 단일 밴드 모델이 음의 고주파 성분을 조정, 분해 못하기 때문에 발생한 것이라 가정된다. 그러므로 웨이블릿 필터 뱅크를 이용한 멀티 밴드 모델을 단일 밴드 모델과 대치하여 실험을 수행한 결과 향상된 안정성을 보였다. 결과적으로 최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법은 성공적으로 원음에 가까운 합성음을 생성할 수 있다는 것을 확인 할 수 있었다.

발성방법에 따른 소프라노 성악도의 음성 특성 (The characteristics of soprano students' voice related to the vocal methods)

  • 김정택;성철재
    • 말소리와 음성과학
    • /
    • 제9권3호
    • /
    • pp.75-83
    • /
    • 2017
  • The purpose of this study is to find clues to the risk of voice disorders in soprano students. The subjects of the study were 17 soprano students and 18 general students (women). The phonation of vowels /a/, /i/, and /u/ with C4 and F4 notes in each group were recorded. Then, only soprano students were made to record their classical vocalization containing vibrato. Formant, formant energy, bandwidth, VAI (vowel area index), VSA (vowel space area) and L/H ratio were analyzed. There was significant difference in F3 such that the singers' note was measured around 3 kHz which seems to be 400 Hz higher than one from general students. But, There was no significant difference in L/H ratio between soprano student and the general student. There was a significant difference in F3 in the comparison of the soprano students' two vocalization methods. Classical vocalization was measured at 200Hz higher than sustained phonation in F3. Vocal tract adjustment was made and vowel space changed, but there was no significant difference in F3 energy, which is the index of singers' formant according to the phonation method. The L/H ratio, which can be a direct indicator of vocal effort, has no difference in phonation method and is lowered in all phonation methods as the pitch increases. C4 and F4 pitches are lower than the singing range of the soprano. When the pitch changes, vocal effort increases like a general student which will be an indicator of the risk of vocalization. This will be a clue to the vocalization of the immature soprano student.

모음 연장 발성이 보이는 연령대별 음향음성학적 특성 연구 (Acoustic characteristics of the sustained vowel phonation according to age groups)

  • 서윤정;신지영
    • 말소리와 음성과학
    • /
    • 제10권4호
    • /
    • pp.67-76
    • /
    • 2018
  • This study was performed to investigate acoustic characteristics of sustained vowels produced by Seoul Korean speakers. For this study, three hundred nine healthy adults were chosen as participants from Korean Standard Speech Database. These subjects were divided into five chronological age groups (20s, 30s, 40s, 50s, 60-70s) and two gender groups (male and female). Fundamental frequency (f0), jitter, shimmer, and NHR (noise-to-harmonics ratio) was measured with 8 Korean vowels (/ɑ/, /æ/, /ʌ/, /e/, /o/, /u/, /ɯ/, /i/) by using Praat. The results showed that the vowel type significantly affected all acoustic parameters. Gender affected f0, jitter, and NHR significantly. The mean female speakers' f0 was greater than the males', and the mean jitter and NHR of male speakers was greater than the females'. Moreover, age affected shimmer and NHR significantly; in particular, the shimmer and NHR of elderly speakers was greater than the young speakers.

연속모음에서의 Electroglottograph 신호해석에 의한 후두기능 평가 (Layngeal Function Assessment by Electroglottographic Signal Analysis during Sustained Vowel Phonation)

  • 송철규;이명호
    • 대한의용생체공학회:학술대회논문집
    • /
    • 대한의용생체공학회 1994년도 춘계학술대회
    • /
    • pp.79-81
    • /
    • 1994
  • Petubation in the fundamental frequency and in the peak amplitude of the EGG signal derived with a four-electrode EGG system were investigated for the purpose of developing useful measures for the detection of layngeal pathology. The data were compared to the degree of amplitude perturbation and frequency perturbation. There was a close relation between amplitude perturbation and frequency perturbation analysis of EGG signal and degree of laryngeal pathology.

  • PDF

Trill 발성시 전기성문파 측정검사로 분석한 성대점막 진동의 변화 : 예비연구 (Alterations of Mucosal Vibration of True Vocal Folds on Tongue-Tip Trill : Preliminary Study Using the Electroglottography)

  • 진성민;반재호;김남훈;이경철;권기환;이용배
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.76-80
    • /
    • 2000
  • Tongue-tip trill is a sound made by the tongue tip making contract with the alveolar ridge and oscillating rapidly as sound is produced. It is an exercise used by many singers to warm up the voice and used as one of the methods of voice rehabilitation for patients who have the vocal folds scarred postoperatively and also who present with a variety of disorders, particularly hypofunction and presbyphonia. We intended to investigate the mucosal vibration of the true vocal folds on tongue-tip trill by electroglottography and to find e effective methods of tongue-tip trill. One adult male volunteer participated. Spectrography and electroglottography were checked repeatedly 15 times, more than 5 second in each times, at same pitch, in three conditions of phonation : sustained /a/ vowel, anterior trill in which tongue-tip vibrated at anterior portion of alveolar ridge just behind the anterior tooth, and posterior trill in which at palatal crest behind the transverse palatine fold We measured the first and second formant to determine indirectly the position of tongue and calculated speed quotient and the ratio of closing phase to closed phase. Speed quotients of posterior trill were higher than sustained /a/ vowel and anterior trill in 14 times. The ratio of closing phase to dosed phase of posterior trill were lower than the others in 14 times. Mucosa of true vocal folds is vibrated more effectively on posterior trill rather than sustained /a/ vowel and anterior trill. So, when tongue-tip trill is used as a method of voice rehabilitation, we suggest that posterior trill is better in producing effective mucosal vibration

  • PDF