• Title/Summary/Keyword: Shimmer

Search Result 243, Processing Time 0.021 seconds

Separation of Periodic and Aperiodic Components of Pathological Speech Signal (장애음성의 주기성분과 잡음성분의 분리 방법에 관하여)

  • Jo Cheolwoo;Li Tao
    • Proceedings of the KSPS conference
    • /
    • 2003.10a
    • /
    • pp.25-28
    • /
    • 2003
  • The aim of this paper is to analyze the pathological voice by separating signal into periodic and aperiodic part. Separation was peformed recursively from the residual signal of voice signal. Based on initial estimation of aperiodic part of spectrum, aperiodic part is decided from the extrapolation method. Periodic part is decided by subtracting aperiodic part from the original spectrum. A parameter HNR is derived based on the separation. Parameter value statistics are compared with those of Jitter and Shimmer for normal, benign and malignant cases.

  • PDF

Effects of Aging and Smoking on Acoustic Characteristics of Voice (노화와 흡연에 따른 음성 변화의 측정)

  • 남의철;남순열;이광선
    • Proceedings of the KSLP Conference
    • /
    • 1996.11a
    • /
    • pp.75-75
    • /
    • 1996
  • 노화와 흡연에 따른 음성의 변화에 대하여 객관적인 음향 지표들을 측정함으로써, 노화와 흡연에 따른 정상적인 음성의 변화와 질병에 기인한 변화를 감별하는 지표를 제시하고자 본 연구를 시행하였다. 정상의 발성기관과 청력을 가진 20세 이상의 성인으로, 60세 이상군과 35세 이하군으로 남녀 각각 30명을 대상으로 CSL50-MDVP(Computerized Speech Lab50-Multidimensional voice program)을 이용하여 기본 주파수(Fundamental frequency), jitter, shimmer, NHR(Noise to harmonic ratio)을 측정하였다. (중략)

  • PDF

Acoustic and Stroboscopic Characteristics of Normal Person's Voices with Advancing Age (연령증가에 따른 정상 노인의 음향분석학적 특징)

  • 진성민;권기환;강현국
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.8 no.1
    • /
    • pp.44-48
    • /
    • 1997
  • Anatomic and physiological changes of the larynx with advancing age result in morphologic changes of the vocal fold and reduced control of the phonatory mechanism in elderly individuals and are reflected in increased unstability of fundamental frequency (Fo). The purpose of this study is to increase current understanding of acoustic and stroboscopic characteristics of normal elderly persons voices. First, phonated /a/ vowel productions by 40 normal adults (20 to 40 years, 20 men and 20 women) and 40 normal elderly persons (60 to 80 years,20 men and 20 women) were analyzed, using CSL (model 4300B) acoustic analysis software, to obtain acoustic measures related to fundamental frequency stability nd vocal resonance characteristics. Second, stroboscopic images of the vocal fold behavior in all subjects were analyzed by experienced specialists. In the men, fundamental frequency variation (vFe) (p<0.01), jitter. (p<0.05), and shimmer (p<0.05) for the older group were significantly higher than the value for the adult group. In the stroboscopic findings, edema of vocal fold had a significant finding in aged men (15%). In the women, vFo (p<0.05), jitter (p<0.05), and noise to harmonic ratio (NHR) (p<0.05) for the older group were significantly higher than the value for e adult group and first formant frequency (F1) (p<0.01) and second formant frequency (F2) (p<0.01) for. the older group were significantly lower than the value for the adult group. In the stroboscopic findings, vocal fold atrophy had a significant finding in aged women (25%). Frequency stability, as reflected by vFo, jitter, shimmer, and NHR, decreases with advancing age in men and women and spectral analysis of phonated /a/ vowel productions reveals the lowering of the frequency of F1 and second F2 with advancing age, especially in aged women. Change in the mass of vocal folds, due to atrophy or edema, is considered to be the greatest factor in these acoustic changes.

  • PDF

A Study on the Sasang Constitutional Symptom of Taeumin by Voice Characteristics (음향특성에 따른 태음인 체질병증(體質病證) 연구(硏究))

  • Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine
    • /
    • v.19 no.1
    • /
    • pp.90-97
    • /
    • 2007
  • 1. Objectives and Methods This study was done to investigate the relationships of Sound parameters between Liver Heat Symptom and Esophagus Symptom of Taeumin using PSSC(Phonetic System of Sasang Constitution) in a sentence. Experimental Participants were 20 Korean adult males including, each 10 Liver Heat Symptom and Esophagus Symptom of Taeumin. 2. Results In Pitch segment, APQ segment and Shimmer segment, there were no significant differences between Liver Heat Symptom and Esophagus Symptom of Taeumin. In Octave segment, there were significant differences in Octave 1, Octave 3, Octave 4, Octave 6 of Liver Heat Symptom of Taeumin were significantly high compared with Esophagus Symptom of Taeumin. In Energy segment, FreQ Domain Total Sum / cnt(0), 0k-2k Total Sum,0k-2k sum dev., 2k-4k Total Sum, 2k-4k sum dev., A# Tot E, B__TOT_E, C__TOT_E, C# Tot E, D__TOT_E, A sum dev., A# sum dev., B sum dev., C sum dev., C# sum dev., Dsum dev., D# sum dev., E sum dev., F sum dev., F# sum dev., G sum dev., G# sum dev. of Liver Heat Symptom of Taeumin were significantly high compared with Esophagus Symptom of Taeumin. In Voice Recording time segment, Total Voice Recording Time, Voice Recording Time, Divide By Time3, Divide By Energy10, Total Unit, Max Unit Position, U_0 TO 3 of Liver Heat Symptom of Taeumin were significantly high compared with Esophagus Symptom of Taeumin. 3. Conclusion From above result, there is the postbility of efficiency quide constitutional sx. of Taeumin by Voice characteristics. More Soeumin, Soyangin and Taeyangin Symptoms are needed to determine Sasang Constitution using PSSC and to make PSSC effective.

  • PDF

Comparison of the Voice and Treatment Results after Laser Cordectomy or Radiotherapy on Tla Staged Glottic Cancer (Tla 병기의 성문암에 대한 레이저 절제술과 방사선 치료 비교)

  • 남순열;이윤세;김찬종;김종찬;김범규;김상윤
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.13 no.2
    • /
    • pp.139-144
    • /
    • 2002
  • Background and objectives : The various voice-conserving treatments are used for Tla staged glottic cancer. Especially, Tla staged glottic cancer has been shown excellent treatment result after laser cordectomy or radiotherapy. To evaluate which treatment results better voice after treatment made it valuable to define the exact indication and recommending treatment modality on the Tla staged glottic cancer patients. Method : The medical records of 75 patients with glottic TlaN0 cancer diagnosed at Asan medical center, University of Ulsan college of medicine form May, 1989 to July,2001 were retrospectively reviewed on the point of voice quality and oncology including 5-year survival rate and local control rate. Results : Laser cordectomy and radiotherapy showed 100% and 94.0% 5-year survival rate, respectively. And laser cordectomy had 94.3% local control rate while radiotherapy got 87.6% local control rate. Voice analysis of pretreatment and posttreatment were used to compare each result. Fundamental frequency(F0), shimmer, jitter, noise to harmony ratio(NHR), maximum confortable phonation time(MPT) and vocal efficiency(VE) were used for parameters for voice analysis. Only in shimmer and MPT, we could find significant posttreatment difference between two therapies. In addition, we reviewed the total expenses for each therapy. Conclusion : On the basis of the oncologic result, both the laser cordectomy and radiotherapy had the similar results. Laser cordectomy showed the relatively acceptable voice as radiotherapy did. Laser cordectomy cost less than radiotherapy did. Laser cordectomy can be used for treatment about Tla staged glottic cancer.

  • PDF

Development of medical/electrical convergence software for classification between normal and pathological voices (장애 음성 판별을 위한 의료/전자 융복합 소프트웨어 개발)

  • Moon, Ji-Hye;Lee, JiYeoun
    • Journal of Digital Convergence
    • /
    • v.13 no.12
    • /
    • pp.187-192
    • /
    • 2015
  • If the software is developed to analyze the speech disorder, the application of various converged areas will be very high. This paper implements the user-friendly program based on CART(Classification and regression trees) analysis to distinguish between normal and pathological voices utilizing combination of the acoustical and HOS(Higher-order statistics) parameters. It means convergence between medical information and signal processing. Then the acoustical parameters are Jitter(%) and Shimmer(%). The proposed HOS parameters are means and variances of skewness(MOS and VOS) and kurtosis(MOK and VOK). Database consist of 53 normal and 173 pathological voices distributed by Kay Elemetrics. When the acoustical and proposed parameters together are used to generate the decision tree, the average accuracy is 83.11%. Finally, we developed a program with more user-friendly interface and frameworks.

The Changes in the Closed Qutient of Trained Singers and Untrained Controls Under Varying Intensity at a Constant Vocal Pitch (음도 고정 시 강도 변화에 따른 일반인과 성악인 발성의 성대접촉률 변화 특성의 비교)

  • Kim, Han-Su;Jeon, Yong-Sun;Chung, Sung-Min;Cho, Kun-Kyung;Park, Eun-Hee
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.28-32
    • /
    • 2005
  • Background and Objectives : The most important two factors of the voice production are the respiratory function which is the power source of voice and the glottic closure that transform the air flow into sound signals. The purpose of this study was to investigate the differences between trained singers and untrained controls under varying intensity at a constant vocal pitch by simulataneous using the airway interruption method and electroglottography(EGG). Materials and Methods : Under two different intensity condition at a constant vocal pitch(/G/), 20(Male 10, Female 10) trained singers were studied. Mean flow rate(MFR), subglottic pressure(Psub) and intensity were measured with aerodynamic test using the Phonatory function analyzer. Closed quotients(CQ), jitter and shimmer were also investigated by electroglottography using Lx speech studio. These data were compared with that of normal controls. Results : MFR and Psub were increased on high intensity condition in all subject groups but there was no statistically significance. Statistically significant increasing of CQ. were observed in male trained singers on high intensity condition (untrained male : 51.31${\pm}$3.70%, trained male :55.52${\pm}$6.07%, p=.039). Shimmer percent, one of the phonatory stability parameters, was also decreased statistically in all subject groups(p<.001). Conclusion : The trained singers' phonation was more efficient than untrained singers. The result means that the trained singers can increase the loudness with little changing of mean flow rate, subglottic pressure but more increasing of glottic closed quotients.

  • PDF

The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing (음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성)

  • Choi, Seong Hee;Choi, Chul-Hee
    • Phonetics and Speech Sciences
    • /
    • v.6 no.3
    • /
    • pp.63-72
    • /
    • 2014
  • The current study assessed the utility of acoustic analyses the most commonly used in routine clinical voice assessment including perturbation, nonlinear dynamic analysis, and Spectral/Cepstrum analysis based on signal typing of dysphonic voices and investigated their applicability of clinical acoustic analysis methods. A total of 70 dysphonic voice samples were classified with signal typing using narrowband spectrogram. Traditional parameters of %jitter, %shimmer, and signal-to-noise ratio were calculated for the signals using TF32 and correlation dimension(D2) of nonlinear dynamic parameter and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV(Analysis of Dysphonia in Speech and VoiceTM). Auditory perceptual analysis was performed by two blinded speech-language pathologists with GRBAS. The results showed that nearly periodic Type 1 signals were all functional dysphonia and Type 4 signals were comprised of neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant signal typing-related differences were found in all acoustic and auditory-perceptual measures. SNR, CPP, L/H ratio values for Type 4 were significantly lower than those of other voice signals and significant higher %jitter, %shimmer were observed in Type 4 voice signals(p<.001). Additionally, with increase of signal type, D2 values significantly increased and more complex and nonlinear patterns were represented. Nevertheless, voice signals with highly noise component associated with breathiness were not able to obtain D2. In particular, CPP, was highly sensitive with voice quality 'G', 'R', 'B' than any other acoustic measures. Thus, Spectral and cepstral analyses may be applied for more severe dysphonic voices such as Type 4 signals and CPP can be more accurate and predictive acoustic marker in measuring voice quality and severity in dysphonia.

The Correlation of Voice Characteristics and Depression Index Analysis in Accordance with Menstrual Cycle (월경주기에 따른 우울지수 정도와 음성특성과의 상관관계 분석)

  • Kim, YuMi;Jang, Seoung-Jin;Kim, Eunyeon;Choi, Yaelin
    • Phonetics and Speech Sciences
    • /
    • v.6 no.3
    • /
    • pp.41-48
    • /
    • 2014
  • This study investigated the differences between emotional parameters BDI, VHI, STAI-X-I and STAI-X-II according to the menstrual cycles of the female and the relation between changes of the depression index and voice characteristics (jitter, shimmer, CPP, HNR, $pF0{\cdot}F1{\cdot}F2{\cdot}F3$, sF0, sF4, sB1, $H1_{c/u}$, $A1_u$, $A3_c$, $H1A3_{c/u}$, $H1A1_u$). Twenty three females ($30{\pm}4.4$ years old) living in Seoul and Gyeonggi Province were participated in this study to answer the questionnaires and record their voice. The participants prolonged /a/ vowel for 5 seconds in a natural condition for their voice recording. Voice data were analyzed using the Matlab and Praat program. A t-test and a correlation analysis were conducted by using SPSS for the statistical analysis. The results are as follows. First, the BDI is significantly higher in group I (lurear phase contrast the menstrual period) and group II (follicular phase against the menstrual period) than group III (luteal phase for follicular phase) (p<.05). Second, shimmer, CPP, pF0 showed a statistically high correlation regarding the BDI in group I (lurear phase contrast the menstrual period). Voice parameters may be useful as supplement in evaluating the emotional change in the phase of menstrual cycle.

Acoustic parameter delta of an aspirated voice in stroke patients (뇌졸중 환자 대상 흡인 음성의 음향변수 변동)

  • Kang, Young Ae;Jee, Sung Ju;Koo, Bon Seok;Jo, Cheolwoo
    • Phonetics and Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.85-91
    • /
    • 2017
  • The present study aimed to investigate the changes of acoustic parameters of the aspirated voice in stroke patients. The eighty-eight subjects diagnosed with cerebro-vascular accident were divided into 32 penetration/aspiration (P/A) and 56 Non-P/A groups according to the videofluroscopic swallowing study (VFSS) results, and 26 control subjects participated. All subjects preformed VFSS and vowel /a/ was recorded three times pre- and post VFSS. Since the variation in the acoustic parameters within a single phonation has been observed, we proposed a delta formula for the acoustic parameters which can reflect the temporal changes of the each parameter in an utterance. We measured from the voice data eight acoustic parameters: fundamental frequency (F0), standard deviation of F0 (F0_SD), Jitter, relative average perturbation (RAP), Shimmer, amplitude perturbation quotient (APQ), harmonic to noise ration (HNR), noise to harmonic ratio (NHR). Then we found parameters which show the meaningful biggest temporal change in an utterance using the suggested delta parameter. Among them, the deltas of shimmer and APQ were significantly different pre- and post VFSS. These deltas of the P/A and the control group were increased after VFSS, while those of the Non-P/A group was descended. The variation patterns of the P/A and the control group were similar but the change width of the P/A group was larger. The large variations in an aspirated phonation of the P/A group are thought to be caused by irregular changes in air resistance due to residual food on the vocal cords.