• Title/Abstract/Keywords: Perceptual evaluation

Search results: 248 items (processing time: 0.029 s)

Use of Acoustic Analysis for Individualised Therapeutic Planning and Assessment of Treatment Effect in the Dysarthric Children

  • 김연희;유희;신승훈;김현기
    • 음성과학
    • /
    • Vol. 7, No. 2
    • /
    • pp.19-35
    • /
    • 2000
  • Speech evaluation and treatment planning for patients with articulation disorders have traditionally been based on perceptual judgement by speech pathologists. Recently, various computerized speech analysis systems have been developed and are commonly used in clinical settings to obtain objective, quantitative data and to guide specific treatment strategies. Ten dysarthric children (6 with neurogenic and 4 with functional dysarthria) participated in this experiment. Speech was evaluated in two ways: first, by acoustic analysis with Visi-Pitch and a Computerized Speech Lab, and second, by perceptual scoring of phonetic error rates in a 100-word test. The results of the initial evaluation served as primary guidelines for individualized treatment planning for each patient's speech problems. After a mean treatment period of 5 months, the follow-up data of both dysarthric groups showed increased maximum phonation time, increased alternating motion rate, and decreased occurrence of articulatory deviation. The changes in acoustic data and the therapeutic effects were more prominent in children with dysarthria due to neurologic causes than in those with functional dysarthria. Three cases, including their pre- and post-treatment data, are illustrated in detail.

The Effects of Vocal Relaxation Training on Voice Improvement of Children with Vocal Nodules

  • 한지은;성철재
    • 말소리와 음성과학
    • /
    • Vol. 4, No. 4
    • /
    • pp.147-154
    • /
    • 2012
  • The purpose of this study is to examine how much the voice improves when vocal training that relaxes vocal contact is applied to children with vocal nodules. The subjects were 20 boys aged 5 to 12 years with vocal nodules who were seen in an otolaryngology department and for whom voice therapy had been advised. The voice therapy was conducted for 40 minutes per week, for a total of eight sessions. Results were evaluated by videostroboscopy, auditory-perceptual evaluation with the GRBAS scale, aerodynamic testing, and acoustic analysis before and after therapy. First, the size of the vocal nodules was reduced and the unstable pattern of vocal contact improved. Glottic closure increased and phase symmetry decreased during vocal vibration. The mucosal wave increased and muscle tension of the larynx was reduced. Second, auditory-perceptual evaluation showed that the subjects' overall voice quality improved. GRBAS scale evaluation showed that the rough, breathy, and strained characteristics of the subjects' voices were reduced after therapy. Third, the acoustic parameters showed statistically significant improvement: the fundamental frequency of the subjects' voices increased, and the values of jitter, shimmer, NHR, and [H1-H2] decreased. Fourth, the children's maximum phonation time increased. These results imply that the vocal relaxation training conducted in this study has a very positive effect on the voices of children with vocal nodules.

The Study of Tonsil Affected Voice Quality after Tonsillectomy

  • 안철민;정덕희
    • 대한후두음성언어의학회지
    • /
    • Vol. 9, No. 1
    • /
    • pp.32-37
    • /
    • 1998
  • Tonsillectomy is one of the most commonly performed operations in otolaryngology. It can change vocal range, tone, voice quality, and resonance, and some patients suffer from these voice problems after the operation. However, few studies have addressed these problems so far. We therefore sought to identify the anatomical findings that affect voice quality when tonsillectomy is performed. We evaluated the voice in two groups: one group showed a normal pharyngeal space on transnasal fiberscopy, and the other showed tonsils bulging medially into the pharyngeal cavity on the same examination. The evaluation covered perceptual evaluation, nasalance score, nasality, oral formant, and nasal formant. We used a computerized speech analysis system, the nasometer, and the spectrogram in the CSL program. We could not find any difference in perceptual evaluation between the two groups, but the objective measures showed changes. After tonsillectomy in Group 2, the nasalance score and nasality on nasometric analysis increased significantly, and the oral formant on the spectrogram changed significantly. The authors conclude that a medially bulging tonsil in the pharynx, as seen through the nasal cavity by fiberscopy, can affect voice quality after tonsillectomy, and that this evaluation would be especially important in singers.

Voice Quality of Dysarthric Speakers in Connected Speech

  • 서인효;성철재
    • 말소리와 음성과학
    • /
    • Vol. 5, No. 4
    • /
    • pp.33-41
    • /
    • 2013
  • This study investigated the perceptual and cepstral/spectral characteristics of phonation, and their relationships, in the connected speech of speakers with dysarthria. Twenty-two participants were divided into two groups: eleven dysarthric speakers and eleven age- and gender-matched healthy controls. Three speech pathologists performed a perceptual evaluation using the GRBAS scale, and the cepstral/spectral characteristics of phonation were measured in both groups' connected speech. Group comparisons showed that dysarthric speakers were rated significantly worse (with higher ratings) in G (overall dysphonia grade), B (breathiness), and S (strain), while their smoothed cepstral peak prominence (CPPs) was significantly lower. The CPPs was significantly correlated with the perceptual ratings, including G, B, and S. The utility of the CPPs is supported by its strong relationship with perceptually rated dysphonia severity in dysarthric speakers. Receiver operating characteristic (ROC) analysis showed that a CPPs threshold of 5.08 dB achieved good classification of dysarthria, with 63.6% sensitivity and perfect specificity (100%). These results indicate that the CPPs reliably distinguished healthy controls from dysarthric speakers. However, the CPP frequency (CPP F0) and the low-high spectral ratio (L/H ratio) did not differ significantly between the two groups.
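
The sensitivity and specificity reported for the 5.08 dB threshold can be illustrated with a minimal sketch. The CPPs values below are hypothetical and are not data from the study; they were chosen only so that 7 of 11 dysarthric speakers fall below the threshold. The rule "below the threshold counts as dysarthric" is also an assumption, based on the abstract's statement that CPPs was lower in the dysarthric group.

```python
def sensitivity_specificity(patient_values, control_values, threshold):
    """Return (sensitivity, specificity) for a 'value < threshold' rule.

    Assumption: a speaker is classified as dysarthric when the CPPs
    value is strictly below the threshold.
    """
    true_positives = sum(v < threshold for v in patient_values)
    true_negatives = sum(v >= threshold for v in control_values)
    sensitivity = true_positives / len(patient_values)
    specificity = true_negatives / len(control_values)
    return sensitivity, specificity


# Hypothetical CPPs values in dB (illustration only, NOT study data).
dysarthric = [3.1, 3.8, 4.2, 4.5, 4.7, 4.9, 5.0, 5.3, 5.6, 6.0, 6.4]
controls = [5.2, 5.5, 5.9, 6.1, 6.3, 6.6, 6.8, 7.0, 7.2, 7.5, 7.9]

sens, spec = sensitivity_specificity(dysarthric, controls, threshold=5.08)
print(f"sensitivity={sens:.3f}, specificity={spec:.3f}")
```

With eleven speakers per group, 63.6% sensitivity corresponds to 7 of 11 dysarthric speakers being detected, and 100% specificity to no control being misclassified.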

Development and Evaluation of Functional Group Activity Program on Institutionalized Aged

  • 방요순;김희영
    • 근관절건강학회지
    • /
    • Vol. 18, No. 1
    • /
    • pp.83-92
    • /
    • 2011
  • Purpose: The purpose of this study was to identify changes in physical function, perceptual and cognitive function, emotional function, and functional independence in institutionalized elders after a functional group activity program (self-help Tai Chi exercise plus functional tasks). Methods: The subjects were 20 institutionalized elders studied from June to October 2010. They received the functional group activity program twice a week for 15 weeks. Physical function (grip strength, coordination, lower extremity strength, balance, gait, trunk flexibility), perceptual and cognitive function, emotional function (depression, social skill), and functional independence were measured before and after the program. Results: The subjects showed significantly improved physical function (coordination, lower extremity strength, gait, trunk flexibility), perceptual and cognitive function, emotional function (depression, social skill), and functional independence. The functional group activity program may be an effective strategy for institutionalized elders to enhance their functions. Conclusion: The functional group activity program may be effective in institutions for the elderly that have limited human, material, and environmental resources.

A Multi-category Task for Bitrate Interval Prediction with the Target Perceptual Quality

  • Yang, Zhenwei;Shen, Liquan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 15, No. 12
    • /
    • pp.4476-4491
    • /
    • 2021
  • Video service providers often face problems with users' networks when transmitting video streams, and they strive to provide users with superior video quality in a bitrate-limited environment. It is therefore necessary to accurately determine the target bitrate range of a video under different quality requirements. Several schemes have recently been proposed to meet this requirement, but they do not take the influence of visual factors into account. In this paper, we propose a new multi-category model that uses machine learning to accurately predict the target bitrate range for a target visual quality. First, a dataset is constructed for training the multi-category models; the quality score ladders and the corresponding bitrate-interval categories are defined in the dataset. Second, several types of spatial-temporal features related to the VMAF evaluation metric and to visual factors are extracted and processed statistically for classification. Finally, bitrate prediction models trained on the dataset with a RandomForest classifier are used to predict the target bitrate of input videos for a target video quality. The classification accuracy of the model reaches 0.705, and video encoded at the bitrate predicted by the model achieves the target perceptual quality.
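
Framing bitrate prediction as classification means each video is labelled with the bitrate interval it falls into. The following is a minimal sketch of such a labelling step. The interval edges and the names `INTERVAL_EDGES_KBPS`, `bitrate_category`, and `category_bounds` are hypothetical; the abstract does not state the intervals actually used.

```python
from bisect import bisect_right

# Hypothetical interval edges in kbps (assumption, not from the paper).
INTERVAL_EDGES_KBPS = [500, 1000, 2000, 4000, 8000]


def bitrate_category(bitrate_kbps):
    """Map a bitrate to the index of the half-open interval containing it.

    Category 0 is [0, 500), category 1 is [500, 1000), and so on;
    the last category covers everything from 8000 kbps upward.
    """
    return bisect_right(INTERVAL_EDGES_KBPS, bitrate_kbps)


def category_bounds(category):
    """Return (lower, upper) in kbps; upper is None for the open top interval."""
    lower = INTERVAL_EDGES_KBPS[category - 1] if category > 0 else 0
    upper = (INTERVAL_EDGES_KBPS[category]
             if category < len(INTERVAL_EDGES_KBPS) else None)
    return lower, upper


label = bitrate_category(1500)
print(label, category_bounds(label))
```

A classifier such as the RandomForest mentioned in the abstract would then be trained to predict this label from the extracted features, and the predicted label would be turned back into a bitrate range with `category_bounds`.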

Experimental Study on Subjective Sound Quality Evaluation of Vehicle Noises

  • 최병호
    • 한국소음진동공학회논문집
    • /
    • Vol. 14, No. 12
    • /
    • pp.1223-1232
    • /
    • 2004
  • This study aims to determine the number and characteristics of the psychologically meaningful perceptual dimensions required for assessing the sound quality of vehicle noises, and to identify the acoustical and/or psychoacoustical bases underlying preference and similarity judgments. To analyze the paired-comparison data produced by subjective ratings, we used nonmetric multidimensional scaling (MDS). The perceptual dimensions based on preference ratings explained 76.3% of the variance in terms of maximum dB(A) and sharpness (acum). The correlation between the objective and subjective positions of the stimuli was $R^2$=0.97 (F(1,13)=195.45, p < .01), corrected $R^2$=0.93. The lower the intensity of the stimulus, the more its subjective position was overestimated relative to the objective one, and the same held for the opposite case. The perceptual dimensions based on similarity judgments accounted for 47.8% and 23.5% of the variance, and might correspond to maximum dB(A) and sharpness (acum), respectively. The correlation between the objective and subjective positions of the stimuli was $R^2$=0.94 (F(1,13)=92.38, p < .01), corrected $R^2$=0.87. The higher the intensity of the stimulus, the more its subjective position was overestimated relative to the objective one, and the same held for the opposite case. In other words, it is likely that the larger the magnitudes of the two stimuli being compared, the more similar they are judged to be. It remains to be clarified which psychophysical models can best optimize the relationship between preference ratings and psychological distances.

Auditory-Perceptual and Acoustic Evaluation in Measuring Dysphonia Severity of Vocal Cord Paralysis

  • 김근효;이연우;박희준;배인호;이병주;권순복
    • 대한후두음성언어의학회지
    • /
    • Vol. 28, No. 2
    • /
    • pp.106-111
    • /
    • 2017
  • Background and Objectives: The purpose of this study was to investigate the criterion-related concurrent validity of two standardized auditory-perceptual assessments and the Acoustic Voice Quality Index (AVQI) for measuring dysphonia severity in patients with vocal cord paralysis (VCP). Materials and Methods: A total of 210 patients with VCP and 236 subjects with normal voices were asked to sustain the vowel [a:] and to read aloud the Korean text "Walk". A 2-second mid-vowel portion of the sustained vowel and two sentences (26 syllables) were recorded. The voice samples were then edited, concatenated, and analyzed with a Praat script. Two standardized auditory-perceptual assessments (GRBAS and CAPE-V) were performed by three raters. Results: The VCP group showed higher AVQI, Grade (G), and Overall Severity (OS) values than the normal voice group. The correlations among AVQI, G, and OS ranged from 0.904 to 0.926. In ROC curve analysis, the cutoff values of AVQI, G, and OS were <3.79, <0.00, and <30.00, respectively, and the AUC of each analysis was over .89. Conclusion: AVQI and auditory-perceptual evaluation can improve early screening of VCP voices and help establish an effective diagnosis and treatment plan for VCP-related dysphonia.
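
The AUC figure quoted above can be read as the probability that a randomly chosen patient scores higher than a randomly chosen normal subject. A minimal sketch of that pairwise definition follows. The scores are hypothetical and are not AVQI values from the study, and the function name `auc` is an illustration rather than the authors' procedure.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half a win.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Hypothetical index values (illustration only, NOT study data).
vcp_scores = [4.1, 5.3, 3.5, 6.0, 4.8]
normal_scores = [2.0, 3.6, 2.9, 1.8, 3.1]

print(f"AUC = {auc(vcp_scores, normal_scores):.2f}")
```

In this toy data only one of the 25 pairs is ordered the wrong way (3.5 against 3.6), so the AUC is 24/25.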

The Acoustic Severity Index in the Pathologic Voice

  • 홍기환;김현기;양윤수
    • 음성과학
    • /
    • Vol. 10, No. 4
    • /
    • pp.201-219
    • /
    • 2003
  • Background: Perceptual assessment is generally performed by a voice specialist, whereas objective evaluation is performed in a voice laboratory. Research in voice laboratories has generated a variety of objective tests and parameters. Perceptual evaluation is one of the most controversial topics in voice research: a review of the literature reveals a wide variety of rating scales, and reliability data fluctuate from study to study. Unfortunately, there is no widely accepted valid method for classifying voice disorders and assessing outcome after voice treatment. Objectives: The goals of this research were to identify important objective acoustic parameters of vocal quality and to establish an objective, quantitative correlate of perceived vocal quality. Materials and Methods: We evaluated voice analysis data from 122 dysphonic patients and 20 normal volunteers. A Computerized Speech Lab (CSL) 4300B was used to analyze each voice sample. Results: Three dysphonia severity indices (DSI) were created using discriminant analysis. The DSI is based on a weighted combination of the following selected acoustic parameters: absolute jitter (Jita in μs), smoothed pitch perturbation quotient (sPPQ in %), amplitude perturbation quotient (APQ in %), soft phonation index (SPI), average fundamental frequency (Fo in Hz), lowest fundamental frequency (Flo in Hz), and smoothed amplitude perturbation quotient (sAPQ in %). The DSI, the discriminating rule calculated by logistic regression, consists of three equations based on statistically significant acoustic parameters. The three DSI were constructed to best reflect the degree of hoarseness as expressed by G of the GRBAS scale. The more positive the DSI is for a patient, the worse the vocal quality; the more negative, the better. The effect of sex is included implicitly in DSI-1 and DSI-2, so separate versions of DSI-1 and DSI-2 for males and females are not needed. The DSI is objective because no perceptual input is required for its calculation. Conclusion: This research demonstrates that the voice function values calculated from the three multivariate objective dysphonia severity indices are significantly associated with subjective voice assessments. These indices may be appropriate for use in clinical trials and outcomes research on treatment effectiveness for voice disorders.

S-JND based Perceptual Rate Control Algorithm of HEVC

  • 김재련;심동규
    • 방송공학회논문지
    • /
    • Vol. 22, No. 3
    • /
    • pp.381-396
    • /
    • 2017
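  • This paper studies an HEVC (High Efficiency Video Coding) encoding method whose rate control algorithm allocates bits on the basis of subjective quality. To address the shortcomings of existing rate control, it proposes an HEVC encoding method with a rate control algorithm that can take subjective quality into account when measuring quality during rate-distortion optimization. The proposed method measures the perceptual visual importance of each CTU in a picture and uses it to allocate bits adaptively at the picture level and at the CTU level. Compared with version 16.9 of the HEVC reference software, the proposed method achieved, on average for the CTC (Common Test Condition) Class B sequences, a BD-rate improvement of 3.12%, a BD-PSNR gain of 0.08 dB, and a 0.07% increase in bit accuracy relative to the target bitrate. Subjective quality assessment also confirmed an average improvement of 0.16 on the DSCQS scale over the rate control algorithm in the existing HEVC reference software.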
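
The idea of distributing a picture's bit budget across CTUs according to perceptual importance can be sketched as a simple proportional split. This is not the paper's S-JND model, whose details the abstract does not give. The weights, the bit budget, and the function name `allocate_bits` are all hypothetical.

```python
def allocate_bits(picture_bits, ctu_weights):
    """Split a picture-level bit budget across CTUs in proportion to weights.

    Assumption: each weight is a non-negative perceptual importance
    score, and a CTU's share of bits is weight / sum(weights).
    """
    total_weight = sum(ctu_weights)
    if total_weight <= 0:
        raise ValueError("at least one CTU weight must be positive")
    return [picture_bits * w / total_weight for w in ctu_weights]


# Hypothetical importance scores for four CTUs (illustration only).
weights = [1.0, 2.0, 1.0, 4.0]
allocation = allocate_bits(picture_bits=80000, ctu_weights=weights)
print(allocation)
```

A practical rate controller would also update the remaining budget after each CTU is coded and clamp the resulting QP changes; the sketch omits both.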