• 제목/요약/키워드: voice quality parameters

검색결과 96건 처리시간 0.021초

이어콘을 적용한 음성 메뉴의 사용성 평가에 관한 연구 (A Study on the Usability Evaluation of Earcon Applied to Voice Menu)

  • 임치환;이재인;이성수
    • 산업경영시스템학회지
    • /
    • 제28권4호
    • /
    • pp.55-62
    • /
    • 2005
  • This paper describes the experiment that investigated the possibility of design and evaluation of the usability of earcon applied to voice menu. The earcon has functions in providing navigational cue in the hierarchical menu, and also can be applied to voice menu for improving the recall rate and the response time. In this experiment, participants identified their location with the help of earcon applied to the voice menu with the various earcon parameters. In detail some participants listened to the good quality sound of voice menu using the structured earcon, and they recalled the location they heard. Other participants listened to the good quality sound of voice menu without earcon, and they recalled the location they heard in the same manner. And the response times were checked through their answers. On analyzing the results, we found the earcon applied to voice menu showed the increase of recalling rate from what they heard during experiment. That is the performance of task was better with earcon applied to voice menu than with voice menu without earcon. In the earcon applied to voice menu test, it showed the accuracy of 92.50%, but in voice menu without earcon test, people could only recall 66.25% among given questions. The response time was reduced from 4.98 sec to 3.85 sec. In addition, this experiment showed the 87.5% of participants preferred the earcon applied to voice menu.

성대구증의 형태에 따른 음향학적 분석 및 치료 결과 (Voice Analysis and Treatment Result According to Configuration of Sulcus Vocalis)

  • 양호철;정병서;김동영;우주현
    • 대한후두음성언어의학회지
    • /
    • 제23권2호
    • /
    • pp.119-123
    • /
    • 2012
  • Background and Objectives : Sulcus vocalis could be classified into type I, type IIa, and type IIb. There have been a little reports about voice quality and treatment results related with types of sulcus vocalis. The authors conducted an analysis of voice and treatment according to different types of sulcus vocalis. Materials and Methods : This study was based on a retrospective chart review. The sulcus types were classified into type I and type II. Objective and subjective voice assessments were analyzed. Patients were treated individually with voice therapy, percutaneous steroid injection, and injection laryngoplasty. Comparison was performed on the voice difference between type I group and type II group, and between pre-treatment and post-treatment of each types. Results : One hundred and one patients were enrolled into this study, and 49 patients were type I and 52 patients were type II. Type I group showed longer mean maximal phonation time (MPT) than type II group, although other voice parameters didn't show any difference between two groups. Even after the management, almost all of the voice parameters didn't show improvement except MPT of type II group. Conclusion:Although the type I sulcus has been known as a non-pathologic lesion, it can result in some degree of voice change and discomfort, and thus need an active management. In this study, voice therapy, percutaneous steroid injection, and injection laryngoplasty showed limited effect to the both types of sulcus vocalis. Further studies for management of sulcus vocalis were needed.

  • PDF

연축성 발성장애 환자의 Lax Vox 음성치료 효과 (Effects of Lax Vox voice therapy in a patient with spasmodic dysphonia: A case report)

  • 임혜진;최성희;김정규;최철희
    • 말소리와 음성과학
    • /
    • 제8권2호
    • /
    • pp.57-63
    • /
    • 2016
  • Recently, the Lax Vox voice therapy has been used as one of the SOVTE(Semi-Occluded Vocal Tracts Exercise). The purpose of this study was to explore the effect of Lax Vox voice therapy for a patient with Spasmodic dysphonia on voice improvement. One female spasmodic dysphonia patient(age=27) who had been diagnosed by a laryngologist received Lax Vox voice therapy. The Lax Vox protocol was configured as 5 steps (1 warm-up and 4 steps : bubbling without / with phonation/ gliding with phonation/ generalization) in this study. A total of 11 sessions were performed by a certified speech language pathologist. The present study evaluated the acoustic, aerodynamic, auditory perceptual, and patient's self-rating between pre-, mid-, and post- voice therapy. All objective and subjective parameters were improved after voice therapy; Reduced frequency variation, increased maximum phonation time, enlarged voice range, improved 'G' and 'S' in GRBAS & USDRS, and reduced VHI were observed. Especially, decreased $f_0$ and remarkably reduced voice tremor were also demonstrated following Lax Vox voice therapy. Accordingly, Lax Vox voice therapy technique can be useful for improving voice and quality of life in patients with spasmodic dysphonia.

소음환경이 정상 및 병적음성에 미치는 영향 (The Effect of Noise on the Normal and Pathological Voice)

  • 홍기환;양윤수;김현기
    • 음성과학
    • /
    • 제9권4호
    • /
    • pp.27-38
    • /
    • 2002
  • The purpose of this article is to present the acoustic parameters (VOT, jitter, shimmer, vF0, vAm, NHR, SPI, VTI, DVB, DSH) for consonants (/pipi/, /$p^{h}ip^{h}i$/, /p'ip'i/) and sustained vowels (/a/, /e/, /i/) produced by normal subjects and dysphonia patients at two vocal effort(normal, high) by Lombard effect using 60dB white noise. Lombard effect indicates the vocal effort increase in noisy situation. At normal vocal effort, in general the acoustic parameter values of patients are greater than normal. And in noisy situation, significant decrease of acoustic values is seen in normal compared with in dysphonia patients. The clinical implication of this finding, the vocal quality in dysphonia is not compensated by vocal effort as well as normal subjects because of the inefficiency caused by abnormal vocal fold appearance and function. And with this result, we can counsel that the voice quality can not be improved as well as the patient expect.

  • PDF

순방향 WCDMA 채널에서 AMR 음성 코덱 모드 할당방식에 대한 성능 비교 (Performance Comparison of AMR Codec Mode Allocations in Downlink WCDMA System)

  • 정성환;홍정완;이상천;이창훈
    • 대한산업공학회지
    • /
    • 제31권4호
    • /
    • pp.349-357
    • /
    • 2005
  • The Adaptive Multi-Rate (AMR) speech codec is the mandatory for voice service in WCDMA systems. The AMR codec can be used efficiently to provide a balanced trade-off between the capacity and quality of voice by adjusting various service rates. In this paper, three ways of AMR mode allocation schemes on the downlink in WCDMA system are evaluated. To evaluate users satisfaction efficiently, new system performance measure and analytic models are proposed. The proposed analytic models can be applied to obtain optimal mode allocation ways while considering the system capacity and quality of voice. In numerical examples, the ways of finding optimal parameters are illustrated for the given traffic loads and the performances of three mode allocation schemes are compared.

식도발성화자 음성의 spectral & cepstral 분석 (Spectral and Cepstral Analyses of Esophageal Speakers)

  • 심희정;장효령;신희백;고도흥
    • 말소리와 음성과학
    • /
    • 제6권2호
    • /
    • pp.47-54
    • /
    • 2014
  • The purpose of this study was to analyze spectral versus cepstral measurements in esophageal speakers. The comparison between the measurements in thirteen male esophageal speakers was compared with the control group of thirteen normal speakers using the sustained vowel /a/. The main results can be summarized as below: (a) the CPP and L/H ratio of the esophageal group were significantly lower than those of the control group (b) the CPP was significantly correlated with the spectral parameters such as jitter, shimmer, NHR and VTI, and (c) the ROC analysis showed that the threshold of 10.25dB for the CPP achieved a good classification for esophageal speakers, with 100% perfect sensitivity and specificity. Thus, it was known that cepstral-based acoustic measures such as CPP, may be more reliable predictors than other spectral-based acoustic measures such as jitter and shimmer. And it was found that cepstral-based acoustic measures were effective in distinguishing esophageal voice quality from normal voice quality. This research will contribute to establishing a baseline related to speech characteristics in voice rehabilitation with laryngectomees.

T1a 병기 성문암의 방사선 치료 후 음성에 관한 연구 (Quantitative Analysis of Voice Quality after Radiation Therapy for Stage T1a Glottic Carcinoma)

  • 이준규;정웅기
    • Radiation Oncology Journal
    • /
    • 제23권1호
    • /
    • pp.17-21
    • /
    • 2005
  • 목적 : 후두암에서 방사선 치료는 음성을 보존할 수 있기 때문에 조기 성문암의 일차적인 치료법으로 사용된다. 이에 T1a 병기 성문암에서 방사선 치료가 환자의 음성에 미치는 영향을 알아보고자 하였다. 대상 및 방법 : 조기 성문암(T1a)으로 진단 받고 방사선 치료를 받은 후 최소 1년이 지난 17명의 남자 환자들을 대상으로 객관적인 음성검사들(음향분석, 공기역학검사, 후두 스트로보스코피)을 이용하여 음성을 평가하였고, 이것을 성별과 연령을 맞춘 정상 대조군과 비교하였다. 음향분석으로는 평균 기본주파수(Fo), jitter, shimmer, 잡음 대 조화음 비율(Noise to Harmonics Ratio)을 측정하였다. 공기역학적 검사로는 최대발성지속시간, 평균호기류율, 음강도, 성문하압, 성문저항, 성문효율, 성문력을 측정하였다. 결과 : 방사선 치료를 받은 환자에서 음향분석의 shimmer만이 통계학적으로 의의 있게 높았다. 그 외 다른 검사나 공기역학검사에서는 두 군 간에 통계학적인 유의성이 없었다. 결론 : 본 연구에서는 단지 shimmer만이 방사선 치료 환자군에서 높았기 때문에 T1a 병기 성문암에서의 방사선치료는 음성의 질에 큰 영향을 미치지 않은 것으로 사료된다.

후두질환 음성의 자동 식별 성능 비교 (Performance Comparison of Automatic Detection of Laryngeal Diseases by Voice)

  • 강현민;김수미;김유신;김형순;조철우;양병곤;왕수건
    • 대한음성학회지:말소리
    • /
    • 제45호
    • /
    • pp.35-45
    • /
    • 2003
  • Laryngeal diseases cause significant changes in the quality of speech production. Automatic detection of laryngeal diseases by voice is attractive because of its nonintrusive nature. In this paper, we apply speech recognition techniques to detection of laryngeal cancer, and investigate which feature parameters and classification methods are appropriate for this purpose. Linear Predictive Cepstral Coefficients (LPCC) and Mel-Frequency Cepstral Coefficients (MFCC) are examined as feature parameters, and parameters reflecting the periodicity of speech and its perturbation are also considered. As for classifier, multilayer perceptron neural networks and Gaussian Mixture Models (GMM) are employed. According to our experiments, higher order LPCC with the periodic information parameters yields the best performance.

  • PDF

음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성 (The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing)

  • 최성희;최철희
    • 말소리와 음성과학
    • /
    • 제6권3호
    • /
    • pp.63-72
    • /
    • 2014
  • The current study assessed the utility of acoustic analyses the most commonly used in routine clinical voice assessment including perturbation, nonlinear dynamic analysis, and Spectral/Cepstrum analysis based on signal typing of dysphonic voices and investigated their applicability of clinical acoustic analysis methods. A total of 70 dysphonic voice samples were classified with signal typing using narrowband spectrogram. Traditional parameters of %jitter, %shimmer, and signal-to-noise ratio were calculated for the signals using TF32 and correlation dimension(D2) of nonlinear dynamic parameter and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV(Analysis of Dysphonia in Speech and VoiceTM). Auditory perceptual analysis was performed by two blinded speech-language pathologists with GRBAS. The results showed that nearly periodic Type 1 signals were all functional dysphonia and Type 4 signals were comprised of neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant signal typing-related differences were found in all acoustic and auditory-perceptual measures. SNR, CPP, L/H ratio values for Type 4 were significantly lower than those of other voice signals and significant higher %jitter, %shimmer were observed in Type 4 voice signals(p<.001). Additionally, with increase of signal type, D2 values significantly increased and more complex and nonlinear patterns were represented. Nevertheless, voice signals with highly noise component associated with breathiness were not able to obtain D2. In particular, CPP, was highly sensitive with voice quality 'G', 'R', 'B' than any other acoustic measures. Thus, Spectral and cepstral analyses may be applied for more severe dysphonic voices such as Type 4 signals and CPP can be more accurate and predictive acoustic marker in measuring voice quality and severity in dysphonia.

병리적 음성에 대한 언어습득 이후 인공와우이식 성인의 청지각적 변별특성과 중재 프로그램의 효과 (The Effect on Intervention Program and Auditory-Perceptual Discrimination Feature of Postlingual Cochlear Implant Adults about Pathological Voice)

  • 배인호;김근효;이연우;박희준;김진동;이일우;권순복
    • 말소리와 음성과학
    • /
    • 제7권2호
    • /
    • pp.9-17
    • /
    • 2015
  • In the present study, we investigated ability of recognition of auditory perception with regards to the quality of voice in postlingual CI adults and proposed a training program to improve within subject reliability. A prospective case-control study was conducted in adults with 7 postlingual deaf who received a CI surgery and 10 normal hearing controls. The pre and post test and training program included parameters of consensus auditory-perceptual evaluation of voice(CAPE-V) with pathological voice sample by using Alvin. In results of pre-post test for monitoring improvements of internal reliability for listeners via the training program, there was statistically significant difference in both test and group. There was statistically significant difference in internal reliability between pre-post test in the normal hearing group, the result was no significant in the CI group. The present study found that CI adults showed less ability in awareness of voice quality compared to normal hearing group. Also the training program improved pitch and loudness in CI adults.