• 제목/요약/키워드: quality of voice

검색결과 767건 처리시간 0.032초

성대 용종 환자의 후두미세수술 전후 음성 평가에서 OperaVOXTM와 Multi-Dimensional Voice Program 간의 신뢰도 연구 (Reliability of OperaVOXTM against Multi-Dimensional Voice Program to Assess Voice Quality before and after Laryngeal Microsurgery in Patient with Vocal Polyp)

  • 김선우;김소연;조재경;진성민;이상혁
    • 대한후두음성언어의학회지
    • /
    • 제31권2호
    • /
    • pp.71-77
    • /
    • 2020
  • Background and Objectives OperaVOXTM (Oxford Wave Research Ltd.) is a portable voice analysis software package designed for use with iOS devices. As a relatively cheap, portable and easily accessible form of acoustic analysis, OperaVOXTM may be more clinically useful than laboratory-based software in many situations. The aim of this study was to evaluate the agreement between OperaVOXTM and Multi-Dimensional Voice Program (MDVP; Computerized Speech Lab) to assess voice quality before and after laryngeal microsurgery in patient with vocal polyp. Materials and Method Twenty patients who had undergone laryngeal microsurgery for vocal polyp were enrolled in this study. Preoperative and postoperative voices were assessed by acoustic analysis using MDVP and OperaVOXTM. A five-seconds recording of vowel /a/ was used to measure fundamental frequency (F0), jitter, shimmer and noise-to-harmonic ratio (NHR). Results Several acoustic parameters of MDVP and OperaVOXTM related to short-term variability showed significant improvement. While pre-operative value of F0, jitter, shimmer, NHR was 155.75 Hz (male: 125.37 Hz, female: 183.37 Hz), 2.20%, 6.28%, 0.16, post-operative values of these parameter was 164.34 Hz (male: 129.42 Hz, female: 199.26 Hz), 2.15%, 5.18%, 0.14 Hz in MDVP. While pre-operative value of F0, jitter, shimmer, NHR was 168.26 Hz (male: 135.16 Hz, female: 201.37 Hz), 2.27%, 6.95%, 0.26, post-operative values of these parameters was 162.72 Hz (male: 128.267 Hz, female: 197.18 Hz), 1.71%, 5.36%, 0.20 in OperaVOXTM. There was high intersoftware agreement for F0, jitter, shimmer with intraclass correlation coefficient. Conclusion Our results showed that the short-term variability of acoustic parameters in both MDVP and OperaVOXTM were useful for the objective assessment of voice quality in patients who received laryngeal microsurgery. OperaVOXTM is comparable to MDVP and has high intersoftware reliability with MDVP in measuring the F0, jitter, and shimmer

응답형 음성제어 전동 휠체어(INMEL-1)의 설계 (Design of the Motorized Wheel Chair(INMEL-1) Controlled by Response Type Voices)

  • 정동명;홍승홍
    • 대한의용생체공학회:의공학회지
    • /
    • 제8권2호
    • /
    • pp.231-240
    • /
    • 1987
  • This Paper introduces a new design of motorized wheel chair for the disabled, which is intended to improve the quality of the disabled's indoor life. This vehicle was based on high manoeuvrability of the omnidirectional drive and saftey. Usually, the vehicle controlled by a joystick but also the voice control system to be prepared for the severely disabled. This voice control system responds to the result of voice recognition, state of system or warning of dangers with voices, which has real time response and 95.3% recognition ratio and satisfactory synthesis voice Quality Therefore this system is able to provide independency in driving and the disabled's daily life.

  • PDF

LF 모델에 고조파 성분을 보상한 음원 모델링 (Voice Source Modeling Using Harmonic Compensated LF Model)

  • 이건웅;김태우홍재근
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 1998년도 추계종합학술대회 논문집
    • /
    • pp.1247-1250
    • /
    • 1998
  • In speech synthesis, LF model is widely used for excitation signal for voice source coding system. But LF model does not represent the harmonic frequencies of excitation signal. We propose an effective method which use sinusoidal functions for representing the harmonics of voice source signal. The proposed method could achieve more exact voice source waveform and better synthesized speech quality than LF model.

  • PDF

노인성 음성장애의 음성치료 효과 (The Effects of Voice Therapy in Age-related Dysphonia)

  • 김성태
    • 말소리와 음성과학
    • /
    • 제2권2호
    • /
    • pp.117-121
    • /
    • 2010
  • The This study aimed to evaluate the effects of the voice therapy we operated to the patients with age-related dysphonia. Thirty four participants who were diagnosed as age-related dysphonia in laryngoscopic finding from January, 2009 to December, 2009 completed the study. The participants were aged from 60 to 82 years old with a mean age of 70.6. All participants had received the abdominal breath technique, SKHPIP with laughter, and basic vocal training with description of their problem, the length of which ranged from four sessions to twelve sessions. We executed the videostroboscopy to compare the aspect of voicing change and the perceptual assessment, voice range profile, acoustic and aerodynamic measures to identify change of voice. Participants had glottal gap due to incomplete glottic closure during voicing on the pretest. After they took the voice therapy, the glottic gap became narrow and rough and breathy voice was reduced. There were significant difference in acoustic and aerodynamic measures. Jitter, Shimmer, MFR were reduced and MPT, Psub were increased(p<.05). Participants' pitch range and intensity range were increased on the posttest performance after taking voice therapy. Especially, most of them were showed that pitch range was increased significantly in high frequency area. The results of this investigation indicate that the voice therapy using abdominal breath, SKHPIP, and exercise together is effective for the patients who have age-related dysphonia to improve their voice quality. We recommend to apply this technique to functional voice disorders who are showed glottal gap.

  • PDF

13kbps QCELP에서 8kbps QCELP로의 음성 패킷 변환 기술 (Voice Packet Conversion from 13kbps QCELP to 8kbps QCELP Speech Codecs)

  • 박호종;권상철
    • 한국음향학회지
    • /
    • 제18권6호
    • /
    • pp.71-76
    • /
    • 1999
  • 디지털 이동 통신 시스템에서 서로 다른 음성 압축기를 사용하는 단말기 사이의 통신은 음성 신호를 두 번의 압축/복원 과정을 거쳐 전달하므로 음질 저하, 계산량 증가, 전달 지연 증가 등의 문제를 발생시킨다. 본 논문에서는 이와 같은 단말기 사이의 통신에서의 문제점을 해결하기 위하여 음성 패킷 변환 방법을 제안하고, 13kbps QCELP 패킷을 8kbps QCELP 패킷으로 변환하는 방법을 개발한다. 여러 음성 신호를 이용한 모의 실험 결과, 본 논문에서 개발된 패킷 변환기가 짧은 음성전달 지연과 약 33%의 계산량으로 일반적인 이중 압축 방법과 동등한 음질의 음성 신호를 합성하는 것을 확인하였다.

  • PDF

Multiple Average Ratings of Auditory Perceptual Analysis for Dysphonia

  • Choi, Seong-Hee;Choi, Hong-Shik
    • 말소리와 음성과학
    • /
    • 제1권4호
    • /
    • pp.165-170
    • /
    • 2009
  • This study was to investigate for comparison between single rating and average ratings from multiple presentations of the same stimulus for measuring the voice quality of dysphonia using 7-point equal-appearing interval (EAI) rating scale. Overall severity of voice quality for 46 /a/ vowel stimuli (23 stimuli from dysphonia, 23 stimuli from control) was rated by 3 experienced speech-language pathologists (averaged 19 years; range = 7 to 40 years). For average ratings, each stimulus was rated five times in random order and averaged from two to five times. Although higher inter-rater reliability was found in average ratings than in single rating, there were no significant differences in rating scores between single and multiple average ratings judged by experienced listeners, suggesting that auditory perceptual ratings judged by well-trained listeners have relatively good agreement with the same stimulus across the judgment. Larger variations in perceptual ratings were observed for moderate voices than for mild or severe voices, even in the average ratings.

  • PDF

갑상선암 수술과 수술 전후 음성관리 (Perioperative Management of the Voice in Thyroid Cancer)

  • 윤소연;홍현준
    • 대한후두음성언어의학회지
    • /
    • 제31권2호
    • /
    • pp.49-55
    • /
    • 2020
  • Evaluating the patient's voice before thyroidectomy is useful for the purpose of identifying patients with vocal cord paralysis without symptoms, identifying other patient's voice abnormalities, and whether it is related to voice disorders that may occur after surgery. Also voice evaluation after thyroid surgery is helpful in diagnosis, treatment, and rehabilitation and follow-up of voice disorders that occur without clear nerve damage after thyroidectomy. And it is helpful for rapid recovery through active early rehabilitation treatment for patients who complain of speech impairment without paralysis. In particular, neck exercise can improve the adhesion of the surgical site and increase the range of motion of the neck as well as improve subjective neck discomfort. In addition, hearing, voice and breathing functions should be improved, and voice hygiene education and counseling should be provided. Vocal cord injection is the first treatment option for unilateral vocal cord palsy. By establishing a protocol for voice disorders before and after thyroid surgery and providing appropriate treatment, the quality of life of patients can be improved.

양성후두 질환의 지속모음을 대상으로 한 기존 피치 추정 방법들의 성능 비교 분석 (Comparative Analysis of Performance of Established Pitch Estimation Methods in Sustained Vowel of Benign Vocal Fold Lesions)

  • 장승진;김효민;최성희;박영철;최홍식;윤영로
    • 음성과학
    • /
    • 제14권4호
    • /
    • pp.179-200
    • /
    • 2007
  • In voice pathology, various measurements calculated from pitch values are proposed to show voice quality. However, those measurements frequently seem to be inaccurate and unreliable because they are based on some wrong pitch values determined from pathological voice data. In order to solve the problem, we compared several pitch estimation methods to propose a better one in pathological voices. From the database of 99 pathological voice and 30 normal voice data, errors derived from pitch estimation were analyzed and compared between pathological and normal voice data or among the vowels produced by patients with benign vocal fold lesions. Results showed that gross pitch errors were observed in the cases of pathological voice data. From the types of pathological voices classified by the degree of aperiodicity in the speech signals, we found that pitch errors were closely related to the number of aperiodic segments. Also, the autocorrelation approach was found to be the most robust pitch estimation in the pathological voice data. It is desirable to conduct further research on the more severely pathological voice data in order to reduce pitch estimation errors.

  • PDF

순방향 WCDMA 채널에서 AMR 음성 코덱 모드 할당방식에 대한 성능 비교 (Performance Comparison of AMR Codec Mode Allocations in Downlink WCDMA System)

  • 정성환;홍정완;이상천;이창훈
    • 대한산업공학회지
    • /
    • 제31권4호
    • /
    • pp.349-357
    • /
    • 2005
  • The Adaptive Multi-Rate (AMR) speech codec is the mandatory for voice service in WCDMA systems. The AMR codec can be used efficiently to provide a balanced trade-off between the capacity and quality of voice by adjusting various service rates. In this paper, three ways of AMR mode allocation schemes on the downlink in WCDMA system are evaluated. To evaluate users satisfaction efficiently, new system performance measure and analytic models are proposed. The proposed analytic models can be applied to obtain optimal mode allocation ways while considering the system capacity and quality of voice. In numerical examples, the ways of finding optimal parameters are illustrated for the given traffic loads and the performances of three mode allocation schemes are compared.

음성인식프로그램을 이용한 무후두 음성의 말 명료도와 병적 음성의 수술 전후 개선도 측정 (Speech Intelligibility of Alaryngeal Voices and Pre/Post Operative Evaluation of Voice Quality using the Speech Recognition Program(HUVOIS))

  • 김한수;최성희;김재인;임재열;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제15권2호
    • /
    • pp.92-97
    • /
    • 2004
  • Background and Objectives : The purpose of this study was to examine objectively pre and post operative voice quality evaluation and intelligibility of alaryngeal voice using speech recognition program, HUVOIS. Materials and Methods : 2 laryngologists and 1 speech pathologist were evaluated 'G', 'R', 'B' in the GRBAS sclae and speech intelligibility using NTID rating scale from standard paragraph. And also acoustic estimates such as jitter, shimmer, HNR were obtained from Lx Speech Studio. Results : Speech recognition rate was not significantly different between pre and post operation for pathological vocie samples though voice quality(G, B) and acoustic values(Jitter, HNR) were significantly improved after post operation. In Alaryngeal voices, reed type electrolarynx 'Moksori' was the highest both speech intelligibility and speech recognition rate, whereas esophageal speech was the lowest. Coefficient correlation of speech intelligibility and speech recognition rate was found in alaryngeal voices, but not in pathological voices. Conclusion : Current study was not proved speech recognition program, HUVOIS during telephone program was not objective and efficient method for assisting subjective GRBAS scale.

  • PDF