• Title/Summary/Keyword: Voice evaluation

Acoustic Features of Oral Vowels in the Esophagus Speakers (식도음성의 모음종류에 따른 음향학적 특성)

  • Yun, Eunmi;Mok, Eunhee;Minh, Phan huu Ngoc;Hong, Kihwan
    • Phonetics and Speech Sciences / v.7 no.4 / pp.85-92 / 2015
  • This study aimed to characterize the voice and speech of esophageal speakers through analysis of fundamental frequency (f0). Eight esophageal speakers were selected as subjects, and 10 other subjects served as a control group. MDVP (Multi-Dimensional Voice Program, Model 4800, USA, 2001) and Multi-Speech (Model 3700, KayPENTAX, USA, 2008) were used as the experimental equipment. The speech samples selected for evaluation were vowels and sentences (both declarative and interrogative). For acoustic analysis, f0, jitter, energy, shimmer, HNR, and the intonation patterns of the speech samples were measured. The results were as follows. First, the fundamental frequency of sustained vowels in the esophageal speech group was lower than in the normal voice group. In particular, the frequency difference for the high vowel /i/ was much greater than for the low vowel /a/. Second, the jitter values of the esophageal speech group were higher than those of the control group. In particular, there was a large difference between the jitter values for /a/ and /i/, with jitter being highest for /i/. Third, there was no significant difference in vocal intensity between the esophageal speech group and the control group. Fourth, the shimmer values of the esophageal speech group were higher than those of the control group. In particular, there was a large difference in shimmer values for the low vowel /a/. Fifth, the HNR values of the esophageal speech group were significantly lower than those of the control group. In particular, the largest difference in HNR between the two groups was for the high vowel /i/. Sixth, the pitch contours of interrogative and declarative sentences in the esophageal speech group either took a different form from those of the normal voice group or differed only slightly, thus presenting an inconsistent pattern.
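
The jitter and shimmer measures compared above can be illustrated with a minimal sketch. It uses a common textbook definition of local jitter and shimmer (mean absolute cycle-to-cycle difference relative to the mean, in percent); this is an assumption and not necessarily the exact computation MDVP performs. The function name and the sample values are hypothetical and are not data from the study.

```python
from statistics import mean


def local_perturbation_percent(values):
    """Mean absolute difference between consecutive cycles,
    relative to the mean value, expressed in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * mean(diffs) / mean(values)


# Hypothetical glottal cycle measurements (illustration only).
periods_ms = [10.0, 10.2, 9.8, 10.0]       # cycle lengths -> jitter
peak_amplitudes = [1.0, 0.9, 1.1, 1.0]     # cycle peak amplitudes -> shimmer

jitter_percent = local_perturbation_percent(periods_ms)
shimmer_percent = local_perturbation_percent(peak_amplitudes)
print(f"jitter = {jitter_percent:.2f} %, shimmer = {shimmer_percent:.2f} %")
```

Applying the same function to period lengths gives jitter and to peak amplitudes gives shimmer, which is why the two measures are usually reported side by side.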

Management of Vocal Cord Palsy during Thyroid Surgery (갑상선 수술 시의 성대마비의 처치)

  • Choi Hong-Shik;Kim Se-Heon;Park Kuk-Jin;Kim Kwang-Moon;Hong Won-Pyo
    • Korean Journal of Head & Neck Oncology / v.14 no.1 / pp.27-34 / 1998
  • Objectives, Materials & Methods: To prevent deterioration of the postoperative voice after iatrogenic transection of the recurrent laryngeal nerve during thyroid surgery, intraoperative medialization of the membranous vocal cord by type I thyroplasty, together with direct epineurial neurorrhaphy, was performed in 2 cases of benign thyroid lesion. To improve voice quality while completely removing advanced thyroid carcinoma, intraoperative vocal cord medialization on the lesion side was performed together with total thyroidectomy: by type I thyroplasty in 2 cases, and by a combined procedure of arytenoid adduction and type I thyroplasty in another 2 cases. Results: The resultant voice in the iatrogenic injury cases was relatively tolerable. Among the intraoperative rehabilitation cases, the voice after the combined procedure was better than that after type I thyroplasty alone. Laryngeal EMG is expected to be useful not only for preoperative evaluation of the severity of the nerve lesion but also for predicting prognosis in cases of thyroid cancer with vocal cord palsy. Conclusion: Simultaneous intraoperative rehabilitation of vocal cord palsy during thyroid surgery is beneficial for patients.

Analysis of Pre and Post-Operative Speech In Combined Operation of Type I Thyroplasty and Arytenoid Adduction for Unilateral Vocal Cord Palsy (편측성대마비에 대한 제 1형 갑상성형술과 피열연골내전술의 동시수술시 술전 및 술후 음성언어분석비교)

  • 최홍식;정유삼;김성국;김영호;김광문
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.9 no.1 / pp.66-70 / 1998
  • Background and Objectives : The management of unilateral vocal cord palsy includes type Ⅰ thyroplasty and arytenoid adduction. Neither operation alone has been shown to give satisfactory results. We evaluated the preoperative and postoperative speech of patients with unilateral vocal cord palsy who underwent a combined operation of type Ⅰ thyroplasty and arytenoid adduction, to help with management planning for such patients. Materials and Methods : We retrospectively reviewed the postoperative results and complications of 17 patients surgically treated for unilateral vocal cord palsy at Severance Hospital from Nov. 1996 to Dec. 1997. All underwent the combined operation of type Ⅰ thyroplasty and arytenoid adduction. Their pre- and postoperative speech was analyzed with MDVP (Multi-Dimensional Voice Program) of CSL (Computerized Speech Lab). Results : After the operation, MPT (Maximal Phonation Time) increased and MFR (Mean Flow Rate) decreased in all patients. NHR (Noise to Harmonic Ratio) and VTI (Voice Turbulence Index) decreased; Jitter, RAP (Relative Average Perturbation), PPQ (Pitch Period Perturbation Quotient), sPPQ (Smoothed Pitch Period Perturbation Quotient), and vFo (Fundamental Frequency Variation) decreased; and Shimmer, APQ (Amplitude Perturbation Quotient), sAPQ (Smoothed Amplitude Perturbation Quotient), and vAm (Peak Amplitude Variation) decreased in all patients. Conclusions : In unilateral vocal cord palsy, the combined operation of type Ⅰ thyroplasty and arytenoid adduction can achieve a satisfactory postoperative voice. MDVP offers many parameters and is a good method for evaluating the results of voice surgery.
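
As an illustration of one of the perturbation parameters listed above, the following sketch computes RAP using a commonly cited definition: each period is compared with the average of itself and its two neighbours, and the mean deviation is expressed relative to the mean period. Whether MDVP uses exactly this formula is an assumption here. The function name and the "pre" and "post" period sequences are hypothetical values chosen for illustration, not patient data from the study.

```python
def rap_percent(periods):
    """Relative Average Perturbation with a 3-cycle smoothing window, in percent."""
    n = len(periods)
    if n < 3:
        raise ValueError("need at least three cycles")
    # Deviation of each interior period from its 3-point local average.
    deviations = [
        abs((periods[i - 1] + periods[i] + periods[i + 1]) / 3.0 - periods[i])
        for i in range(1, n - 1)
    ]
    mean_deviation = sum(deviations) / len(deviations)
    mean_period = sum(periods) / n
    return 100.0 * mean_deviation / mean_period


# Hypothetical cycle lengths in ms (illustration only).
pre_op_periods = [10.0, 10.6, 9.4, 10.0]
post_op_periods = [10.0, 10.1, 9.9, 10.0]

rap_pre = rap_percent(pre_op_periods)
rap_post = rap_percent(post_op_periods)
print(f"RAP pre = {rap_pre:.2f} %, RAP post = {rap_post:.2f} %")
```

A lower value after surgery would correspond to the decrease in RAP that the abstract reports.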

A Seamless Voice Call Handover Scheme for the 3G LTE System (3G LTE 시스템을 위한 끊김없는 음성 호 핸드오버 방법)

  • Kim, Kyung-Min;Jung, Hyun-Duk;Lee, Jai-Yong
    • The Journal of Korean Institute of Communications and Information Sciences / v.35 no.2A / pp.174-185 / 2010
  • A seamless handover between the 3G LTE system and the legacy 3G system is required for the smooth deployment of 3G LTE, the next-generation cellular network system. For voice call handover in particular, user satisfaction is very sensitive to the service interruption time, so a seamless voice call handover scheme is essential. However, handover between the 3G LTE and 3G CS systems is hard to achieve because of the lack of an interface between the two systems and the restrictions on radio resources. In this paper, a new network entity called SCSE is proposed to enable inter-working between the 3G LTE and 3G CS systems. Thanks to the features of the SCSE, the handover procedure is simplified and, as a consequence, the service interruption time is minimized. The evaluation results show that only the proposed SCSE scheme meets the service interruption time requirement of less than 300 ms.

Design of FIR filter using direct memory access for voice signal processing module in implantable middle ear hearing devices (이식형 인공중이용 음성신호 처리 모듈을 위한 직접 메모리 억세스 기반의 FIR 필터 설계)

  • Kim, Jong-Min;Park, Il-Yong;Yoon, Young-Ho;Kim, Min-Kyu;Lim, Hyung-Gyu;Han, Ji-Hun;Kim, Myoung-Nam;Cho, Jin-Ho
    • Journal of Sensor Science and Technology / v.15 no.4 / pp.223-230 / 2006
  • An FIR filter for digital voice signal processing has been designed and implemented on a microcontroller for implantable middle ear hearing devices (IMEHDs). The filter, which offers fast and accurate filtering and controllable filter characteristics, was implemented using the hardware multiplier and direct memory access (DMA) of the low-power MSP430F169 microcontroller. Based on the results of a filtering performance experiment, it was confirmed that the implemented 6th-order Remez FIR filters, in both 1-channel and 2-channel configurations, can be applied to the voice signal processing module of IMEHDs.
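
The filtering operation itself can be sketched as a direct-form FIR filter, in which each output sample is the weighted sum of the most recent input samples. This is a plain software illustration only: it does not model the DMA transfers or the hardware multiplier of the MSP430F169, and the seven equal coefficients are placeholder values (a moving average), not the Remez-designed coefficients of the paper. A 6th-order filter has seven taps.

```python
def fir_filter(coeffs, samples):
    """Direct-form FIR: y[n] = sum over k of coeffs[k] * x[n - k]."""
    history = [0.0] * len(coeffs)   # delay line, newest sample first
    output = []
    for x in samples:
        history = [x] + history[:-1]
        output.append(sum(b * h for b, h in zip(coeffs, history)))
    return output


# Placeholder coefficients (assumption): 7 taps for a 6th-order filter.
taps = [1.0 / 7.0] * 7

# Feeding a unit impulse returns the coefficients themselves.
impulse = [1.0] + [0.0] * 9
response = fir_filter(taps, impulse)
print(response)
```

On a microcontroller the multiply-accumulate in the inner sum is the step that a hardware multiplier would speed up, while DMA would move samples into the delay line without CPU involvement.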

A Study on the Performance Evaluation of a Voice Coil Actuator for Electro-Discharge Micro-Drilling Machine (보이스코일 액츄에이터로 이송되는 미세구멍 가공용 방전 가공기의 작동특성 연구)

  • Yang, Seung-Jin;Baek, Hyeong-Chang;Kim, Byeong-Hui;Jang, In-Bae
    • Journal of the Korean Society for Precision Engineering / v.18 no.12 / pp.152-158 / 2001
  • In this paper, we developed an electro-discharge machine for micro-drilling driven by a voice coil actuator. Because the voltage signal of the electro-discharge circuit shows many peaks and valleys, an active low-pass filtering technique is adopted to obtain the average of the signal. Since the motion of the voice coil is precisely controlled by the error between the target voltage and the measured voltage, it is possible to prevent mechanical contact between the rotating electrode and the workpiece and to maintain appropriate machining conditions during the process. Electro-chemical machining was also adopted to make small-diameter electrodes. Pure water is used as the dielectric. Machining was performed to verify the feasibility of the developed system. It takes about 10 seconds to drill a ${\phi}$100 ${\mu}m$ hole in a 100 ${\mu}m$ thick stainless steel plate. The machining time depends on the values of the resistor and the capacitor. An optimal value of the time constant may exist, and the tendency is shown in the appendix.

Application of Machine Learning on Voice Signals to Classify Body Mass Index - Based on Korean Adults in the Korean Medicine Data Center (머신러닝 기반 음성분석을 통한 체질량지수 분류 예측 - 한국 성인을 중심으로)

  • Kim, Junho;Park, Ki-Hyun;Kim, Ho-Seok;Lee, Siwoo;Kim, Sang-Hyuk
    • Journal of Sasang Constitutional Medicine / v.33 no.4 / pp.1-9 / 2021
  • Objectives: The purpose of this study was to check whether an individual's Body Mass Index (BMI) class could be predicted by applying machine learning to the voice data collected at the Korean Medicine Data Center (KDC). Methods: We proposed a convolutional neural network (CNN)-based BMI classification model. The subjects were Korean adults in the Korean Medicine Data Center data who had completed voice recording and BMI measurement between 2006 and 2015. Of these, 2,825 records were used to train the model and 566 records were used to assess its performance. Mel-frequency cepstral coefficients (MFCC) extracted from vowel utterances were used as the input feature of the CNN. The model was constructed to predict four groups defined by gender and BMI criteria: overweight male, normal male, overweight female, and normal female. Results & Conclusions: Performance was evaluated using F1-score and accuracy. For the four-group prediction, the average accuracy was 0.6016 and the average F1-score was 0.5922. Although the model performed well in gender discrimination, follow-up studies are needed to improve its performance in distinguishing BMI within each gender. As research on deep learning is active, performance improvement is expected through future research.
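
The two evaluation metrics named above can be sketched for a four-class problem as follows. The abstract reports an "average F1-score" without saying how it was averaged; the sketch assumes an unweighted (macro) average over the four groups. The class abbreviations and the toy label lists are hypothetical and are not results from the study. The toy predictions are deliberately always right about gender but sometimes wrong about BMI class.

```python
def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted class equals the true class."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred, labels):
    """Unweighted mean of per-class F1 scores (macro averaging is an assumption)."""
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(labels)


# Hypothetical abbreviations: OM/NM = overweight/normal male, OF/NF = female.
labels = ["OM", "NM", "OF", "NF"]
y_true = ["OM", "OM", "NM", "NM", "OF", "OF", "NF", "NF"]
y_pred = ["OM", "NM", "NM", "NM", "OF", "NF", "NF", "NF"]

acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred, labels)
print(f"accuracy = {acc:.4f}, macro F1 = {f1:.4f}")
```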

Usability Evaluation of Artificial Intelligence Search Services Using the Naver App (인공지능 검색 서비스 활용에 따른 서비스 사용성 평가: 네이버 앱을 중심으로)

  • Hwang, Shin Hee;Ju, Da Young
    • Science of Emotion and Sensibility / v.22 no.2 / pp.49-58 / 2019
  • In the era of the 4th Industrial Revolution, artificial intelligence (AI) has become one of the core technologies in terms of the business strategy among information technology companies. Both international and domestic major portal companies are launching AI search services. These AI search services utilize voice, images, and other unstructured data to provide different experiences from existing text-based search services. An unfamiliar experience is a factor that can hinder the usability of the service. Therefore, the usability testing of the AI search services is necessary. This study examines the usability of the AI search service on the Naver App 8.9.3 beta version by comparing it with the search services of the current Naver App and targets 30 people in their 20s and 30s, who have experience using Naver apps. The usability of Smart Lens, Smart Voice, Smart Around, and AiRS, which are the Naver App beta versions of their artificial intelligence search service, is evaluated and statistically significant usability changes are revealed. Smart Lens, Smart Voice, and Smart Around exhibited positive changes, whereas AiRS exhibited negative changes in terms of usability. This study evaluates the change in usability according to the application of the artificial intelligence search services and investigates the correlation between the evaluation factors. The obtained data are expected to be useful for the usability evaluation of services that use AI.

Study on Assessment and Treatment Patterns of Speech-Language Pathologists in Pediatric Vocal Problem Through Multicenter Survey (다기관 설문조사를 통한 국내 소아 음성질환 환자의 검사 및 치료 유형 연구)

  • Lee, Jong-Geun;Bang, Seung-Hwan;Jeon, Jae-Min;Lee, Jung-Kyu;Kim, Angela Yun;Woo, Jeong-Soo;Cho, Jae-Gu
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.30 no.1 / pp.39-47 / 2019
  • Background and Objectives : Pediatric vocal health problems are relatively common. However, whether diagnosis and treatment are carried out uniformly and properly in South Korea has not yet been well studied. The purpose of this study was to investigate the methods that Korean speech therapists use to diagnose and treat pediatric voice problems. Materials and Method : An anonymous online questionnaire was administered to 32 speech language therapists registered at the Korean laryngeal speech linguistics society, covering demographics, employing institution, and general management of pediatric patients with vocal problems, including assessment and treatment procedures. Results : Current practice patterns were analyzed for 32 speech language therapists providing services in South Korea, most of them working at tertiary university hospitals. One third of pediatric patients were assessed without proceeding to treatment. One fifth of patients were treated without assessment. Perceptual assessment was the main pretreatment assessment method. Treatments were used in the following order of frequency : voice rest, SOVT, yawn-sigh, and resonant voice. Post-treatment evaluation methods were used in the following order : instrumental assessment, clinical judgment, and recording comparison. Conclusion : The practice of speech language therapists in South Korea mostly follows the ASHA practice guidelines. However, there is still a large number of cases in which only the evaluation was done, without appropriate treatment. Further research is needed to make the work of SLPs in evaluating and treating pediatric voice patients more systematic and efficient.

The Perceptual Evaluation and Aerodynamic Analysis of Spasmodic Dysphonia (연축성발성장애의 청지각적 평가 및 공기역학적 특성)

  • Park, Sun-Young;Kim, Jae-Ock;Lim, Sung-Eun;Nam, Do-Hyun;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.19 no.1 / pp.38-42 / 2008
  • Background and Objectives : This study was performed to investigate the perceptual and aerodynamic characteristics of adductor spasmodic dysphonia and the relation between vocal efficiency and the severity of strained voice. Materials and Methods : 13 female patients with adductor spasmodic dysphonia were examined and compared with a control group of 10 normal females. MPT, MFR, Psub, sound intensity, and VE (vocal efficiency) were obtained using PAS (Phonatory Aerodynamic System). The GRBA(S) scale was used for perceptual evaluation. Results : Psub (subglottic pressure) in the SD group was significantly higher than in the normal group. MPT, MFR, sound intensity, and VE were not significantly different between the two groups. The correlation between VE and 'S' (strained) was not significant. Conclusion : The results of this study show that a certain aerodynamic parameter (Psub) distinguishes adductor spasmodic dysphonia from normal voice.
