• Title/Summary/Keyword: Voice evaluation

Search Result 358, Processing Time 0.032 seconds

Implementation of Embedded Speech Recognition System for Supporting Voice Commander to Control an Audio and a Video on Telematics Terminals (텔레메틱스 단말기 내의 오디오/비디오 명령처리를 위한 임베디드용 음성인식 시스템의 구현)

  • Kwon, Oh-Il;Lee, Heung-Kyu
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.42 no.11
    • /
    • pp.93-100
    • /
    • 2005
  • In this paper, we implement the embedded speech recognition system to support various application services such as audio and video control using speech recognition interface on cars. The embedded speech recognition system is implemented and ported in a DSP board. Because MIC type and speech codecs affect the accuracy of speech recognition. And also, we optimize the simulation and test environment to effectively remove the real noises on a car. We applied a noise suppression and feature compensation algorithm to increase an accuracy of sppech recognition on a car. And we used a context dependent tied-mixture acoustic modeling. The performance evaluation showed high accuracy of proposed system in office environment and even real car environment.

A Study about the Medical Communication Proficiency of Korean Traditional Medical Students Using Standardized Patients with Hwa-Byoung (표준화 화병환자를 활용한 한의대생의 진료 및 의사소통 수준연구)

  • Kim, Kyeong-Ok;Kim, Hee-Kyung;An, Hyo-Ja;Shin, Heon-Tae
    • Journal of Society of Preventive Korean Medicine
    • /
    • v.17 no.1
    • /
    • pp.163-179
    • /
    • 2013
  • Objectives : After analyzing the proficiency of medical communication of the students in College of Korean Traditional Medicine using standardized patients, we suggests ways to improve clinical practice in the future class and medical communication curriculum development. Methods : 20 students before clinical practice class (3rd grade) and 20 students after 1 year clinical practice class (4th grade) participated and did their medical interview on Standardized patient. They were evaluated on patient-physician communication skills by standardized patients and professor evaluator. In addition to be evaluated on patient-physician relationship, medical interview skills by professor evaluator. Results : As follows in the evaluation of clinical practice with standardized patients 1. More than half of the participated students regardless of their grade received poor score in their medical communication evaluated by SP(Standardized patient) and PE(Professor evaluator). 2. Greeting, History taking parts were higher in the 4th students who received 1 year clinical practice class, but verbal-nonverbal response, voice tone parts were higher in the 3rd students who do not received clinical practice lesson. 3. Pronunciation&Voice tone parts were higher in the male students but, gathering information part was higher in the female students. Conclusions : We think that the current clinical practice lessons are insufficient as a way to learn and improve medical knowledge and medical communication skills, and it is necessary a new form of clinical practice class. Participatory lesson using standardized patient could be a good alternative of that in the future class.

Comparison of Pre and Post-operational Phonatory Aerodynamic Parameters in Vocal Polyp and Vocal Cord Palsy Patients (성대마비 및 성대용종 환자의 수술 전과 후의 공기역학적 변수 비교)

  • Lee, Dahye;Kim, Jaeock;Oh, JaeKoon;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.26 no.2
    • /
    • pp.112-116
    • /
    • 2015
  • Background and Objectives : Aerodynamic analysis is an examination which provides information regarding various vocalization measures indicating laryngeal efficiency. Voice evaluation using such examination must be capable of distinguishing between normal to abnormal voice. It also observes variables on aerodynamic characteristics by gender in regards to patients of vocal disorders, especially of vocal cord paralysis and vocal polyp, and compares the conditions before and after surgery. This paper therefore, seeks to build a framework for establishing standard levels of aerodynamical characteristic on vocal disorders. Subjects and Methods : The study was intended for a total number of 20 patients with vocal polyp or unilateral vocal cord paralysis. Those with the vocal polyp underwent laryngomycroscopy surgery and the vocal cord paralysis, vocal fold injection using Restylane. Aerodynamic analysis fulfilled the Maximum sustained Phonation (MXPH) and Voicing Efficiency (VOEF) by using PAS Model 6600 (KayPENTAX, USA). Results : In MXPH, increase in PHOT were evident with vocal polyp after surgery. As for patients with vocal cord paralysis, MAXDB, MEADB, DHODB, PHOT all have increased and MEAP, PEF, MEAF decreased after surgery. In VOEF, patients with vocal cord paralysis who underwent surgery showed increase in MAXDB, MEADB, DHODB, FET100, ARES, but decreases in PEF, TARF. Conclusion : Overall, it can be concluded that patients with the vocal polyp and vocal cord paralysis seemed to get closer to the normal values after than before surgery in majority of measures. This confirms that the function of their vocal cord has improved nearly to normality through operations.

  • PDF

Auditory-Perceptual and Acoustic Assessment in Measuring Dysphonia Severity of Vocal Fold Nodules (성대결절 환자의 음성장애에 대한 청지각적 및 음향학적 평가)

  • Kim, Geun-Hyo;Kwon, Soon-Bok
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.1
    • /
    • pp.108-116
    • /
    • 2018
  • The purpose of this study was to investigate the relationship between the differences in the acoustic measurements (AVQI) and the auditory-perceptual assessments (GRBAS, CAPE-V) of the normal and vocal fold nodules. For this purpose, Total 335 voice samples were analyzed acoustically and three raters performed auditory-perceptual assessments. in the results, AVQI, G, and OS scores of the normal group were lower than those of the vocal fold nodules group. The correlations between the G scale and the OS scale were highly correlated, and the correlation between the AVQI, and auditory-perceptual results (G and OS) was also high value. The threshold values for discriminating AVQI, G, and OS between the two groups were ${\leq}4.06$, ${\leq}1$, and ${\leq}26$, respectively, and the predictive diagnostic power was 0.840, 0.860, and 0.848. In conclusion, AVQI and auditory-perceptual evaluation can improve potentiality the screening of vocal fold nodules and help to determine the diagnosis and treatment plan of voice disorders.

Usefullness of the Vibration Pick-Up in Detection of Pitch for Synchronization of Laryngeal Stroboscopy (후두 스트로보스코프 검사의 신호 동기화를 위한 진동 검출기의 유용성)

  • Lee, Jin-Choon;Lee, Byung-Joo;Wang, Soo-Geun;Roh, Jung-Hoon;Kwon, Sun-Bok;Jo, Cheol-Woo
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.18 no.1
    • /
    • pp.26-32
    • /
    • 2007
  • Objective and Background: Laryngeal stroboscope is an useful equipment in evaluation of vocal cord vibration and in early detection of mucosal lesion including invasive cancer of the vocal cord. Recently Lee et al. (2006) developed portable stroboscope using voice as synchronization signal. It has been frequently impaired ability to synchronize the flashes even in normal female. Authors tried to investigate various methods including vibration pick-up, microphone, laryngeal microphone, and contact microphone for development of simple and accurate method like electroglottograph signal. The purpose of this study was to estimate wheher the vibration pick-up is available and is consistent with the signal of EGG. Subjects and Methods: Authors compared the signals between EGG and noncontact method such as voice, contact methods including vibration pick-up, laryngeal microphone, and contact microphone in normal twenty adults (male 10 and female 10). The number of peak in one cycle was compared with the number of the peak in EGG, and the percent of phase difference in the peak was compared with EGG Also, authors tried to investigate which site of vibration pick-up was most effective for synchronization of stobo flashes. Three site including anterior neck below the cricoid cartilage, thyroid ala, and suprahyoid region were analysed. Results: Among various methods for synchronization of strobo flashes, vibration pick-up was most effective method in peak detection. And anterior neck below cricoid cartilage was the most available site of the vibration pick-up. Conclusion: Authors suggest that vibration pick-up is most available and effective method for synchronization of strobo flashes.

  • PDF

A Case Report of Korean Medical Treatment on Parkinsonism Patient Complaining of Motor Disorder and Aphonia (한방치료로 운동 기능장애와 실성증이 호전된 파킨슨증후군 환자 치험 1례)

  • Hye-Min, Heo;Kyeong-Hwa, Lee;Ye-Chae, Hwang;Gyu-Ri, Jeon;Seung-Yeon, Cho;Seong-Uk, Park;Jung-Mi, Park;Chang-Nam, Ko
    • The Journal of the Society of Stroke on Korean Medicine
    • /
    • v.23 no.1
    • /
    • pp.13-24
    • /
    • 2022
  • ■Objectives This case study is to report the effectiveness of Korean medicine in Parkinsonism patient's treatment. ■Methods We used the acupuncture, electro-acupuncture, moxibustion, cupping therapy, herbal medicine, especially Palmulgunja-tang to the Parkinsonism patient with motor disorder such as Postural Instability and Gait Difficulty(PIGD) and aphonia. Unified Parkinson's Disease Rating Scale(UPDRS), analysis of gait pattern, voice dB and self-evaluation of speed and volume were used to assess the change of symptoms. ■Results ‌After treatment, the UPDRS score decreased in overall category and the walking pattern has improved. In addition, the improvement was observed in voice volume and in self assessment of the patient. ■Conclusion This case suggests the effect of Korean medical treatment on motor disorder and aphonia in Parkinsonism.

A Bypass Scheme for INVITE Messages With Priority in SIP Proxies (SIP 프록시에서 우선순위를 가지는 INVITE 메시지의 우회 방법)

  • Kwon, Oh-Jun;Jang, Hee-Suk;Lee, Jong-Min
    • Journal of the Korea Society for Simulation
    • /
    • v.19 no.4
    • /
    • pp.51-58
    • /
    • 2010
  • SIP is a flexible and extensible call setup protocol that may be combined with other protocols used in the Internet to make various services like voice communication. Voice communication can be classified into normal calls used for communication between common users and emergency calls for 112, 119 and other services through public safety networks. It is required to research to process effectively these normal calls and emergency calls through public networks such as the Internet. In this paper, we propose a bypass scheme for emergency calls by giving priority to INVITE messages for them and processing them with priority in the SIP proxy queue. We perform simulation studies using the network simulator ns-2 for the performance evaluation. Simulation results show that the proposed scheme processes emergency calls faster than normal calls and thus it is expected to make a special purpose network like the national disaster network efficiently by using the existing Internet.

A Basic Performance Evaluation of the Speech Recognition APP of Standard Language and Dialect using Google, Naver, and Daum KAKAO APIs (구글, 네이버, 다음 카카오 API 활용앱의 표준어 및 방언 음성인식 기초 성능평가)

  • Roh, Hee-Kyung;Lee, Kang-Hee
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.7 no.12
    • /
    • pp.819-829
    • /
    • 2017
  • In this paper, we describe the current state of speech recognition technology and identify the basic speech recognition technology and algorithms first, and then explain the code flow of API necessary for speech recognition technology. We use the application programming interface (API) of Google, Naver, and Daum KaKao, which have the most famous search engine among the speech recognition APIs, to create a voice recognition app in the Android studio tool. Then, we perform a speech recognition experiment on people's standard words and dialects according to gender, age, and region, and then organize the recognition rates into a table. Experiments were conducted on the Gyeongsang-do, Chungcheong-do, and Jeolla-do provinces where the degree of tongues was severe. And Comparative experiments were also conducted on standardized dialects. Based on the resultant sentences, the accuracy of the sentence is checked based on spacing of words, final consonant, postposition, and words and the number of each error is represented by a number. As a result, we aim to introduce the advantages of each API according to the speech recognition rate, and to establish a basic framework for the most efficient use.

A Nonlinear Regression Analysis Method for Frame Erasure Concealment in VoIP Networks (VoIP 망에서의 프레임손실은닉을 위한 비선형 회귀분석 기법)

  • Choi, Seung-Ho;Sung, Ho-Sang
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.9 no.5
    • /
    • pp.129-132
    • /
    • 2009
  • Frame erasure is one of the most difficult problems in voice over IP (VoIP) networks and is a major source of speech quality degradation. In this paper, a frame erasure concealment algorithm based on nonlinear regression analysis is presented to minimize speech quality deterioration in code-excited linear prediction (CELP) based coders. We applied the proposed scheme to the ITU-T G.729 standard and obtained improved perceptual evaluation of speech quality (PESQ) scores compared to the conventional methods.

  • PDF

Dynamic Transmission Control Design in Buffered MC-CDMA System (버퍼를 가진 다중코드-코드분할다중접속(MC-CDMA) 시스템에서 동적 전송 제어 프로토콜 설계)

  • Kim, Young-Yong
    • Journal of the Institute of Electronics Engineers of Korea TC
    • /
    • v.39 no.9
    • /
    • pp.20-27
    • /
    • 2002
  • The demand for multimedia transmission in wireless networking is rapidly increasing. Performance evaluation in the CDMA system has been carried out on the voice-oriented system without re-transmission or buffering. We propose a multimedia access protocol with buffering and ARQ which can meet a variety of QoS requirements. We study the effect of buffering in MC-CDMA(Multi-Code CDMA) and design a dynamic rate control algorithms, whose simulation shows the efficiency of MC-CDMA.