• Title/Summary/Keyword: Voice evaluation

An Ultrasonic Wave Encoder and Decoder for Indoor Positioning of Mobile Marketing System

  • Kim, Young-Mo;Jang, Se-Young;Park, Byeong-Chan;Bang, Kyung-Sik;Kim, Seok-Yoon
    • Journal of the Korea Society of Computer and Information / v.24 no.7 / pp.93-100 / 2019
  • In this paper, we propose an intelligent marketing service system that provides customized advertisements and events to both businesses and customers by identifying location and content from ultrasonic signals and feature information in voice signals. We also develop encoding and decoding algorithms for the ultrasonic signals used in this system and analyze the results of a performance evaluation. With the development of the hyper-connected society, online marketing has become more active and is growing in size. Existing store marketing applications have the disadvantage that customers must look up the events or promotional materials offered by the headquarters or stores through the corresponding application every time they visit. To solve this problem, there have been attempts to create intelligent marketing tools using GPS and voice recognition technology. However, this approach faces technical difficulties with location accuracy and with the speed of comparison and retrieval in voice recognition, and the resulting marketing services for customer relations are also highly simplified.

A Study on the Sasang Constitutional Diagnosis by Perceptual Voice Analysis (청각적(聽覺的) 성음분석(聲音分析)을 통한 사상체질진단(四象體質診斷)에 관한 연구(硏究))

  • Yoo, Jun-Sang;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine / v.16 no.3 / pp.46-58 / 2004
  • 1. Objectives: This study examined the voices of the Sasang constitutional types by means of perceptual evaluation. 2. Methods: 73 female subjects were classified using three questionnaires (QSCCII, QSCCI, and the Sasang Pattern Identification Questionnaire) into three groups: 23 Soyangin, 28 Taeumin, and 22 Soeumin. The 73 voice samples were presented three times to a group of 5 judges, with an interval of 14 days between ratings. The four goals of the study were to evaluate the intraobserver reliability between ratings, to evaluate the interobserver reliability, to evaluate the agreement between each rating and the questionnaire result, and to form a consensus description of the voice of each Sasang constitution. 3. Results & Conclusions: The intraobserver reliability between the first and second ratings was statistically significant for all observers, and that between the second and third ratings was significant for all but one observer. The interobserver reliability across the three ratings was statistically significant except between one observer and two others in the first rating, and between two other observers in the second rating. For the agreement between each rating and the questionnaire result, one observer in the first rating, another in the second rating, and two others in the third rating showed significance. To form the consensus description, voices were classified along four dimensions: clear/hoarse, high/low, fast/slow, and powerful/powerless. The voice of the Soyangin group was classified as powerful and fast, that of the Taeumin group as powerful, hoarse, and low, and that of the Soeumin group as powerless and slow.
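
The abstract above does not say which statistic was used to measure intraobserver reliability. As a rough illustration only, the sketch below assumes Cohen's kappa, a common choice for agreement between two sets of categorical ratings, applied to one judge's labels from two rating sessions. The function name `cohen_kappa` and the sample labels are hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import Sequence


def cohen_kappa(first: Sequence[str], second: Sequence[str]) -> float:
    """Cohen's kappa for two equally long sequences of categorical labels."""
    if len(first) != len(second) or len(first) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(first)
    # Observed agreement: fraction of samples given the same label twice.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement: product of the marginal label frequencies.
    c1, c2 = Counter(first), Counter(second)
    expected = sum(c1[label] * c2[label] for label in c1) / (n * n)
    if expected == 1.0:
        # Both sessions used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)
```

Calling `cohen_kappa(session1, session2)` with one judge's labels (for example "Soyangin", "Taeumin", "Soeumin") from two sessions returns 1.0 for perfect agreement and values near 0 for chance-level agreement.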

Voice Analysis before and after Radioactive Iodine Ablation in Patients with Total Thyroidectomy (적갑상선 전절제술 환자의 방사성 동위원소치료 전.후 음성의 변화에 대한 연구)

  • Hong, Ki Hwan;Seo, Eun Ji;Lee, Hyun Doo;Yoon, Yun Sub;Lim, Seok Tae
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.24 no.1 / pp.33-40 / 2013
  • Background and Objectives: This study objectively compares and analyzes the acoustic changes in patients with total thyroidectomy before and after RI therapy. Subjects and Methods: A total of 50 patients with total thyroidectomy participated as subjects. Voice samples were obtained post-operatively (Post-OP), before high-dose radioactive iodine therapy (Pre-RIT), and after high-dose radioactive iodine therapy (Post-RIT). Acoustic analysis and maximum phonation time were measured, and the K-VHI (Korean version of the Voice Handicap Index) was used for subjective evaluation. Results: In the comparison of the three periods, mFo (Hz) was significantly reduced in both vowels /a/ and /i/ as the hormone was discontinued. This may be related to a reduction in vocal range. As thyroid hormone was discontinued, the Shim (%) and APQ (%) values, which are the parameters related to the degree of aggressiveness, showed a significant increase in the middle vowel /a/. As thyroid hormone was discontinued, the emotional index of the VHI (Voice Handicap Index) decreased significantly. Conclusion: These results suggest that thyroid hormone suspension is related to increased variation in vocal intensity, increased noise, and reduced vocal range. On the emotional side, the data suggest that the patients' emotional response to their own voice disorder decreased significantly.

Effects of EAI and VAS on perceptual judgement and confidence rating by listeners for voice disorders (청지각적 평가 방식에 따른 음성장애 심한 정도 판단과 자가 신뢰도에 대한 차이)

  • Lee, Ok-Bun;Kim, Sun-Hee;Jeong, Hanjin
    • Journal of the Korea Academia-Industrial cooperation Society / v.15 no.5 / pp.3046-3050 / 2014
  • The purpose of the present study was to evaluate the effect of a 7-point equal-appearing interval scale (EAI) and a visual analogue scale (VAS) on listeners' perceptual judgement of the severity of voice problems in dysphonic speakers and on the reliability of those judgements. 30 undergraduate students studying communication disorders took part as listeners in the perceptual evaluation. The listeners judged overall voice severity on anchored (condition 1) and non-anchored (condition 2) scales for vowel prolongation and reading tasks produced by 25 speakers with voice disorders. The results showed that the scores given with VAS were significantly higher than those given with EAI in both condition 1 and condition 2 for the vowel prolongation and reading tasks. However, the scores given with EAI were higher than those given with VAS for voice severity in vowel prolongation (condition 1) and in the reading task (condition 2). These results suggest that auditory-perceptual scaling procedures need further study with respect to their clinical application in voice disorders.

Ship's Maneuvering and Winch Control System with Voice Instruction Based Learning (음성지시에 의한 선박 조종 및 윈치 제어 시스템)

  • Seo, Ki-Yeol;Park, Gyei-Kark
    • Journal of the Korean Institute of Intelligent Systems / v.12 no.6 / pp.517-523 / 2002
  • In this paper, we propose a system that applies the VIBL method, which adds speech recognition to the LIBL method (a learning method based on the way humans learn through natural language), to a ship's steering system, MERCS, and winch appliances. The VIBL method is used to replace the process in which a linguistic instruction, such as an officer's steering instruction, is carried out by an ableman who operates the steering gear, MERCS, and winch appliances. Specifically, we build a steering operation model suited to the ableman and implement an intelligent steering gear control system that applies the language-instruction-based learning method, presenting appropriate meaning elements and evaluation rules for the ship's steering system, and that responds more efficiently to the commander's voice instructions using fuzzy inference rules. We also implement a system that recognizes the commander's voice instructions and controls the MERCS and winch appliances. We built the steering operation model from the ableman's experience, presented the meaning elements of rudder angle, compass bearing arrival time, and stationary state together with evaluation rules for the intelligent steering system, and corrected the rules of the steersman's operation model using fuzzy inference and a technique that recognizes the commander's voice instructions and converts them to text. Finally, we applied the VIBL method to a speech-recognition ship control simulator and confirmed its effectiveness.

Preliminary Study on Developing Test Items of Swallowing & Communication Screening Protocols for Patients with Head and Neck Burns (안면부 및 경부 화상 환자의 삼킴 및 의사소통능력 선별 프로토콜 개발을 위한 예비 연구)

  • Kim, JungWan;Lee, HyoJin;Lee, Hyun-Joung
    • 재활복지 / v.21 no.2 / pp.217-231 / 2017
  • Evaluation and treatment of patients with head and neck burns must address two areas. The primary consideration is swallowing function, which is needed for nutrition; the second is speech function, which matters for efficient communication and aesthetic impression. The purpose of this study is to compile preliminary items for Communication Screening Protocols that support a comprehensive understanding of swallowing disorders, motor speech disorders, and voice disorders in patients with head and neck burns. We divided the evaluation into four areas ('oral mechanism', 'respiration/voice', 'articulation', and 'swallowing') by referring to overseas studies on the various communication disorders caused by burns, and prepared the final questionnaire after content validity verification by five experts (speech-language pathologists). The Content Validity Index ranged from .50 to .84, which was considered relatively appropriate. The experts disagreed on whether the items in the respiration/voice and swallowing areas were appropriate, whereas there was no disagreement on the oral mechanism and articulation areas. Because the communication difficulties of patients with head and neck burns vary, we expect the protocol to be modified appropriately by evaluating burn patients according to burn type and severity.

A comparison study of the characteristics of pauses and breath groups during paragraph reading for normal female adults with and without voice disorders (정상성인 여성 화자와 음성장애 성인 여성 화자의 문단 낭독 시 휴지 및 호흡단락 특성의 비교)

  • Pyo, Hwa Young
    • Phonetics and Speech Sciences / v.11 no.4 / pp.109-116 / 2019
  • This study was conducted to identify the characteristics of pauses and breath groups made by normal adults and patients with voice disorders while reading a paragraph. Forty normal female adults and forty female patients with a functional voice disorder (18-45 yrs.) read the "Gaeul" paragraph with the "Running Speech" protocol of the Phonatory Aerodynamic System (PAS), by which the pauses with or without inspiration and between or within syntactic words and breath groups were analyzed. The number of pauses with inspiration was found to be higher in the patient group, but the number of pauses without inspiration was higher in the normal group. The rate of syntactic word boundaries with pauses with inspiration was higher in the patient group, while the number of syllables per breath group was higher in the normal group. As these results can be explained by patients' poor breath support due to glottal insufficiency, the question of whether voice disorder patients use their pauses and breath groups properly should be considered carefully in evaluation and intervention.

An Implementation of Multimodal Speaker Verification System using Teeth Image and Voice on Mobile Environment (이동환경에서 치열영상과 음성을 이용한 멀티모달 화자인증 시스템 구현)

  • Kim, Dong-Ju;Ha, Kil-Ram;Hong, Kwang-Seok
    • Journal of the Institute of Electronics Engineers of Korea CI / v.45 no.5 / pp.162-172 / 2008
  • In this paper, we propose a multimodal speaker verification method that uses teeth images and voice as biometric traits for personal verification on mobile terminal equipment. The proposed method acquires the biometric traits using the image and sound input devices of a smartphone, one type of mobile terminal, and performs verification with them. In addition, the proposed method combines the two biometric authentication scores in a multimodal fashion to improve overall performance. For fusion it adopts a weighted-summation method, which has a comparatively simple structure and good performance given the limited resources of the system. The performance of the proposed multimodal speaker authentication system was evaluated using a database acquired with a smartphone from 40 subjects. The experiments showed an EER of 8.59% for teeth verification and 11.73% for voice verification, while the multimodal speaker authentication achieved an EER of 4.05%. Thus, the simple weighted-summation method gave better performance than either teeth or voice verification alone.
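
The weighted-summation fusion and the EER metric mentioned in the abstract above can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: both match scores are assumed to be normalized to [0, 1] with higher values meaning a more likely genuine user, the weight `w_teeth` is a hypothetical parameter, and the EER is approximated by scanning the observed scores as thresholds.

```python
from typing import Sequence


def fuse_scores(teeth: float, voice: float, w_teeth: float = 0.5) -> float:
    """Weighted sum of two match scores (assumed normalized to [0, 1])."""
    if not 0.0 <= w_teeth <= 1.0:
        raise ValueError("w_teeth must lie in [0, 1]")
    return w_teeth * teeth + (1.0 - w_teeth) * voice


def equal_error_rate(genuine: Sequence[float], impostor: Sequence[float]) -> float:
    """Approximate EER: the error rate where false accepts and false rejects balance."""
    if not genuine or not impostor:
        raise ValueError("both score lists must be non-empty")
    best_gap, eer = float("inf"), 1.0
    # Try every observed score as the accept threshold (accept if score >= t).
    for t in sorted(set(genuine) | set(impostor)):
        frr = sum(s < t for s in genuine) / len(genuine)     # genuine users rejected
        far = sum(s >= t for s in impostor) / len(impostor)  # impostors accepted
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2.0
    return eer
```

In use, each verification attempt would be fused with `fuse_scores`, and `equal_error_rate` would then be computed over the fused scores of genuine and impostor attempts. How the weight was chosen is not described in the abstract.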

A Study on Motion Control of the Pet-Robot using Voice-Recognition (음성인식을 이용한 반려 로봇의 모션제어에 대한 연구)

  • Ye-Jin, Cho;Hyun-Seok, Kim;Tae-Sung, Bae;Su-Haeng, Lee;Jin-Hyean, Kim;Jae-Wook, Kim
    • The Journal of the Korea institute of electronic communication sciences / v.17 no.6 / pp.1089-1094 / 2022
  • This paper studies a human-coexistence companion robot that can communicate with people in daily life and help alleviate the gap in care personnel. Using a voice recognition module, a servo motor, and an Arduino board, we built and tested a companion robot equipped with a robot arm control function using voice recognition, a position movement function using RC cars, and a voice recognition function. In the experiments, speech recognition as a function of distance showed the best recognition rate at a distance of 5 to 30 cm, and speech recognition as a function of gender showed a higher recognition rate for the first tone, a monotonous tone. The evaluation results of these motion experiments confirmed that such a companion robot can be built.

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

  • Park, Sung-Jin;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine / v.16 no.1 / pp.61-73 / 2004
  • This study investigated the correlation between Sasang constitutional groups and voice characteristics using a voice analysis system (in this study, CSL). I focused on voice characteristics in terms of harmonics, formant frequency, and formant bandwidth. The subjects were 71 males, classified into three groups: Soeumin, Soyangin, and Taeumin. Constitution was classified in two ways, by the QSCCII (Questionnaire for the Sasang Constitution Classification II) and by interview with a specialist in Sasang constitution, which yielded 31 Soeumin, 18 Soyangin, and 22 Taeumin. Pitch is approximately the fundamental frequency (F0) of the voice. Shimmer in dB evaluates the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. The FFT (Fast Fourier Transform) method in CSL can display sampled voices as harmonics; h1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference between h1 and h2 (h1-h2) reflects the speaker's phonation type, and formant frequency and bandwidth reflect the speaker's vocal tract. I therefore used the harmonics and the formant frequency and bandwidth as voice parameters. First, I recorded the vowel /e/ from all subjects using a microphone, and then analyzed the recordings with CSL. Power Spectrum and Formant History are the CSL menus that display harmonics and formant frequency and bandwidth. The results on the correlation between Sasang constitutional groups and voice parameters are as follows: 1. There is no significant difference in the harmonic amplitude difference (h1-h2) among the three groups. 2. There is a significant difference between the Soeumin group and the Soyangin group in Formant Frequency 1 and Formant Bandwidth 1 (p<0.05). No other parameter showed a significant difference. I infer from the formant bandwidth difference that the Soyangin group has a clearer and brighter voice than the Soeumin group, and I think this result agrees with the text of "Dongyi Suse Bowon" and "Sasangimhejinam".
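
The shimmer measure described in the abstract above (period-to-period variability of the peak-to-peak amplitude, in dB) is commonly computed as the mean absolute dB ratio of consecutive period amplitudes. The sketch below assumes that standard definition; the exact formula used by CSL is not given in the abstract, and the function name `shimmer_db` is hypothetical.

```python
import math
from typing import Sequence


def shimmer_db(peak_to_peak: Sequence[float]) -> float:
    """Mean absolute dB difference between consecutive period amplitudes."""
    if len(peak_to_peak) < 2:
        raise ValueError("need at least two periods")
    if any(a <= 0 for a in peak_to_peak):
        raise ValueError("amplitudes must be positive")
    # 20*log10 of the amplitude ratio gives the level change in dB per period.
    diffs = [
        abs(20.0 * math.log10(nxt / cur))
        for cur, nxt in zip(peak_to_peak, peak_to_peak[1:])
    ]
    return sum(diffs) / len(diffs)
```

A perfectly steady voice (identical amplitudes in every period) gives 0 dB, and larger values indicate more cycle-to-cycle amplitude variation.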
