• Title/Summary/Keyword: Voice Training

Search Result 177, Processing Time 0.022 seconds

Study of Korean Symptom Expression in 119 Emergency Calls (119 구급 신고 전화의 한국어 증상 표현 연구)

  • Jang, Yoonhee;Kang, Kyunghee;Jang, Kyungho;Kim, Kyeonghae
    • Fire Science and Engineering
    • /
    • v.30 no.4
    • /
    • pp.135-140
    • /
    • 2016
  • To help emergency medical dispatchers receive rapid and accurate identification and corrective action status determination of an emergency call, and to support the automatic processing of a voice recognition system to the Korean emergency medical dispatch system, emergency call records were analyzed. Furthermore, a list of Korean symptoms expression were produced and the characteristics of the symptoms that appear on the actual wording of the telephone records were identified. This language list and its characteristics will be useful for training emergency medical dispatchers.

Synthesis of Expressive Talking Heads from Speech with Recurrent Neural Network (RNN을 이용한 Expressive Talking Head from Speech의 합성)

  • Sakurai, Ryuhei;Shimba, Taiki;Yamazoe, Hirotake;Lee, Joo-Ho
    • The Journal of Korea Robotics Society
    • /
    • v.13 no.1
    • /
    • pp.16-25
    • /
    • 2018
  • The talking head (TH) indicates an utterance face animation generated based on text and voice input. In this paper, we propose the generation method of TH with facial expression and intonation by speech input only. The problem of generating TH from speech can be regarded as a regression problem from the acoustic feature sequence to the facial code sequence which is a low dimensional vector representation that can efficiently encode and decode a face image. This regression was modeled by bidirectional RNN and trained by using SAVEE database of the front utterance face animation database as training data. The proposed method is able to generate TH with facial expression and intonation TH by using acoustic features such as MFCC, dynamic elements of MFCC, energy, and F0. According to the experiments, the configuration of the BLSTM layer of the first and second layers of bidirectional RNN was able to predict the face code best. For the evaluation, a questionnaire survey was conducted for 62 persons who watched TH animations, generated by the proposed method and the previous method. As a result, 77% of the respondents answered that the proposed method generated TH, which matches well with the speech.

Six Sigma and the Cost of(Poor) Quality

  • Aca;U, Jichao-X
    • International Journal of Quality Innovation
    • /
    • v.3 no.2
    • /
    • pp.159-173
    • /
    • 2002
  • Any organization's Six Sigma program may be at high risk without heeding the lessons learned from the past and that tries to operate without a robust business foundation. A foundation that preferably should consist of stepping-stones such as a 5-S house-keeping program, an effective Integrated Management System (IMS), which includes a strong focus on planning for quality to fully capture the Voice of the Customer (VOC), and an organization-wide training scheme, as well as a reliable Cost of Poor Quality (COPQ) system. That's the best advise I can give to any organization that wishes to embark on a Six Sigma improvement program and hope to be successful. The paper will elaborate on the above issues and provide suggested solutions based on the review of published historical information and the experiences encountered over the last four decades by the author, as a quality practitioner and consultant, in industries that produced safety-critical product. This author maintains that few fundamentally new or useful things have been created in the field of Quality during the last couple of decades. Nevertheless, this paper deliberates on a number of relatively “newer” issues including the concept of “three types of customers”, the CTC, “Critical To Customer” term, the eight Quality Management Principles of the new ISO 9000 family, the growth of industry-specific standards, the adoption of Integrated Management Systems, the rebirth of AS2561 COQ standard, the spread of Six Sigma as well as related ASQ certification and the need for a robust business foundation to ensure Six Sigma survival.

A Study on the Educational Uses of Smart Speaker (스마트 스피커의 교육적 활용에 관한 연구)

  • Chang, Jiyeun
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.11
    • /
    • pp.33-39
    • /
    • 2019
  • Edutech, which combines education and information technology, is in the spotlight. Core technologies of the 4th Industrial Revolution have been actively used in education. Students use an AI-based learning platform to self-diagnose their needs. And get personalized training online with a cloud learning platform. Recently, a new educational medium called smart speaker that combines artificial intelligence technology and voice recognition technology has emerged and provides various educational services. The purpose of this study is to suggest a way to use smart speaker educationally to overcome the limitation of existing education. To this end, the concept and characteristics of smart speakers were analyzed, and the implications were derived by analyzing the contents provided by smart speakers. Also, the problem of using smart speaker was considered.

Evaluation of Larynx Cancer via Chemometrics Assisted Raman Spectroscopy

  • Senol, Onur;Albayrak, Mevlut
    • Current Optics and Photonics
    • /
    • v.3 no.2
    • /
    • pp.150-153
    • /
    • 2019
  • Larynx cancer is a potentially terminal and severe type of neck and head cancer in which malignant cells start to grow and spread upwards in the larynx, or voice box. Smoking tobacco, drinking hot beverages and drinking alcohol are the main risk factors for these tumors. In this study, we aimed to develop a precise, accurate and rapid chemometrics assisted Raman spectroscopy method for diagnosis of larynx cancer in deparaffinized tissue samples. In the proposed method, samples were deparaffinized and 20 microns of each tissue were located on a coverslip. Both healthy (n = 13) and cancerous tissues (n = 13) were exposed to a Raman laser (785 nm) and excitations were recorded between wavenumbers of $50{\sim}1500cm^{-1}$. An Orthogonal Partial Least Square algorithm was applied to evaluate the Raman spectrum obtained. Sensitivity and specificity of the proposed method is high enough with the aid of Principal Component Analysis (PCA) to test the whole model. Healthy and cancerous tissues were accurately and precisely clustered. A rapid, easy and precise diagnosis algorithm was developed for larynx cancer. By this method, some useful data about differences in biomolecules of each group (phospholipids, amides, tyrosine, phenylalanine collagen etc.) was also obtained from the spectra. It is claimed that the optimized method has a great potential for clustering and separating tumor tissues from healthy ones. This novel, rapid, precise and objective diagnosis method may be an alternative for the conventional methods in literature for diagnosis of larynx cancer.

Environment Factors Affecting Childhood Obesity: Voices from Students, Parents, and Teachers with Photograph (포토보이스를 이용한 아동 비만에 영향을 주는 환경 요인에 관한 탐색적 연구)

  • Park, Eunok;Lee, Hyo Young
    • Journal of Korean Academy of Nursing
    • /
    • v.49 no.3
    • /
    • pp.254-262
    • /
    • 2019
  • Purpose: This study aimed to explore the environmental factors affecting childhood obesity using photovoice from the perspectives of students, parents, and teachers in the community. Methods: Six school students, seven parents, and seven school teachers completed an assignment requiring them to take 24 pictures and participate in group discussions. After training session, the participants were asked to take pictures associated with food and physical activity environments related to childhood obesity at home, school, and within their communities for two weeks and to submit the pictures with records. Each group had four sessions for discussion. Results: School cafeteria, convenience stores near schools, instant food and fast food joints, food delivery, and high-calorie snacks comprised the food environmental factors. Lack of physical activity classes at school, commuting by car, barriers to physical activity, and use of smart-phone were environmental factors that inhibited physical activity. Conclusion: To reduce childhood obesity, the creation of a supportive environment for encouraging the consumption of healthy foods and enhancing physical activity should be considered. Modifications of and improvement to the obesogenic environment might be a good strategy to prevent and reduce childhood obesity.

Knowledge-driven speech features for detection of Korean-speaking children with autism spectrum disorder

  • Seonwoo Lee;Eun Jung Yeo;Sunhee Kim;Minhwa Chung
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.53-59
    • /
    • 2023
  • Detection of children with autism spectrum disorder (ASD) based on speech has relied on predefined feature sets due to their ease of use and the capabilities of speech analysis. However, clinical impressions may not be adequately captured due to the broad range and the large number of features included. This paper demonstrates that the knowledge-driven speech features (KDSFs) specifically tailored to the speech traits of ASD are more effective and efficient for detecting speech of ASD children from that of children with typical development (TD) than a predefined feature set, extended Geneva Minimalistic Acoustic Standard Parameter Set (eGeMAPS). The KDSFs encompass various speech characteristics related to frequency, voice quality, speech rate, and spectral features, that have been identified as corresponding to certain of their distinctive attributes of them. The speech dataset used for the experiments consists of 63 ASD children and 9 TD children. To alleviate the imbalance in the number of training utterances, a data augmentation technique was applied to TD children's utterances. The support vector machine (SVM) classifier trained with the KDSFs achieved an accuracy of 91.25%, surpassing the 88.08% obtained using the predefined set. This result underscores the importance of incorporating domain knowledge in the development of speech technologies for individuals with disorders.

Influence of Temporo-mandibular Joint Training Using Physical Therapy on the Vowel Acoustic Characteristics (TM Joint의 물리치료를 통한 훈련이 모음의 음향학적 특성에 미치는 영향)

  • Min, Dong-Gi;Lee, Jae-Hong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.5
    • /
    • pp.2203-2208
    • /
    • 2011
  • This study was to examine the change of vowel acoustic characteristics of the temporomandibular joint disorder patients by maintaining normal vocalization pattern of the temporomandibular joint through increasing the range of motion, that was, the oral cavity sonorant cavity of the temporomandibular joint, related to vowel articulation through temporomandibular training using the physical therapy. The subjects of this study were 3 male adults in 20-30s that were diagnosed with temporomandibular joint disorder. As a result of conducting temporomandibular training program using the physical therapy, the $1^{st}$ Formant Frequency(F1), $2^{nd}$ Formant Frequency(F2), and Fundamental Frequency(F0) of the temporomandibular joint disorder patients were increased compared to before and this showed the change of the $1^{st}$ Formant Frequency(F1) related to the open mouth grade of a vowel, as well as the $2^{nd}$ Formant Frequency(F2), and Fundamental Frequency(F0) related to the front-back of a vowel which shows the relationship between the temporomandibular joint, vowels and voice calculation.

Feedback on Peer Feedback in EFL Composing: Four Stories

  • Huh, Myung-Hye;Lee, Jang Ho
    • Journal of English Language & Literature
    • /
    • v.57 no.6
    • /
    • pp.977-998
    • /
    • 2011
  • The purpose of this study is to investigate prospective teachers' perceptions of the peer review comments readily available to them during the writing process in a teacher training class. Given these needs, we employ a qualitative method of inquiry giving voice to the learner's own view of peer feedback. The data we wish to consider is first-person narratives elicited from four EFL college students, who are prospective teachers of English. With regard to the EFL students' narrative considered here, all were attentive to the feedback they received. Moreover, the way in which these EFL writers talk about peer response activity reflects that they still welcome peer feedback because of the benefits to be accrued from it. Although this study, covering only four EFL students in total, can hardly be considered conclusive, we attempt to offer a synthesis of their stories. First of all, students indicate that they received responses from "authentic readers" (Mittan 1989, 209). We do note, consequently, that students gain a clear understanding of readers' needs by receiving feedback on what they did well and on what seems unclear. Perhaps the greater effect of peer feedback claimed by these students is that they take active roles in utilizing peer comments. Since they feel uncertain about the validity of their classmates' responses, students feel that they have autonomy over their own text and can make their own decisions on whether they should accept their peer comments or not. This contrasts with their treatment of teacher comments that they accept begrudgingly even if they disagree with them. Four EFL writers talked a lot, typically in a positive way, about peer response to their writing, yet they have expressed reservations about the extent to which they should put any credence in comments offered by their fellow students. Perhaps this is because their fellow students are still developing writers and EFL learners. In turn, they were sometimes reluctant to accept the peers' comments. Thus, in EFL contexts, L1 use can be suggested during peer feedback sessions. In particular, we have come to feel that L1 use enables both reviewers and receivers to have more productive peer review experiences. Additionally, we need to train students not "to see peer feedback as potentially bad advice" (Silva et al. 2003, 111). Teachers should focus on training students to utilize their peers' comments. Without such training, students will either ignore feedback or fail to use it constructively.

Laryngeal Findings and Phonetic Characteristics in Prelingually Deaf Patients (언어습득기 이전 청각장애인의 후두소견 및 음성학적 특성)

  • Kim, Seong-Tae;Yoon, Tae-Hyun;Kim, Sang-Yoon;Choi, Seung-Ho;Nam, Soon-Yuhl
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.20 no.1
    • /
    • pp.57-62
    • /
    • 2009
  • Background and Objectives : There are few studies reported that specifically examine the laryngeal function in patients with profound hearing loss or deafness, This study was designed to examine videostroboscopic findings and phonetic characteristics in adult patients with prelingually deaf. Materials and Method: Sixteen patients (seven males, nine females) diagnosed as prelingually deaf aged from 19 to 54 years, and were compared with a 20 normal control group with no laryngeal pathology and normal hearing group, Videostroboscopic evaluations were rated by experienced judges on various parameters describing the structure and function of the laryngeal mechanism during comfortable pitch and loudness phonations. Acoustic analysis test were done, and a nasalance test performed to measure rabbit, baby, and mother passage. CSL were measured to determine the first and two formant frequencies of vowels /a/, /i/, /u/, Statistical analysis was done using Mann-Whitney U or Wilcoxon signed ranks test. Results: Videostroboscopic findings showed phase symmetry but significantly more occurrences decrement in the amplitude of vibration, mucosal wave, irregularity of the vibration and increased glottal gap size during the closed phase of phonation, In addition, group of prelingually deaf patients were observed to have significantly more occurrences of abnormal supraglottic activities during phonation. The percentage of shimmer in the group of prelingually deaf patients were higher than in the control group. Characteristics of vowels were lower of the second formant of the vowel /i/. Nasalance in prelingually deaf patients showed normal nasality for all passages, Conclusion: Prelingually deaf patients show stroboscopic abnormal findings without any mucosal lesion, suggesting that they have considerable functional voice disorder. We suggest that prelingually deaf adults should perform vocal training for normalized laryngeal function after cochlear implantation.

  • PDF