• 제목/요약/키워드: Voice Training

검색결과 177건 처리시간 0.022초

119 구급 신고 전화의 한국어 증상 표현 연구 (Study of Korean Symptom Expression in 119 Emergency Calls)

  • 장윤희;강경희;장경호;김경혜
    • 한국화재소방학회논문지
    • /
    • 제30권4호
    • /
    • pp.135-140
    • /
    • 2016
  • 구급 신고 전화를 받는 상황요원의 신속하고 정확한 신고자 상태 파악과 적확한 조치 판단을 돕고, 정확한 다중출동을 위한 한국형 응급상황관제 시스템의 음성인식 시스템의 자동적인 처리를 지원하기 위하여 실제 구급 신고 전화의 음성 자료 분석을 통해 한국어의 응급 증상 표현 목록을 작성하고 실제 신고전화에 나타난 증상 언어표현의 특징을 파악하였다. 이러한 언어 목록과 특성들은 일선 현장의 상황요원의 교육에도 유용할 것이다.

RNN을 이용한 Expressive Talking Head from Speech의 합성 (Synthesis of Expressive Talking Heads from Speech with Recurrent Neural Network)

  • 사쿠라이 류헤이;심바 타이키;야마조에 히로타케;이주호
    • 로봇학회논문지
    • /
    • 제13권1호
    • /
    • pp.16-25
    • /
    • 2018
  • The talking head (TH) indicates an utterance face animation generated based on text and voice input. In this paper, we propose the generation method of TH with facial expression and intonation by speech input only. The problem of generating TH from speech can be regarded as a regression problem from the acoustic feature sequence to the facial code sequence which is a low dimensional vector representation that can efficiently encode and decode a face image. This regression was modeled by bidirectional RNN and trained by using SAVEE database of the front utterance face animation database as training data. The proposed method is able to generate TH with facial expression and intonation TH by using acoustic features such as MFCC, dynamic elements of MFCC, energy, and F0. According to the experiments, the configuration of the BLSTM layer of the first and second layers of bidirectional RNN was able to predict the face code best. For the evaluation, a questionnaire survey was conducted for 62 persons who watched TH animations, generated by the proposed method and the previous method. As a result, 77% of the respondents answered that the proposed method generated TH, which matches well with the speech.

Six Sigma and the Cost of(Poor) Quality

  • Aca;U, Jichao-X
    • International Journal of Quality Innovation
    • /
    • 제3권2호
    • /
    • pp.159-173
    • /
    • 2002
  • Any organization's Six Sigma program may be at high risk without heeding the lessons learned from the past and that tries to operate without a robust business foundation. A foundation that preferably should consist of stepping-stones such as a 5-S house-keeping program, an effective Integrated Management System (IMS), which includes a strong focus on planning for quality to fully capture the Voice of the Customer (VOC), and an organization-wide training scheme, as well as a reliable Cost of Poor Quality (COPQ) system. That's the best advise I can give to any organization that wishes to embark on a Six Sigma improvement program and hope to be successful. The paper will elaborate on the above issues and provide suggested solutions based on the review of published historical information and the experiences encountered over the last four decades by the author, as a quality practitioner and consultant, in industries that produced safety-critical product. This author maintains that few fundamentally new or useful things have been created in the field of Quality during the last couple of decades. Nevertheless, this paper deliberates on a number of relatively “newer” issues including the concept of “three types of customers”, the CTC, “Critical To Customer” term, the eight Quality Management Principles of the new ISO 9000 family, the growth of industry-specific standards, the adoption of Integrated Management Systems, the rebirth of AS2561 COQ standard, the spread of Six Sigma as well as related ASQ certification and the need for a robust business foundation to ensure Six Sigma survival.

스마트 스피커의 교육적 활용에 관한 연구 (A Study on the Educational Uses of Smart Speaker)

  • 장지연
    • 한국융합학회논문지
    • /
    • 제10권11호
    • /
    • pp.33-39
    • /
    • 2019
  • 교육업계에 교육과 정보기술(IT)을 융합한 '에듀테크' 바람이 불고 있다. 4차 산업혁명 핵심 기술이 최근 교육 분야에 적극 활용되고 있는데 학습자들은 인공지능 기반 학습 플랫폼을 이용해 자신이 부족한 부분을 스스로 진단하고, 클라우드 학습 플랫폼으로 온라인상에서 개인 맞춤형 교육을 받는다. 최근 인공지능 기술과 음성인식 기술을 접목한 스마트 스피커와 같은 새로운 교육 매체가 등장하게 되어 다양한 교육서비스가 시도되고 있다. 본 연구에서는 기존 교육의 한계를 극복하기 위해 스마트 스피커를 교육적으로 활용하는 방안을 제시하고자 하였다. 이를 위해 스마트 스피커의 개념 및 특성을 알아보고 스마트 스피커에서 제공하는 콘텐츠를 분석하여 시사점을 도출하였다. 또한 스마트 스피커이용의 문제점에 대해서도 고찰하였다.

Evaluation of Larynx Cancer via Chemometrics Assisted Raman Spectroscopy

  • Senol, Onur;Albayrak, Mevlut
    • Current Optics and Photonics
    • /
    • 제3권2호
    • /
    • pp.150-153
    • /
    • 2019
  • Larynx cancer is a potentially terminal and severe type of neck and head cancer in which malignant cells start to grow and spread upwards in the larynx, or voice box. Smoking tobacco, drinking hot beverages and drinking alcohol are the main risk factors for these tumors. In this study, we aimed to develop a precise, accurate and rapid chemometrics assisted Raman spectroscopy method for diagnosis of larynx cancer in deparaffinized tissue samples. In the proposed method, samples were deparaffinized and 20 microns of each tissue were located on a coverslip. Both healthy (n = 13) and cancerous tissues (n = 13) were exposed to a Raman laser (785 nm) and excitations were recorded between wavenumbers of $50{\sim}1500cm^{-1}$. An Orthogonal Partial Least Square algorithm was applied to evaluate the Raman spectrum obtained. Sensitivity and specificity of the proposed method is high enough with the aid of Principal Component Analysis (PCA) to test the whole model. Healthy and cancerous tissues were accurately and precisely clustered. A rapid, easy and precise diagnosis algorithm was developed for larynx cancer. By this method, some useful data about differences in biomolecules of each group (phospholipids, amides, tyrosine, phenylalanine collagen etc.) was also obtained from the spectra. It is claimed that the optimized method has a great potential for clustering and separating tumor tissues from healthy ones. This novel, rapid, precise and objective diagnosis method may be an alternative for the conventional methods in literature for diagnosis of larynx cancer.

포토보이스를 이용한 아동 비만에 영향을 주는 환경 요인에 관한 탐색적 연구 (Environment Factors Affecting Childhood Obesity: Voices from Students, Parents, and Teachers with Photograph)

  • 박은옥;이효영
    • 대한간호학회지
    • /
    • 제49권3호
    • /
    • pp.254-262
    • /
    • 2019
  • Purpose: This study aimed to explore the environmental factors affecting childhood obesity using photovoice from the perspectives of students, parents, and teachers in the community. Methods: Six school students, seven parents, and seven school teachers completed an assignment requiring them to take 24 pictures and participate in group discussions. After training session, the participants were asked to take pictures associated with food and physical activity environments related to childhood obesity at home, school, and within their communities for two weeks and to submit the pictures with records. Each group had four sessions for discussion. Results: School cafeteria, convenience stores near schools, instant food and fast food joints, food delivery, and high-calorie snacks comprised the food environmental factors. Lack of physical activity classes at school, commuting by car, barriers to physical activity, and use of smart-phone were environmental factors that inhibited physical activity. Conclusion: To reduce childhood obesity, the creation of a supportive environment for encouraging the consumption of healthy foods and enhancing physical activity should be considered. Modifications of and improvement to the obesogenic environment might be a good strategy to prevent and reduce childhood obesity.

Knowledge-driven speech features for detection of Korean-speaking children with autism spectrum disorder

  • Seonwoo Lee;Eun Jung Yeo;Sunhee Kim;Minhwa Chung
    • 말소리와 음성과학
    • /
    • 제15권2호
    • /
    • pp.53-59
    • /
    • 2023
  • Detection of children with autism spectrum disorder (ASD) based on speech has relied on predefined feature sets due to their ease of use and the capabilities of speech analysis. However, clinical impressions may not be adequately captured due to the broad range and the large number of features included. This paper demonstrates that the knowledge-driven speech features (KDSFs) specifically tailored to the speech traits of ASD are more effective and efficient for detecting speech of ASD children from that of children with typical development (TD) than a predefined feature set, extended Geneva Minimalistic Acoustic Standard Parameter Set (eGeMAPS). The KDSFs encompass various speech characteristics related to frequency, voice quality, speech rate, and spectral features, that have been identified as corresponding to certain of their distinctive attributes of them. The speech dataset used for the experiments consists of 63 ASD children and 9 TD children. To alleviate the imbalance in the number of training utterances, a data augmentation technique was applied to TD children's utterances. The support vector machine (SVM) classifier trained with the KDSFs achieved an accuracy of 91.25%, surpassing the 88.08% obtained using the predefined set. This result underscores the importance of incorporating domain knowledge in the development of speech technologies for individuals with disorders.

TM Joint의 물리치료를 통한 훈련이 모음의 음향학적 특성에 미치는 영향 (Influence of Temporo-mandibular Joint Training Using Physical Therapy on the Vowel Acoustic Characteristics)

  • 민동기;이재홍
    • 한국산학기술학회논문지
    • /
    • 제12권5호
    • /
    • pp.2203-2208
    • /
    • 2011
  • 턱관절의 물리치료를 통한 훈련으로 모음 조음에 관여하는 구강 공명강을 증가시켜, 턱관절의 정상적인 발성패턴을 유지하도록 하여 턱관절장애 환자의 모음의 음향학적 특성 변화를 보고자 하였다. 연구 대상은 턱관절장애로 진단 받은 3명의 20~30대 성인 남자를 대상으로 하였다. 실험방법은 대상자간 중다기초선 설계를 이용하여 기초선 단계, 치료 단계, 유지 단계로 진행하였다. 치료는 한 회기를 30분으로 하여 주 3회 실시하였고 3회기를 한 평가단위로 5회의 중간평가를 실시하였다. 평가방법은 Praat 음성 분석프로그램을 이용하여 /ㅏ/ 모음의 연장 발성에 대한 제 1포먼트 주파수(F1), 제 2포먼트 주파수(F2) 그리고 기본주파수(F0)를 분석하였다. 물리치료를 통한 훈련 프로그램을 실시한 결과 턱관절장애 환자의 제 1포먼트 주파수(F1), 제 2포먼트 주파수(F2) 그리고 기본주파수(F0)는 치료 전 보다 증가하는 변화를 보였고, 이는 모음의 개구도와 관계된 제 1 포먼트 주파수(F1) 뿐만 아니라 모음의 전후설, 성대의 움직임과 관련이 있는 제 2포먼트 주파수(F2) 그리고 기본주파수(F0)의 변화도 함께 보임으로써 턱관절과 모음 및 음성 산출의 연관성을 보여주었다.

Feedback on Peer Feedback in EFL Composing: Four Stories

  • Huh, Myung-Hye;Lee, Jang Ho
    • 영어영문학
    • /
    • 제57권6호
    • /
    • pp.977-998
    • /
    • 2011
  • The purpose of this study is to investigate prospective teachers' perceptions of the peer review comments readily available to them during the writing process in a teacher training class. Given these needs, we employ a qualitative method of inquiry giving voice to the learner's own view of peer feedback. The data we wish to consider is first-person narratives elicited from four EFL college students, who are prospective teachers of English. With regard to the EFL students' narrative considered here, all were attentive to the feedback they received. Moreover, the way in which these EFL writers talk about peer response activity reflects that they still welcome peer feedback because of the benefits to be accrued from it. Although this study, covering only four EFL students in total, can hardly be considered conclusive, we attempt to offer a synthesis of their stories. First of all, students indicate that they received responses from "authentic readers" (Mittan 1989, 209). We do note, consequently, that students gain a clear understanding of readers' needs by receiving feedback on what they did well and on what seems unclear. Perhaps the greater effect of peer feedback claimed by these students is that they take active roles in utilizing peer comments. Since they feel uncertain about the validity of their classmates' responses, students feel that they have autonomy over their own text and can make their own decisions on whether they should accept their peer comments or not. This contrasts with their treatment of teacher comments that they accept begrudgingly even if they disagree with them. Four EFL writers talked a lot, typically in a positive way, about peer response to their writing, yet they have expressed reservations about the extent to which they should put any credence in comments offered by their fellow students. Perhaps this is because their fellow students are still developing writers and EFL learners. In turn, they were sometimes reluctant to accept the peers' comments. Thus, in EFL contexts, L1 use can be suggested during peer feedback sessions. In particular, we have come to feel that L1 use enables both reviewers and receivers to have more productive peer review experiences. Additionally, we need to train students not "to see peer feedback as potentially bad advice" (Silva et al. 2003, 111). Teachers should focus on training students to utilize their peers' comments. Without such training, students will either ignore feedback or fail to use it constructively.

언어습득기 이전 청각장애인의 후두소견 및 음성학적 특성 (Laryngeal Findings and Phonetic Characteristics in Prelingually Deaf Patients)

  • 김성태;윤태현;김상윤;최승호;남순열
    • 대한후두음성언어의학회지
    • /
    • 제20권1호
    • /
    • pp.57-62
    • /
    • 2009
  • Background and Objectives : There are few studies reported that specifically examine the laryngeal function in patients with profound hearing loss or deafness, This study was designed to examine videostroboscopic findings and phonetic characteristics in adult patients with prelingually deaf. Materials and Method: Sixteen patients (seven males, nine females) diagnosed as prelingually deaf aged from 19 to 54 years, and were compared with a 20 normal control group with no laryngeal pathology and normal hearing group, Videostroboscopic evaluations were rated by experienced judges on various parameters describing the structure and function of the laryngeal mechanism during comfortable pitch and loudness phonations. Acoustic analysis test were done, and a nasalance test performed to measure rabbit, baby, and mother passage. CSL were measured to determine the first and two formant frequencies of vowels /a/, /i/, /u/, Statistical analysis was done using Mann-Whitney U or Wilcoxon signed ranks test. Results: Videostroboscopic findings showed phase symmetry but significantly more occurrences decrement in the amplitude of vibration, mucosal wave, irregularity of the vibration and increased glottal gap size during the closed phase of phonation, In addition, group of prelingually deaf patients were observed to have significantly more occurrences of abnormal supraglottic activities during phonation. The percentage of shimmer in the group of prelingually deaf patients were higher than in the control group. Characteristics of vowels were lower of the second formant of the vowel /i/. Nasalance in prelingually deaf patients showed normal nasality for all passages, Conclusion: Prelingually deaf patients show stroboscopic abnormal findings without any mucosal lesion, suggesting that they have considerable functional voice disorder. We suggest that prelingually deaf adults should perform vocal training for normalized laryngeal function after cochlear implantation.

  • PDF