• Title/Summary/Keyword: Voice Training

A Proposal on the Development of Chemical-Biological-Radiological-Nuclear-Explosive (CBRNE) Emergency Medical Training Program for Fire Officers

  • 김지희
    • 한국화재소방학회논문지 / Vol. 21, No. 4 / pp.99-104 / 2007
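  • In step with globalization, many international conferences and international sporting events are being held in Korea. Since the September 11, 2001 attacks in the United States, and with the subsequent deployment of troops to Iraq, there has been growing concern that Korea is no longer a safe zone from terrorism. Because biological and bomb attacks are occurring around the world, the need for CBRNE terrorism education and training for fire officers in Korea has emerged. This paper therefore proposes a CBRNE emergency medical education and training curriculum for fire officers.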

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers

  • 이경철;홍석진;진성민
    • 대한후두음성언어의학회지 / Vol. 15, No. 2 / pp.122-127 / 2004
  • Background and Objectives: Many studies have described and analyzed the singer's formant, and the epilaryngeal tube in the human airway has been shown to be responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods: Ten tenors, fifteen baritones, fifteen sopranos, and ten mezzo-sopranos attending a music college department of vocal music were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long-term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test in the Statistical Package for the Social Sciences (SPSS). Results: For the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500 Hz region (p<0.01) in the four trained singer groups compared with the untrained speaker group, and a significant increase was seen in the 9,000-10,000 Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', there was a significant increase in energy in the 2,500-3,500 Hz region (p<0.01) in the tenor, baritone, and mezzo-soprano groups compared with the untrained speaker group, and a significant increase in all frequency regions (p<0.01) in the soprano group. Conclusions: The LTA power spectrum suggests that the trained singer groups show more energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.
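
The long-term average spectrum described in this abstract can be illustrated with a minimal, dependency-free sketch. It is not the CSL procedure used by the authors. The frame length, hop size, Hann window, sample rate, and the synthetic "voices" below are all assumptions chosen for illustration, and the helper names (`ltas`, `band_share_db`, `synth_voice`) are hypothetical. The sketch averages the power spectra of overlapping frames and then reports how much of the total energy falls in the 2,500-3,500 Hz band.

```python
import cmath
import math


def ltas(signal, frame_len=256, hop=128):
    """Average the power spectra of Hann-windowed frames (a basic LTAS)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1))
              for n in range(frame_len)]
    # Twiddle factors for a direct DFT: slow, but needs no external library.
    twiddle = [cmath.exp(-2j * math.pi * m / frame_len)
               for m in range(frame_len)]
    n_bins = frame_len // 2 + 1
    acc = [0.0] * n_bins
    n_frames = 0
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        for k in range(n_bins):
            coeff = sum(frame[n] * twiddle[(k * n) % frame_len]
                        for n in range(frame_len))
            acc[k] += abs(coeff) ** 2
        n_frames += 1
    return [p / n_frames for p in acc]


def band_share_db(spectrum, sample_rate, frame_len, lo_hz, hi_hz):
    """Energy inside [lo_hz, hi_hz] relative to total energy, in dB."""
    bin_hz = sample_rate / frame_len
    band = sum(p for k, p in enumerate(spectrum)
               if lo_hz <= k * bin_hz <= hi_hz)
    return 10 * math.log10(band / sum(spectrum))


def synth_voice(sample_rate, n_samples, f0, ring_gain):
    """Synthetic harmonic series with 1/h rolloff (assumed test signal).

    Harmonics between 2,500 and 3,500 Hz are scaled by ring_gain.
    """
    n_harm = int((sample_rate / 2 - 1) // f0)
    out = []
    for n in range(n_samples):
        t = n / sample_rate
        s = 0.0
        for h in range(1, n_harm + 1):
            f = h * f0
            gain = ring_gain if 2500 <= f <= 3500 else 1.0
            s += gain / h * math.sin(2 * math.pi * f * t)
        out.append(s)
    return out


SR = 16000  # assumed sample rate, not taken from the abstract
plain = synth_voice(SR, 2048, 250.0, ring_gain=1.0)
ringing = synth_voice(SR, 2048, 250.0, ring_gain=4.0)

plain_db = band_share_db(ltas(plain), SR, 256, 2500, 3500)
ring_db = band_share_db(ltas(ringing), SR, 256, 2500, 3500)
print(f"2.5-3.5 kHz share: plain {plain_db:.1f} dB, ringing {ring_db:.1f} dB")
```

The signal with boosted harmonics shows a larger share of energy in the band, which is the kind of group difference the study tested statistically on real recordings.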

Effects of Voice Therapy Using Gliding and Humming in Dysphonic Patients With Glottal Gap

  • 정대용;심미란;황연신;김근전;선동일
    • 대한후두음성언어의학회지 / Vol. 32, No. 2 / pp.81-86 / 2021
  • Background and Objectives: Various voice therapies for glottal gap have been reported previously. However, these therapies had limitations because many techniques focused on only one of breathing, resonance, and phonation. In addition, patients often have difficulty visiting the hospital frequently. 'Gliding and humming' is a vocal training technique that readjusts the overall vocal pattern, including breathing, resonance, and phonation, and it can be easily applied in short-term sessions. The purpose of this study is to evaluate the effectiveness of voice therapy with 'gliding and humming' for patients with glottal gap during short-term treatment sessions. Materials and Method: Twenty-three patients with glottal gap were selected. Of these, 14 patients had sulcus vocalis and 12 patients had muscle tension dysphonia (MTD). Patients received an average of 1.9 voice therapy sessions. GRBAS, jitter, shimmer, noise-to-harmonic ratio, semitone range, closed quotient (vowel), and maximum phonation time were compared before and after therapy. In addition, changes in glottal gap and MTD severity were evaluated. Results: Statistically significant improvement was observed. MTD improvement was observed only among patients whose glottal gap improved. The sulcus vocalis group also showed statistically significant improvement. Conclusion: 'Gliding and humming' was effective for patients with glottal gap and sulcus vocalis. Also, among patients who have both glottal gap and MTD, the data suggest that voice therapy for glottal gap also improves MTD.

Tube phonation in water for patients with hyperfunctional voice disorders: The effect of tube diameter and water immersion depth on bubble height and maximum phonation time

  • 김민경;최성희;윤종인
    • 말소리와 음성과학 / Vol. 15, No. 2 / pp.31-40 / 2023
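  • Purpose: Tube phonation in water is a semi-occluded vocal tract (SOVT) exercise in which the patient phonates while producing bubbles through a tube immersed in water, and it has been widely used in voice training. This study aims to investigate the effects of tube diameter and water immersion depth on bubble height and maximum phonation time (MPT) during tube phonation in water in patients with hyperfunctional voice disorders. Methods: Seventeen patients with hyperfunctional voice disorders were asked to produce bubbles while sustaining /u/ phonation across tube diameters (5, 7, 10 mm) and water immersion depths (4, 7, 10 cm). A tube-phonation biofeedback system using a water level sensor was used to record bubble height and MPT. Results: Bubble height changed significantly with tube diameter, whereas MPT changed significantly with both tube diameter and depth. Wider diameters produced significantly lower bubble height at a given depth, although a relatively consistent bubble height was maintained. For a given tube diameter, bubble height did not differ significantly with water depth, but MPT decreased significantly with water depth and also decreased significantly with wider tubes. Conclusion: The water-level-sensor-based biofeedback system provided useful information on bubble characteristics and vocal fold vibration according to tube diameter and water depth. In addition, the system can be useful for monitoring breath support during tube phonation in water in patients with hyperfunctional voice disorders.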

Extending StarGAN-VC to Unseen Speakers Using RawNet3 Speaker Representation

  • 박보경;박소민;홍현기
    • 정보처리학회논문지:소프트웨어 및 데이터공학 / Vol. 12, No. 7 / pp.303-314 / 2023
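  • Voice conversion is a technology that regenerates an individual's speech data with the acoustic characteristics (pitch, rhythm, gender, etc.) of another person, and it is used in various fields such as education, communication, and entertainment. This paper proposes an approach based on the StarGAN-VC model that can generate realistic speech without parallel utterances. To overcome the constraint of the existing StarGAN-VC model, which uses one-hot vectors of fixed source and target speaker information, this paper uses a pre-trained RawNet3 to extract a feature vector of the target speaker. As a result, voice conversion takes place in the latent space without direct speaker-to-speaker mapping, enabling an any-to-any structure beyond many-to-many. In addition to the loss functions used in the existing StarGAN-VC model, the Wasserstein-1 distance was used to ensure that the generated speech segments match the acoustic characteristics of the target speech. The Two Time-Scale Update Rule (TTUR) is also used for stable training. Experimental results using the evaluation metrics presented in the paper quantitatively confirm that, compared with the existing StarGAN-VC technique, which allows only limited voice conversion, the proposed method provides voice conversion with improved performance across diverse speakers.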
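
The abstract names the Wasserstein-1 distance but does not give its formulation. In GAN training it is usually estimated with a critic network, which is not shown here. For intuition only, the sketch below computes the exact Wasserstein-1 distance between two one-dimensional samples of equal size, where it reduces to the mean absolute difference between the sorted values. The function name `wasserstein1_1d` and the toy feature values are hypothetical and are not taken from the paper.

```python
def wasserstein1_1d(xs, ys):
    """Exact Wasserstein-1 distance between two equal-size 1-D samples.

    For equal-size empirical distributions on the real line, the optimal
    transport plan pairs the i-th smallest value of xs with the i-th
    smallest value of ys.
    """
    if not xs or len(xs) != len(ys):
        raise ValueError("samples must be non-empty and of equal size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in pairs) / len(xs)


# Toy scalar "acoustic features" (assumed values, for illustration only).
generated = [0.0, 1.0, 2.0]
target = [3.0, 1.0, 2.0]

shifted = wasserstein1_1d(generated, target)
same = wasserstein1_1d(generated, [2.0, 0.0, 1.0])
print(shifted, same)
```

The distance is zero when the two samples contain the same values in any order and grows as the generated values drift away from the target values.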

An end-to-end synthesis method for Korean text-to-speech systems

  • 최연주;정영문;김영관;서영주;김회린
    • 말소리와 음성과학 / Vol. 10, No. 1 / pp.39-48 / 2018
  • A typical statistical parametric speech synthesis (text-to-speech, TTS) system consists of separate modules, such as a text analysis module, an acoustic modeling module, and a speech synthesis module. This causes two problems: 1) expert knowledge of each module is required, and 2) errors generated in each module accumulate as they pass through the subsequent modules. An end-to-end TTS system could avoid such problems by synthesizing voice signals directly from an input string. In this study, we implemented an end-to-end Korean TTS system using Google's Tacotron, an end-to-end TTS system based on a sequence-to-sequence model with an attention mechanism. We used 4392 utterances spoken by a Korean female speaker, an amount that corresponds to 37% of the dataset Google used for training Tacotron. Our system obtained a mean opinion score (MOS) of 2.98 and a degradation mean opinion score (DMOS) of 3.25. We discuss the factors that affected training of the system. Experiments demonstrate that the post-processing network needs to be designed with the output language and input characters in mind, and that, depending on the amount of training data, the maximum value of n for the n-grams modeled by the encoder should be kept small enough.

A Phonetic Analysis of Yodel Singing by the Electroglottographic (EGG) Measurement

  • 서동일;최헝식
    • 음성과학 / Vol. 7, No. 2 / pp.113-126 / 2000
  • A comparative phonetic analysis of Yodel singing and Belcanto singing by electroglottographic (EGG) measurement was done in three singers: one professional tenor singer (SDI) who is also well trained in Yodel singing, one yodeler (KWS) who is not as well trained in Belcanto singing, and one tenor singer in training (CSK) who is not well trained in either Yodel or Belcanto singing. Closed quotient (CQ), speed quotient (SQ), and fundamental frequency (F0) at the initial modal part (I), middle falsetto part (M), and final modal part (F) of the same phrase were measured with an EGG machine and program (Kay model 4338). In the middle part, both CQ and SQ of Yodel singing were much smaller than those of Belcanto singing in all three singers. However, the accuracy of the parameters in the Belcanto singing of the yodeler (KWS), and in both the Yodel and Belcanto singing of the singer in training (CSK), was inferior to that of the trained tenor singer (SDI). Possible advantages of using Yodel singing training under the guidance of EGG feedback control for hyperfunctional voice disorders such as vocal nodules were discussed.
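
The closed quotient reported in this abstract is the fraction of one glottal cycle during which the vocal folds are in contact. As a rough sketch, and not the algorithm of the Kay program, it can be estimated from a single EGG cycle with a threshold criterion. The 50% criterion level, the idealized rectangular contact pulses, and the names `closed_quotient` and `synth_cycle` are all assumptions made for illustration.

```python
def closed_quotient(cycle, criterion=0.5):
    """Fraction of samples in one EGG cycle above a criterion level.

    cycle: contact signal for one glottal period (higher = more contact).
    criterion: threshold position between the cycle minimum and maximum
    (0.5 is an assumed value; the source does not state the level used).
    """
    lo, hi = min(cycle), max(cycle)
    if hi == lo:
        raise ValueError("cycle has no amplitude variation")
    threshold = lo + criterion * (hi - lo)
    closed = sum(1 for v in cycle if v > threshold)
    return closed / len(cycle)


def synth_cycle(n_samples, n_contact):
    """Idealized cycle: full contact for n_contact samples, then open."""
    return [1.0] * n_contact + [0.0] * (n_samples - n_contact)


# Assumed example cycles: a longer closed phase for a modal-like cycle
# and a shorter one for a falsetto-like cycle.
modal_cq = closed_quotient(synth_cycle(100, 55))
falsetto_cq = closed_quotient(synth_cycle(100, 30))
print(modal_cq, falsetto_cq)
```

A smaller value for the falsetto-like cycle mirrors the direction of the CQ difference the abstract reports for the middle falsetto part of Yodel singing.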

Recent Trends in Education and Training for Information Professionals in the U.S. and Their Impact on Library Education Programs in Korea

  • 한순정
    • 한국문헌정보학회지 / Vol. 12 / pp.149-163 / 1985
  • This short survey article examines current curricula for library and information science education in the U.S., with a view to applying them to professional education in the field in Korea so as to produce qualified and competent graduates. Some of the prevailing trends in education and training for information professionals in the U.S. are as follows: 1. Library schools tend to incorporate information science into their curricula to reflect their emphasis on this area, and attempt to develop close ties with all segments of the information industry; 2. Library schools actively participate in cooperative research with other agencies to explore ways of solving problems; 3. There is a diversity of education and training programs to meet the needs of a wide variety of information professionals, with library school faculty members being drawn from a wide range of scholarly disciplines; 4. New methods of teaching are being developed to support research and instructional activities; 5. There has been a significant change in the composition of the student body, which is now given a strong voice in the administration of the library school.

Tone Deafness and Implications for Music Therapy Strategies for Treatment

  • Chong, Hyun Ju
    • 인간행동과 음악연구 / Vol. 2, No. 2 / pp.69-79 / 2005
  • This study aimed to examine the definition of tone deafness, to review the causal factors reported in the research literature, and to examine the therapeutic application of music to the treatment of tone deafness. The review found that there can be three different kinds of tone deafness: amusia, agnosia, and asonia. It also showed that tone deafness has frequently been addressed in studies seeking to verify causal factors such as gender, age, and environment. Over time, the research trend on tone deafness has shifted towards a neurological approach that closely examines brain activity, presenting the view that the brain's capacity to perceive modest pitch changes may be congenitally impaired. Physiological factors also contribute to tone deafness, such as diplacusis, a phenomenon wherein a given tone is heard as different pitches by the two ears, resulting in conflicting bilateral perception of pitch. Music can be used to treat the various factors causing tone deafness. The most efficient intervention was a singing program. Pitch-matching training using an operant conditioning procedure can be effective. Successive approximation, or reinforcement of correct responses alone, was the more efficient procedure for helping uncertain singers to sing on pitch. Progressive breathing exercises also helped pitch-matching training, in which one has to coordinate hearing and voice.

Presentation Training System based on 3D Virtual Reality

  • 정영기
    • 문화기술의 융합 / Vol. 4, No. 4 / pp.309-316 / 2018
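  • This study proposes a 3D virtual-reality-based presentation training system that implements a lifelike virtual presentation environment to help users present with confidence in real situations. The proposed system analyzes the presenter's voice and behavior in real time and reflects them in the audience in the virtual space, providing a realistic and highly immersive presentation and interview environment. The presenter wears an HMD with 6DOF tracking and VR controllers and, using Kinect, can change the viewpoint and interact within the virtual space, which can be switched among various environments set by the user. The presenter practices mastering the content and the delivery while viewing the presentation file and script displayed in a separate view provided in the virtual space.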