• 제목/요약/키워드: voice change

검색결과 360건 처리시간 0.025초

음성질환의 후두스트로보스코피 소견 (Laryngo-stroboscopic Findings in Voice Disorders)

  • 김영호;김광문;최홍식;홍원표
    • 대한기관식도과학회:학술대회논문집
    • /
    • 대한기관식도과학회 1993년도 제27차 학술대회 초록집
    • /
    • pp.72-72
    • /
    • 1993
  • 음성질환의 진단을 위하여 사용하는 검사법은 여러가지가 있으며 음성발생의 기전에 근거하여 공기역학적 검사로부터 어음청취검사에 이르기까지 다양하게 시도되고 있다. 이중 성대점막의 진동양상은 간접후두경 만으로는 정확히 관찰하기 어려우므로 후두스트로보스코피, 초고속촬영법, 광전, 전기, 초음파등을 이용한 글로토그라피 및 카이모그라피 등이 사용되고 있는데 임상적으로는 후두스트로보스코피가 가장 널리 사용되어지고 있다. 저자들은 1992년 4월 부터 1993년 3월까지 연세대학교 의과대학 음성언어의학연구소에서 음성검사를 시행하였던 환자들을 대상으로 질환별 스트로보스코피소견의 특징을 파악함으로써 후두질환의 진단 및 치료에 도움을 얻고자 하였다.

  • PDF

Voice Frequency Synthesis using VAW-GAN based Amplitude Scaling for Emotion Transformation

  • Kwon, Hye-Jeong;Kim, Min-Jeong;Baek, Ji-Won;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권2호
    • /
    • pp.713-725
    • /
    • 2022
  • Mostly, artificial intelligence does not show any definite change in emotions. For this reason, it is hard to demonstrate empathy in communication with humans. If frequency modification is applied to neutral emotions, or if a different emotional frequency is added to them, it is possible to develop artificial intelligence with emotions. This study proposes the emotion conversion using the Generative Adversarial Network (GAN) based voice frequency synthesis. The proposed method extracts a frequency from speech data of twenty-four actors and actresses. In other words, it extracts voice features of their different emotions, preserves linguistic features, and converts emotions only. After that, it generates a frequency in variational auto-encoding Wasserstein generative adversarial network (VAW-GAN) in order to make prosody and preserve linguistic information. That makes it possible to learn speech features in parallel. Finally, it corrects a frequency by employing Amplitude Scaling. With the use of the spectral conversion of logarithmic scale, it is converted into a frequency in consideration of human hearing features. Accordingly, the proposed technique provides the emotion conversion of speeches in order to express emotions in line with artificially generated voices or speeches.

음성검사 중 공기역학적 검사에서 한국인의 정상 평균치 (Mean Value of Aerodynamic Study in Normal Korean)

  • 서장수;송시연;권오철;김준우;이희경;정옥란
    • 대한후두음성언어의학회지
    • /
    • 제8권1호
    • /
    • pp.27-32
    • /
    • 1997
  • Recently, many people suffering from voice color change visit otolaryngologist. There is no specific data which can be evaluated objectively for voice color change in korean. In aerodynamic study, maximum phonation time, mean air flow rate, phonatory flow volume and subglottal pressure were tested by using Aerophone II voice function analyzer in korean. 112 male and 122 female aged from 10 to 69 years were randomly selected. Maximum phonation time was 20.8${\pm}$6.4sec in male and 17.2${\pm}$4.1sec in female. Mean air flow rate was 167.1${\pm}$61.4ml/sec. in male and 129.6${\pm}$49.3ml/sec in female. Phonatory flow volume was 3184.5${\pm}$646.0ml in male and 2122.1${\pm}$670.5ml in female. Subglottal pressure was 4.1${\pm}$1.8 cmH2O in male and 3.5${\pm}$1.4cm $H_2O$ in female. There was no statistically significant difference among age groups in all above results.

  • PDF

Voice Change Due to Paratracheal Air Cysts

  • Rhee, Youn Ju;Han, Sung Joon;Chong, Yoo Young;Cho, Hyun Jin;Kang, Shin Kwang;Lee, Choong-Sik;Kang, Min-Woong
    • Journal of Chest Surgery
    • /
    • 제49권4호
    • /
    • pp.313-316
    • /
    • 2016
  • Paratracheal air cysts are a rare entity in which cystic formation occurs adjacent to the trachea. Most patients with paratracheal air cysts are asymptomatic, and the cysts are detected incidentally on chest radiograph or computed tomography (CT) scan. Most symptomatic patients complain of pulmonary symptoms or repeated respiratory infection. Rarely, the air cysts can lead to paralysis of the recurrent laryngeal nerve as a result of direct compression. We report a case of a 59-year-old male patient who presented with voice change, and the cause was identified as paratracheal air cysts on a chest CT scan. Surgical resection via video-assisted mediastinoscopy was performed, and the voice recovered immediately after the operation.

갑상선 절제 술 후 기능적 음성장애의 공기역학적 특징과 음성치료 중재 (Aerodynamic Features and Voice Therapy Interventions of Functional Voice Disorder after Thyroidectomy)

  • 이창윤;안수연;장현;정희석;손희영
    • 대한후두음성언어의학회지
    • /
    • 제26권1호
    • /
    • pp.25-33
    • /
    • 2015
  • Background and Objectives:The objective of this study was to investigate the features of post-thyroidectomy subjective voice disorder by Voice Handicap Index (VHI) and Voice Symptom Scale (VOISS) through aerodynamic analysis and to investigate the appropriate voice therapy intervention. Materials and Methods:Twenty post-thyroidectomy patients who had no recurrent laryngeal nerve paralysis through laryngeal stroboscopy were enrolled for this study. Acoustic and aerodynamic evaluations were performed before operation, 2 weeks and 3 months after operation. Subjective voice evaluation was performed by VHI and VOISS. Aerodynamic evaluation was compared and analysed by maximum phonation time(MPT), phonation threshold pressure(PTP), mean air flow rate(MFR), etc. Subjective voice evaluation was surveyed through VHI and VOISS. To evaluate patients' symptoms related to functional voice disorder, scores on physical domain in VHI and VOISS were selected to be compared for each session. Results: The 10 out of 20 participants who complained of voice symptoms had no significant difference with pre-operation in acoustic evaluation, but all showed higher scores on 2 weeks and 3 months after operation compared to pre-operation, in VHI-physical domain and selected questionnaires in VOISS. They reduced MPT and increased PTP value simultaneously. Laryngeal massage and breathing training were simultaneously treated to them, 5 participants resulting in improvement in MPT and PTP compared to pre-treatment. Conclusion:Patients who complained voice change with no organic damage after thyroidectomy were all shown to have reduced MPT and increased PTP in some by aerodynamic evaluations. Reduced MPT may imply some problem in air flow beneath glottis. Increased PTP suggests much more effort in vocalization mechanism than pre-operation. Comparing aerodynamic evaluations in post-thyroidectomy may provide information on behavioral interventions. Additionally, study on laryngeal massage and breathing training simultaneously treated to patients with such voice disorder is needed to be conducted with larger number of participants.

  • PDF

u-Health 시스템을 위한 음성신호 분석 기반의 간 기능 모니터링에 관한 연구 (A Study on Monitoring of Liver Function Based on Voice Signal Analysis for u-Health System)

  • 김봉현;조동욱
    • 정보처리학회논문지B
    • /
    • 제18B권6호
    • /
    • pp.389-396
    • /
    • 2011
  • 현대 사회에서 식습관의 변화, 스트레스, 음주 등으로 인해 다양한 간 질환이 발생되거나 악화되어 가고 있다. 따라서 본 논문에서는 간 질환이 음성에 미치는 영향을 연구하여 간 질환을 조기에 진단할 수 있는 방법론을 제안하였다. 이를 위해 간 질환자를 대상으로 입원했을 때와 치료로 인해 정상적으로 퇴원했을 때의 음성을 각각 수집하여 음성 분석 요소를 적용한 실험을 수행하였다. 특히, 한의학적으로 간(肝)과 관련 있는 발음인 아음(牙音)에 대한 분석 실험으로 제3포먼트 주파수 대역폭과 발음 요소값을 적용한 실험을 수행하였으며 이를 통해 간 질환이 공명강과 발성에 미치는 영향을 객관적 지표로 출력하는 연구를 행하였다. 또한 실험 결과를 기반으로 u-Health 환경에서 간 기능을 모니터링하는 시스템 설계에 관한 연구를 수행하였다.

자궁 내막증으로 치료 받은 여성들의 음성 변화 (Voice Changes in Women Treated for Endometriosis)

  • 서민철;주준범;남순열
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.46-50
    • /
    • 2000
  • Background and Objectives : Hormonal treatments which have an androgenic effect have the potential to cause vocal changes. The changes in vocal fold structure and voice quality are considered to be irreversible. To date, studies have documeted subjective vocal changes or documented single cases without detailed, baseline voice assessments. Materials and Methods : We have performed objective voice analyses of 20 women who were treated with androgenic hormones for endometriosis and compared the results with those of normal control women. Results : The averages of fundamental frequency were 194.7${\pm}$28.2 in study group, 207.0${\pm}$14.1 in control group. The means of closed quotient which were measured with electroglottography were 45.13${\pm}$2.06 in study group, 45.1${\pm}$3.03 in control group. Results of acoustic analysis are as follows. The averages of jitter were 0.95${\pm}$0.46 in study group, 1.10${\pm}$0.65 in control group. The means of shimmer were 2.44${\pm}$0.60 in study group, 2.32${\pm}$1.09 in control group. The averages of noise to harmonic ratio were 0.13${\pm}$0.028 in study group, 0.15$\pm$0.18 in control group. Conclusion : Although there were no statistically meaningful differences between the two groups, we could detect the masculinizing tendency of the therapeutic hormones of endomentriosiss(lowering of fundamental frequency). Given the availability of objective voice assessments today and the continued use of these potent hormones, comprehensive voice assessment and vocal monitoring would appear vital for women commencing hormonal treatment.

  • PDF

갑상선 기능저하 음성에 대한 청지각적 및 파열음 분석에 대한 연구 (The Perceptual and Consonant Analysis for the Voice with Hypothyroidism)

  • 한백화;이다해;김준선;홍기환
    • 대한후두음성언어의학회지
    • /
    • 제27권2호
    • /
    • pp.95-101
    • /
    • 2016
  • Background and Objectives : The main purpose of this study is to clarify perceptual and acoustic analysis for the patients with hypothyroidism after thyroidectomy especially focused on the characteristics of speech articulation with special reference to the consonant production. Materials and Methods : The subjects of the research were 40 male and female adults (males : 5, females : 35). They were all received radioactive iodine treatment which after total thyroidectomy. Voice samples were collected during the three stages of after surgery, pre-radioisotope treatment (RIT), and post-RIT. The acoustic analysis was conducted by using Pratt (ver.5.2.21) after measuring voice onset time (VOT). The subjective evaluation of the voices used CAPE-V. Results : A significant decrease in overall severity was displayed in the CAPE-V following RIT. It may be conjectured that this is connected to the change in voice following RIT. The loudness of the sound displayed a significant decrease in the CAPE-V following RIT. It is conjectured that this is connected to the decrease in vocal intensity following RIT. No statistically significant results were revealed for the comparative analysis on the voice onset time (VOT) in all plosives during the three periods. Conclusion : Perceptually, the overall severity of the voice with hypothyroidism was changed significantly before and after RIT. Eventhough VOT were not significantly changed, it tended to decrease VOT in patients with hypothyroidism.

  • PDF

Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator

  • Park, Hyung Woo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권2호
    • /
    • pp.962-977
    • /
    • 2019
  • The human voice is a convenient method of information transfer between different objects such as between men, men and machine, between machines. The development of information and communication technology, the voice has been able to transfer farther than before. The way to communicate, it is to convert the voice to another form, transmit it, and then reconvert it back to sound. In such a communication process, a vocoder is a method of converting and re-converting a voice and sound. The CELP (Code-Excited Linear Prediction) type vocoder, one of the voice codecs, is adapted as a standard codec since it provides high quality sound even though its transmission speed is relatively low. The EVRC (Enhanced Variable Rate CODEC) and QCELP (Qualcomm Code-Excited Linear Prediction), variable bit rate vocoders, are used for mobile phones in 3G environment. For the real-time implementation of a vocoder, the reduction of sound quality is a typical problem. To improve the sound quality, that is important to know the size and shape of noise. In the existing sound quality improvement method, the voice activated is detected or used, or statistical methods are used by the large mount of data. However, there is a disadvantage in that no noise can be detected, when there is a continuous signal or when a change in noise is large.This paper focused on finding a better way to decrease the reduction of sound quality in lower bit transmission environments. Based on simulation results, this study proposed a preprocessor application that estimates the SNR (Signal to Noise Ratio) using the spectral SNR estimation method. The SNR estimation method adopted the IMBE (Improved Multi-Band Excitation) instead of using the SNR, which is a continuous speech signal. Finally, this application improves the quality of the vocoder by enhancing sound quality adaptively.

A study of L1 phonetic drift in the voice onset times of Korean learners of English with long L2 exposure

  • Kim, Mi-Ryoung
    • 말소리와 음성과학
    • /
    • 제11권4호
    • /
    • pp.35-43
    • /
    • 2019
  • This study examines the voice onset times (VOTs) of Korean stops produced by Korean learners of English with high language proficiency and long L2 exposure (i.e., Korean-English bilinguals) to assess whether the VOTs of their lax and aspirated stops are merging and, if so, which types of stops are being changed. Thirteen Korean speakers (six female and seven male) who had studied in the USA for more than three to ten years participated. The results show that the speakers in this study with long L2 exposure are participating in the VOT merger, in which VOTs for aspirated stops are reduced while those for lax stops are increased. In other words, change in VOT affects not only aspirated stops but also lax stops. The results indicate that L1 phonetic drift may not be primarily affected by the amount of L2 exposure, and language contact may not be the primary factor triggering a sound change in the Korean stop system. Further study is necessary focusing on the phonetic shift of the "lax" category because it may play a pivotal role in a tonogenetic-like sound change in present-day Korean.