• Title/Summary/Keyword: voice parameter

Search Result 179, Processing Time 0.022 seconds

Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation (외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석)

  • Kim, Bong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.8
    • /
    • pp.1955-1961
    • /
    • 2013
  • Paranasal sinuses of the empty space is filled with air that exists in the bones in the face. However, the pus becomes inflamed paranasal sinuses sinusitis onset brings the voice of change, and complained of headaches and lethargy. Therefore, in this paper, paranasal sinuses related diseases to predict voice analysis parameter as measured by changes in paranasal sinuses through external stimuli is investigated and carried out a study to analysis the function consisting of the frontal sinus, ethmoid sinus, maxillary sinus, sphenoid sinus. From this, cold pack stimulation in the paranasal sinus area for stimulation before and after voice was performed by measuring formant frequency and external stimuli through correlation analysis of the mutual impact on paranasal sinuses were analyzed.

Deep Learning based Singing Voice Synthesis Modeling (딥러닝 기반 가창 음성합성(Singing Voice Synthesis) 모델링)

  • Kim, Minae;Kim, Somin;Park, Jihyun;Heo, Gabin;Choi, Yunjeong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.127-130
    • /
    • 2022
  • This paper is a study on singing voice synthesis modeling using a generator loss function, which analyzes various factors that may occur when applying BEGAN among deep learning algorithms optimized for image generation to Audio domain. and we conduct experiments to derive optimal quality. In this paper, we focused the problem that the L1 loss proposed in the BEGAN-based models degrades the meaning of hyperparameter the gamma(𝛾) which was defined to control the diversity and quality of generated audio samples. In experiments we show that our proposed method and finding the optimal values through tuning, it can contribute to the improvement of the quality of the singing synthesis product.

  • PDF

The Correlation between The Size and Location of Vocal Polyp and Voice Quality, Before and After Laryngeal Microsurgery (후두미세수술 전후 성대 용종의 크기 및 위치가 음성의 질의 변화에 미치는 영향)

  • Han, Won Gue;Kim, Min-Su;Oh, Kyung Ho;Woo, Jeung Soo;Jung, Kwang Yoon;Kwon, Soon Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.27 no.2
    • /
    • pp.102-107
    • /
    • 2016
  • Background and Objectives : Vocal polyps are caused by inflammation induced by stress or irritation. Many patients with vocal polyps complain voice discomfort. For vocal polyps, surgery such as laryngeal microsurgery has been the mainstay of management. We analyzed the clinical features of vocal polyps, and how the size and location of vocal polyps affect the outcomes of surgery. Methods : We retrospectively reviewed 42 patients from March 2014 to December 2015, who were diagnosed as unilateral single vocal polyp. When we operated on a vocal polyp with laryngeal microscopy, we measured their size and location. The quality of voice was evaluated by GRABS scale, jitter, shimmer, NHR (noise to harmonic ratio), MPT (maximum phonation time), and VHI (voice handicap index) before operation and 4 weeks after operation. Results : When we divided the patients into large-sized vocal polyp group (the longest length >3 mm) and small-sized vocal polyp group (the longest length ${\leq}3mm$), all parameter differences tend to be greater at large sized vocal polyp. However, these differences were not statistically significant (p>0.05). When we divided into two groups depending on the volume of vocal polyp, no distinct tendency was found. When we compared the location (anterior, mid and posterior) of vocal polyp with the improvement of voice quality, more change was found at mid portion vocal polyp, except the difference of VHI. However, these differences were also not statistically significant (p>0.05). Conclusion : All parameter differences tend to be greater at large vocal polyp and polyp of the mid location.

  • PDF

The Analysis of Tracheoesophageal Voice after Near-Total Laryngectomy and Implantation of Provox Prosthesis (후두근전적출술과 Provox 삽입술 후 기관식도발성에 관한 연구)

  • Choi, In-Ja;Choi, Young-Soo;Kim, Jin-Hwan;Ahn, Hwoe-Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.2
    • /
    • pp.141-144
    • /
    • 2004
  • Background and Objectives : To compare acoustic, aerodynamic analysis of voice and intelligibility score in patients with near-total laryngectomy and implantation of Provox prothesis. Material and Methods : In order to evaluate the voice characteristics, acoustic, aerodynamic parameter and speech intelligibility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total bility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total laryngectomy and 10 adults normal speaker. Acoustic analysis was carried out using CSL and aerodynamic analysis was carried out using Aerophon II. Speech sample was recorded and 10 listener was scored for speech intelligibility using a percentage of words correctly identified. Results. Fundamental frequency($F_0$), intensity, jitter, shimmer, maximal phonation time(MPT), subglottic air pressure were used for parameters for voice analysis. There were no significant difference between two group except on fundamental frequency and shimmer. The fundamental frequency was higher in patients with near-total laryngectomy and shimmer was higher in patients after implantation of Provox prosthesis with total laryngectomy. In addition, speech intelligibility was no significant difference between two groups. Conclusion : This results confirm that near-total laryngectomy and implantation of Provox prosthesis provides good voice rehabilitation.

  • PDF

The Analysis of Voice after Vertical Partial Laryngectomy with Mucosal Flap and Fat Graft Reconstruction (수직후두부분절제술 및 점막 피판과 지방 이식을 통한 성대 재건술 후의 음성분석)

  • Chu, Hyung-Ro;Choi, In-Ja;Kim, Jin-Hwan;Ahn, Hwoe-Young;Rho, Young-Soo
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.18 no.2
    • /
    • pp.134-137
    • /
    • 2007
  • Background and Objectives: The goals of laryngeal reconstruction have been prevention of aspiration, production of a functional voice, and maintenance of an adequate airway for decannulation. It is generally believed that the reconstruction of the glottic region after vertical partial laryngectomy (VPL) can improve laryngeal function. The objective of this study is to evaluate of voice function after VPL with mucosal flap and fat graft reconstruction. Materials and Methods: From 1994 to 2006, 13 patients, who had been treated with VPL with mucosal flap and fat graft reconstruction. The voice characteristics, acoustic, aerodynamic parameter were measured in 13 patients after vertical partial laryngectomy with mucosal flap and fat graft reconstruction. Acoustic analysis was carried out using Computerized Speech Lab (CSL) and aerodynamic analysis were carried out using Aerophon II,3 months and 12 months after surgery. Results: The GRBAS scale, jitter, shimmer, NHR were improved as time goes on after surgery. But, maximum phonation time was shortened after surgery and there is no significant differences between before and after surgery in mean flow rate. Conclusion: The voice function of the mucosal flap and fat graft reconstruction after VPL were satisfactory. This can be an excellent reconstruction method after vertical partial laryngectomy.

  • PDF

A Performance Analysis of VoIP in the FMC Network to provide QoE for users (융합 망에서 사용자에게 QoE를 제공하기 위한 VoIP 성능 분석)

  • Lee, Kyu-Hwan;Oh, Sung-Min;Kim, Jae-Hyun
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.3B
    • /
    • pp.398-407
    • /
    • 2010
  • Due to increase of user requirement for various traffics and the advance of network technology, each distinct network has converge into FMC(Fixed Mobile Convergence) networks. However, we need to research the performance analysis of VoIP(Voice over Internet Protocol) in the FMC network to provide QoE for the voice user of FMC network. Therefore, this paper introduces the scenario which is the situation of voice quality degradation when a user uses VoIP to communicate with other users in the FMC network. Especially, this paper presents scenario in terms of the component of the network and finds the improvement point of voice quality. In the simulation results, three improvement points of voice quality are found as following: voice quality degradation by packet loss in the physical layer of the HSDPA network, by utilizing GGSN without QoS parameter mapping mechanism which is gateway between 3GPP and IP backbone, and by using non-QoS AP in the WLAN network.

Acoustic analysis of wet voice among patients with swallowing disorders (삼킴장애 환자의 wet voice 관련 음향학적 분석)

  • Kang, Young Ae;Koo, Bon Seok;Kwon, In Sun;Seong, Cheoljae
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.147-154
    • /
    • 2018
  • Wet voice quality (WVQ) is a characteristic that appears after swallowing. Although the concept is accepted by many clinicians worldwide, it is nevertheless ambiguous. In this study, we investigated WVQ in patients with swallowing disorders using acoustic analysis. A total of 106 patients diagnosed with penetration-aspiration by the videofluoroscopic swallowing study (VFSS) were recruited. A voice recording of vowel /a/ was conducted before and after the VFSS, and an acoustic analysis was then performed using PRAAT. Voice after VFSS was used for a perceptual judgment and divided into two groups: the Wet group (48 patients) and the Non-wet group (58 patients). At the post-VFSS stage, the two groups displayed significant differences in many acoustic parameters including F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP. The parameter affecting judging wetness resulted into Jitter and NHR by the logistic regression test. At the pre-VFSS stage, the two groups differed significantly in many acoustic parameters including Intensity, Jitter, RAP, Shimmer, NHR, FUF, DVB, and CPP. Both pre-and post-VFSS, the mean values of all significant parameters, except Intensity, HNR, and CPP, were higher in the Wet group. According to pre-and post-VFSS, the two groups displayed interactions in many parameters (Intensity, F0_SD, Jitter, RAP, Shimmer, APQ, HNR, NHR, FUF, DVB, and CPP). In particular, Intensity increased in both groups after the VFSS, although the increase in the Non-wet group was greater. Based on these results, it was conjectured that the WVQ after swallowing resulted from the secretion effect of the mucous membrane due to the dry laryngeal characteristic of elderly patients, rather than aspiration resulting in food on the vocal cords.

The Effect of Auditory Condition on Voice Parameter of Teacher (청각 환경이 교사의 음성 파라미터에 미치는 영향)

  • Lee Ju-Young;Baek Kwang-Hyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.5
    • /
    • pp.207-212
    • /
    • 2006
  • The purpose of this study was to compare voice parameters in auditory conditions (normal/noise/music) between a teacher group and a control group. Results of statistical analysis showed that the teacher group had higher jitter (%) and shimmer (%) values than the control group. It indicated that the teacher group had larger variations in pitch and dynamic of their voice. In the teacher group, the voice under noisy condition showed a higher value of fundamental frequency than that under normal condition. though its fundamental frequency did not show any significant difference between the noisy condition and the musical condition. In the control group, however, although the voice under noisy condition also showed a higher value of fundamental frequency than that under normal condition, its fundamental frequency was significantly different between the noisy condition and the musical condition.

The Effect of An Increase of Closed Quotient on Improvement of Voice Quality after Type I Thyroplasty in Patients with Unilateral Vocal Cord Paralysis (일측 성대마비 환자에서 성대내전술 후 성대접촉율의 증가가 음질 개선에 미치는 영향)

  • Kim, Han-Su;Choi, Seung-Hee;Lim, Jae-Yol;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.15 no.1
    • /
    • pp.16-20
    • /
    • 2004
  • Purpose : To assess perceptual, acoustic and aerodynamic measure of voice quality in patients with unilateral vocal cord paralysis before and after type I thyroplasty. Methods : The clinical records of patients operated type I thyroplasty in the Departement of otorhinoalryngolgy, Yongdong Severance hospital from November 2001 to November 2003 were reviewed. All patients uderwent a vocal function evaluation including perceptual, acoustic and aerodynamic measures of voice preoperative and on $60^{th}$ postoperative day. The perceptual and acoustic measures were obtained from recording of patients' reading a 'Sanchak' passage. The perceptual evaluation was performed by 2 speech pathologist using a 4-point rating scale. Acoustic parameters(voice range profile low(RAL), voice range profile high(RAH), average fundamental frequency(AFX), closed quotient, harmonic to noise ratio, jitter and shimmer) were investigated by Lx speech studio. Mean flow rate(MFR), subglottic pressure(Psub) and intensity were measured using the Phonatory function analyzer. The maximum phonation time was also measured. The data were statistically analyzed. A paired t-test (p<0.1) was used to compare preoperative and postoperative results. And multiple regression test was used to find which parameter was most correlated to improvement of postoperative voice quality. Results : Among aerodynamic parameters, Psub $(88.11mmH_2O{\rightarrow}58.7mmH_2O)$, MPT(7.87sec${\rightarrow}$12.53sec), MFR (359.8ml/sec${\rightarrow}$161.06ml/sec) were statistically improved. AFx(205.5Hz${\rightarrow}$163.27Hz), AQx(23.9%${\rightarrow}$48.3%), RAL, RAH. Jotter and shimmer were improved. In multiple regression test, AFx and AQx was noted as the two meost correlated parameters to improvement of postoperative breathiness. But general grade of voice quality was more correlated to Psub and shimmer. Conclusion : Vocal fold medialization procedures effectively reduce glottic gap. Increasing of contact area of both vocal folds induced improvement in aerodynamic parameters and leaded stabilizing of vocal fold vibration. That effect results in improvement in acoustic parameters (shimmer, jitter, signal-to-noise ratio, voice range profile) and voice quality.

  • PDF