• Title/Summary/Keyword: Multi-Dimensional Voice Program

Search Result 34, Processing Time 0.021 seconds

A Study of Extracting Acoustic Parameters for Individual Speakers (개별화자의 음성파라미터 추출에 관한 연구: 음성파라미터의 상관관계를 중심으로)

  • Ko, Do-Heung
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.129-143
    • /
    • 2003
  • Fundamental frequency (Fo), jitter, shimmer, and harmonics-to-noise ratio (NHR) have been measured to see their interactions between the parameters using Multi-Dimensional Voice Program (MDVP). 100 Korean normal adults (50 males and 50 females) ranging from their early 20's to their early 30's produced the eight sustained vowels including /a/, /i/, /u/, /c/, /e/,/$\varepsilon$/, /i/, and /e/. The subjects were asked to read the above vowels five times in isolation with the interval of five seconds, respectively. Male voices, on the average, showed 130.7 Hz in Fo, 0.6696% in jitter, 1.8151% in shimmer, and 0.12 in NHR, while female voices showed 232.8 Hz in Fo, 0.9222% in jitter, 1.9199% in shimmer, and 0.1098 in NHR. As to the correlation coefficient, it was found that for male speakers jitter vs. shimmer, shimmer vs. NHR, Fo vs. shimmer, and Fo vs. NHR are statistically significant. It was found that for female subjects jitter vs. shimmer and Fo vs. shimmer are statistically significant. However, it is concluded that the correlation coefficient in females are not meaningful in a practical way though they are all statistically significant.

  • PDF

Analysis of the Effect of Intralesional Steroid Injection on the Voice During Laryngeal Microsurgery (후두 미세수술 중 병변 내 스테로이드 주입이 음성에 미치는 효과 분석)

  • Jae Seon, Park;Hyun Seok, Kang;In Buhm, Lee;Sung Min, Jin;Sang Hyuk, Lee
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.33 no.3
    • /
    • pp.166-171
    • /
    • 2022
  • Background and Objectives Vocal fold (VF) scar is known to be the most common cause of dysphonia after laryngeal microsurgery (LMS). Steroids reduce postoperative scar formation by inhibiting inflammation and collagen deposition. However, the clinical evidence of whether steroids are helpful in reducing VF scar formation after LMS is still lacking. The purpose of this study is to determine whether intralesional VF steroid injection after LMS helps to reduce postoperative scar formation and voice quality. Materials and Method This study was conducted on 80 patients who underwent LMS for VF polyp, Reinke's edema, and leukoplakia. Among them, 40 patients who underwent VF steroid injection after LMS were set as the injection group, and patients who had similar sex, age, and lesion size and who underwent LMS alone were set as the control group. In each group, stroboscopy, multi-dimensional voice program, Aerophone II, and voice handicap index (VHI) were performed before and 1 month after surgery, and the results were statistically analyzed. Results There were no statistically significant differences in the distribution of sex, age, symptom duration, occupation and smoking status between each group. Both groups consisted of VF polyp (n=21), Reinke's edema (n=11), and leukoplakia (n=9). On stroboscopy, the lesion disappeared after surgery, and the amplitude and mucosal wave were symmetrical on both sides of the VFs in all patients. Acoustic parameters and VHI significantly improved after surgery in all patients. However, there was no significant difference between the injection and control group in most of the results. Conclusion There was no significant difference in the results of stroboscopy, acoustic, aerodynamic, and subjective evaluation before and after surgery in the injection group and the control group.

The Efficacy of Percutaneous Steroid Injection via Cricothyroid Membrane for Reinke's Edema (라인케씨 부종 환자에서 경윤상 갑상막 접근을 통한 성대 내 스테로이드 주입술의 효용)

  • Nam, Woojoo;Kim, Sun Woo;Jin, Sung Min;Lee, Sang Hyuk
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.30 no.2
    • /
    • pp.101-106
    • /
    • 2019
  • Background and Objectives Reinke's edema is a benign vocal fold disease caused by an edematous laryngeal superficial layer of lamina propria. The first line treatment is cessation of smoking and laryngeal microsurgery. The aim of the study is to evaluate the feasibility and efficacy of percutaneous steroid injection via cricothyroid membrane in patients with Reinke's edema. Materials and Method From Jan 2010 to July 2018, 33 Patients with Reinke's edema managed by vocal fold steroid injection via the cricothyroid membrane were included in this study. We compared medical records of laryngoscopy, stroboscopy and Multi-Dimensional Voice Program analysis at pre-treatment and post-treatment. Subjective voice improvement was evaluated using Voice Handicap Index-30 (VHI-30). Results 75.7% of the patients showed partial response and 6.06% showed complete response. 93.94% were present smokers and only 4 patients ceased smoking after the treatment. In acoustic analysis, the pre-treatment mean value of jitter, shimmer, and noise to harmonic ratio was 2.30±3.21, 9.34±10.37, 1.11±2.90 each. The post-treatment value was 2.20±1.89, 6.96±5.30, 0.20±0.09 respectively and none of the parameters were statistically significant. For subjective symptom improvement, 25 (75.8%) patients showed a better score on post-treatment VHI-30 compared to pre-treatment. Conclusion According to our study, steroid injection is a relatively safe and effective procedure for patients with Reinke's edema. A vocal fold steroid injection via the cricothyroid membrane can be an alternative treatment option for those who are not able to undergo conventional laryngeal microscopic surgery, however cessation of smoking is necessary for effective treatment.

Feasibility of Deep Learning Algorithms for Binary Classification Problems (이진 분류문제에서의 딥러닝 알고리즘의 활용 가능성 평가)

  • Kim, Kitae;Lee, Bomi;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.1
    • /
    • pp.95-108
    • /
    • 2017
  • Recently, AlphaGo which is Bakuk (Go) artificial intelligence program by Google DeepMind, had a huge victory against Lee Sedol. Many people thought that machines would not be able to win a man in Go games because the number of paths to make a one move is more than the number of atoms in the universe unlike chess, but the result was the opposite to what people predicted. After the match, artificial intelligence technology was focused as a core technology of the fourth industrial revolution and attracted attentions from various application domains. Especially, deep learning technique have been attracted as a core artificial intelligence technology used in the AlphaGo algorithm. The deep learning technique is already being applied to many problems. Especially, it shows good performance in image recognition field. In addition, it shows good performance in high dimensional data area such as voice, image and natural language, which was difficult to get good performance using existing machine learning techniques. However, in contrast, it is difficult to find deep leaning researches on traditional business data and structured data analysis. In this study, we tried to find out whether the deep learning techniques have been studied so far can be used not only for the recognition of high dimensional data but also for the binary classification problem of traditional business data analysis such as customer churn analysis, marketing response prediction, and default prediction. And we compare the performance of the deep learning techniques with that of traditional artificial neural network models. The experimental data in the paper is the telemarketing response data of a bank in Portugal. It has input variables such as age, occupation, loan status, and the number of previous telemarketing and has a binary target variable that records whether the customer intends to open an account or not. In this study, to evaluate the possibility of utilization of deep learning algorithms and techniques in binary classification problem, we compared the performance of various models using CNN, LSTM algorithm and dropout, which are widely used algorithms and techniques in deep learning, with that of MLP models which is a traditional artificial neural network model. However, since all the network design alternatives can not be tested due to the nature of the artificial neural network, the experiment was conducted based on restricted settings on the number of hidden layers, the number of neurons in the hidden layer, the number of output data (filters), and the application conditions of the dropout technique. The F1 Score was used to evaluate the performance of models to show how well the models work to classify the interesting class instead of the overall accuracy. The detail methods for applying each deep learning technique in the experiment is as follows. The CNN algorithm is a method that reads adjacent values from a specific value and recognizes the features, but it does not matter how close the distance of each business data field is because each field is usually independent. In this experiment, we set the filter size of the CNN algorithm as the number of fields to learn the whole characteristics of the data at once, and added a hidden layer to make decision based on the additional features. For the model having two LSTM layers, the input direction of the second layer is put in reversed position with first layer in order to reduce the influence from the position of each field. In the case of the dropout technique, we set the neurons to disappear with a probability of 0.5 for each hidden layer. The experimental results show that the predicted model with the highest F1 score was the CNN model using the dropout technique, and the next best model was the MLP model with two hidden layers using the dropout technique. In this study, we were able to get some findings as the experiment had proceeded. First, models using dropout techniques have a slightly more conservative prediction than those without dropout techniques, and it generally shows better performance in classification. Second, CNN models show better classification performance than MLP models. This is interesting because it has shown good performance in binary classification problems which it rarely have been applied to, as well as in the fields where it's effectiveness has been proven. Third, the LSTM algorithm seems to be unsuitable for binary classification problems because the training time is too long compared to the performance improvement. From these results, we can confirm that some of the deep learning algorithms can be applied to solve business binary classification problems.