• Title/Summary/Keyword: Voice Analysis

Search Result 1,163, Processing Time 0.04 seconds

Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science (Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석)

  • 최홍식;김성수
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.12 no.2
    • /
    • pp.115-120
    • /
    • 2001
  • In this paper, we synthesized vowel /ae/ with voice synthesis program of Dr. Speech Science, and we also synthesized pathologic vowel /ae/ by some parameters such as high frequency gain (HFG), low frequency gain(LFG), pitch flutter(PF) which represents jitter value and flutter of amplitude(FA) which represents shimmer value, and grade ranked as mild, moderate and severe respectively. And then we analysed all pathologic voice by analysis program of Dr. Speech Science. We expect that this synthesized pathologic voices are useful for understanding the parameter such as noise, jitter and shimmer and feedback effect to patient with voice disorder.

  • PDF

Identification of Voice for Listeners who Feel Favor Using Voice Analysis (음성 분석을 이용한 청자가 호감을 느끼는 목소리에 대한 규명)

  • Choi, Ji Hyun;Cho, Dong Uk;Jeong, Yeon Man
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.1
    • /
    • pp.122-131
    • /
    • 2016
  • In the smart societies, such as the current unlike in the past, the voice that listeners will feel favor is changing through the development of ICT technologies and infrastructure. In other words, in the past, loud, intensive and fast voice is a favorite but now a new social and cultural situation that is changing them with ICT technologies. Now, this becomes one of the important things that we clarify 'Is it a voice that feels a favor?'. For this, in this paper, we identified what voice that listeners feel favor by applying ICT technologies. Studies were carried out to proceed largely divided into two categories. Firstly, as the quantified data, we extracted the impact on favorable feeling of listeners which related with emotional speech by empirical analysis work. To do this, we performed the experiment for the public. Secondly, we identified what kind of voice which listeners feel a good impression. For this, we identified voice characteristics that there are people who are influential in the real society. Also, we extracted both the voice characteristics of each influential people and common voice characteristics. In addition, we want to overcome the problems of qualitative methods that have originally limitations in objective respects which is significant to the voice analysis. For this, we performed the experiments of the voice analysis by numerical and visual approaches.

A Correlation Study among Pitch, Nasalance, and Voice Quality (정상 성인의 음도, 비성도, 음질 간의 상관 연구)

  • Park, Sung-Jong;Yoo, Jae-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.159-163
    • /
    • 2009
  • The purpose of this study is to conduct a correlational analysis among pitch, nasalance, and acoustic quality parameters estimated by two speech analysis softwares NasalView(version 1.31), Dr. Speech 4.5(Tiger Electronics). Thirty females and 25 males with normal voice participated in the study. The Pearson correlation coefficient was determined through a statistical analysis. The results came out as follows; Firstly, there was a correlation between $F_0$ and voice quality parameters, however there was no correlation between $F_0$ and nasalance. Secondly, nasalance showed a correlation with voice quality parameters.

  • PDF

The Effect of Voice Therapy for Functional Voice Disorder (기능적 음성장애 환자에서의 음성치료의 효과)

  • 정성민;조윤희;홍순관;변성완;김은아;손지연;박애경
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.13 no.2
    • /
    • pp.145-150
    • /
    • 2002
  • Background and Objectives : Patients with so-called 'functional voice disorders' who have structurally normal larynges and demonstrate muscle misuse in the larynx, and those with several interacting causes including habitual muscle tension, are probably better defined as having a 'muscle misuse voice disorder'. The purpose of this study was to analyze the voice and effectiveness of voice therapy in patients with functional voice disorders and to provide a guide for the treatment of functional voice disorder. Materials and Method : The records of 35 patients, presenting with functional voice disorder and receiving voice therapy during October, 2001 to September, 2002, were reviewed. Prior to voice therapy, the stroboscopic examination of their larynx, aerodynamic and acoustic analysis was done. The results of voice therapy were compared according to the patient's subjective, perceptual evaluation of voice, and maximal phonation time. Results : Patient's subjective, perceptual evaluation, and maximal phonation time showed superior results after voice therapy. Conclusion : The result of this study indicates that voice therapy is an effective treatment method of patients with functional voice disorder, especially muscular tension dysphonia.

  • PDF

A Clinical Study of Predicable Factors of Voice Therapy Effect in Vocal Nodule Patients (성대결절 환자에서 음성치료 효과를 예측할 수 있는 인자에 대한 연구)

  • Woo, Joo-Hyun;Baek, Min-Kwan;Kim, Dong-Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.20 no.1
    • /
    • pp.52-56
    • /
    • 2009
  • Background and Objectives : Vocal nodule is common inflammatory vocal cord lesion which could be improved by voice rest or voice therapy. But some patients, who do not have any improvement after voice therapy, should take laryngomicorsurgery or additional long-term voice therapy. So we try to find prognostic factors which affect the results of voice therapy. Materials and Methods: There are 36 patients (response group) whose symptoms improved after initial voice therapy and 16 patients (no response group) whose symptoms did not improve at all. We compared clinical features (durations of symptoms, voice abuse, laryngopharyngeal reflux), GRBAS scale, acoustic analysis, aerodynamic analysis and voice handicap index between the two groups from January, 2006 to June, 2008. Results: Response group underwent voice therapy 4.5 times (ave.) and no response group underwent 6.7 times (ave.). No response group has longer duration of symptoms, higher GRBAS scale score, higher NIH ratio, and higher MFR than those of response group. Conclusion : This study found that the prognosis of voice therapy in patients who have longer duration of symptoms, high NIH ratio, and bad perceptional test result is not likely to be good. In those cases, we should recommend earlier surgery, voice therapy after surgery, and inform about the necessity of long-term voice rehabilitation or voice therapy in order to get favorable compliance.

  • PDF

Correlation analysis of voice characteristics and speech feature parameters, and classification modeling using SVM algorithm (목소리 특성과 음성 특징 파라미터의 상관관계와 SVM을 이용한 특성 분류 모델링)

  • Park, Tae Sung;Kwon, Chul Hong
    • Phonetics and Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.91-97
    • /
    • 2017
  • This study categorizes several voice characteristics by subjective listening assessment, and investigates correlation between voice characteristics and speech feature parameters. A model was developed to classify voice characteristics into the defined categories using SVM algorithm. To do this, we extracted various speech feature parameters from speech database for men in their 20s, and derived statistically significant parameters correlated with voice characteristics through ANOVA analysis. Then, these derived parameters were applied to the proposed SVM model. The experimental results showed that it is possible to obtain some speech feature parameters significantly correlated with the voice characteristics, and that the proposed model achieves the classification accuracies of 88.5% on average.

Study of Event Recorder with Recording Voice Communication (음성 통화 저장 기능을 제공하는 고속전철용 Event Recorder 연구)

  • Song, Gyu-Youn;Lee, Sang-Nam;Ryu, Hee-Moon;Paik, Jin-Sung
    • Proceedings of the KSR Conference
    • /
    • 2008.06a
    • /
    • pp.1962-1967
    • /
    • 2008
  • A event recorder system stores a train speed and the related information for train operation in real time. Using those information, we can analysis the train operation and the reason of train accident. Currently the event recorder only manipulate the data related the train operation mechanically and electrically. In this paper we propose the event recorder to record the voice communication between the manager in the control center and train operator. By recording the voice communication in the high speed train, the correctness of analysis of train accident can be increased. The system architecture of the event recorder with voice recording is studied and interface between other equipment is proposed. And the software architecture of new event recorder is developed. We study the method of converting analog voice signal into digital data and compressing method. Also the architecture of memory to store the compressed voice data and regeneration of original analog voice are studied.

  • PDF

A Study of the SPR (Singing Power Ratio) on the Singing Voice in Singing Students (성악 전공 학생의 가칭 시 음성의 SPR(Singing Power Ratio)에 관한 연구)

  • Jo, Sung-Mi;Jeong, Ok-Ran;Lee, Sang-Ouk
    • Speech Sciences
    • /
    • v.11 no.4
    • /
    • pp.121-127
    • /
    • 2004
  • This study attempted to provide a spectrum analysis for quantitative evaluation of singing voice quality of singing students rather than the presence or absence of the singer's formant. The regression analysis was used to analyse the relationship between ringing quality, SPR, and SPP of singing voice of college student subjects majoring in music. This study measured singing. power ratio (SPR) in 41 singing students. Digital audio recordings were made in sung vowels for acoustic analyses. Each sample was judged by 1 experienced singing teacher and 4 voice pathologists on one semantic bipolar 7-point scales (ringing-dull). The results showed that the SPR and SPP had significant correlations with ringing quality. The SPR had a significant relationship with ringing quality on singing voice in singing students. The SPR can be an important quantitative measurement for evaluating singing voice quality.

  • PDF

A Study of Voice Improvement According to the Onset Time of Voice Therapy after Laryngomicrosurgery (레이저를 이용하여 후두미세수술을 시행한 환자에서 음성치료를 시작한 시기에 따른 음성 호전 결과에 관한 연구)

  • 김한균;정필상;오양희;김영훈
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.12 no.1
    • /
    • pp.22-27
    • /
    • 2001
  • Backgrounds and Objectives : There have been reported many studies which evaluate the effectiveness of combined laryngomicrosurgery(LMS) and voice therapy for the patients with benign vocal cord lesions. But the difference of voice improvement by onset time of voice therapy has not been reported. The purpose of this study is to analyze the differences of voice improvement by voice analysis test between the two groups with different onset time of voice therapy. Materials and Methods : Two groups, each of which comprises 15 patients, were analyzed. For the one group, the voice therapy was initiated 1 day after LMS. For the other, the therapy was initiated 1 week after LMS. Voice analytic parameters of the two groups were statistically analized to identify difference in voice improvement. Results : All measured parameters improved after voice therapy in two groups and showed no significant difference between two groups. Conclusions : The onset time of voice therapy after LMS has no significant impact on post-operative voice quality in the patients with benign vocal cord lesions. Early onset of post-operative voice therapy may serve as treatment modality for patients with benign vocal cord lesions.

  • PDF

Laryngeal Cancer Screening using Cepstral Parameters (켑스트럼 파라미터를 이용한 후두암 검진)

  • 이원범;전경명;권순복;전계록;김수미;김형순;양병곤;조철우;왕수건
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.14 no.2
    • /
    • pp.110-116
    • /
    • 2003
  • Background and Objectives : Laryngeal cancer discrimination using voice signals is a non-invasive method that can carry out the examination rapidly and simply without giving discomfort to the patients. n appropriate analysis parameters and classifiers are developed, this method can be used effectively in various applications including telemedicine. This study examines voice analysis parameters used for laryngeal disease discrimination to help discriminate laryngeal diseases by voice signal analysis. The study also estimates the laryngeal cancer discrimination activity of the Gaussian mixture model (GMM) classifier based on the statistical modelling of voice analysis parameters. Materials and Methods : The Multi-dimensional voice program (MDVP) parameters, which have been widely used for the analysis of laryngeal cancer voice, sometimes fail to analyze the voice of a laryngeal cancer patient whose cycle is seriously damaged. Accordingly, it is necessary to develop a new method that enables an analysis of high reliability for the voice signals that cannot be analyzed by the MDVP. To conduct the experiments of laryngeal cancer discrimination, the authors used three types of voices collected at the Department of Otorhinorlaryngology, Pusan National University Hospital. 50 normal males voice data, 50 voices of males with benign laryngeal diseases and 105 voices of males laryngeal cancer. In addition, the experiment also included 11 voices data of males with laryngeal cancer that cannot be analyzed by the MDVP, Only monosyllabic vowel /a/ was used as voice data. Since there were only 11 voices of laryngeal cancer patients that cannot be analyzed by the MDVP, those voices were used only for discrimination. This study examined the linear predictive cepstral coefficients (LPCC) and the met-frequency cepstral coefficients (MFCC) that are the two major cepstrum analysis methods in the area of acoustic recognition. Results : The results showed that this met frequency scaling process was effective in acoustic recognition but not useful for laryngeal cancer discrimination. Accordingly, the linear frequency cepstral coefficients (LFCC) that excluded the met frequency scaling from the MFCC was introduced. The LFCC showed more excellent discrimination activity rather than the MFCC in predictability of laryngeal cancer. Conclusion : In conclusion, the parameters applied in this study could discriminate accurately even the terminal laryngeal cancer whose periodicity is disturbed. Also it is thought that future studies on various classification algorithms and parameters representing pathophysiology of vocal cords will make it possible to discriminate benign laryngeal diseases as well, in addition to laryngeal cancer.

  • PDF