Search | Korea Science

Laryngeal Cancer Screening using Cepstral Parameters (켑스트럼 파라미터를 이용한 후두암 검진)

이원범;전경명;권순복;전계록;김수미;김형순;양병곤;조철우;왕수건
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.14 no.2
- /
- pp.110-116
- /
- 2003
Background and Objectives : Laryngeal cancer discrimination using voice signals is a non-invasive method that can carry out the examination rapidly and simply without giving discomfort to the patients. n appropriate analysis parameters and classifiers are developed, this method can be used effectively in various applications including telemedicine. This study examines voice analysis parameters used for laryngeal disease discrimination to help discriminate laryngeal diseases by voice signal analysis. The study also estimates the laryngeal cancer discrimination activity of the Gaussian mixture model (GMM) classifier based on the statistical modelling of voice analysis parameters. Materials and Methods : The Multi-dimensional voice program (MDVP) parameters, which have been widely used for the analysis of laryngeal cancer voice, sometimes fail to analyze the voice of a laryngeal cancer patient whose cycle is seriously damaged. Accordingly, it is necessary to develop a new method that enables an analysis of high reliability for the voice signals that cannot be analyzed by the MDVP. To conduct the experiments of laryngeal cancer discrimination, the authors used three types of voices collected at the Department of Otorhinorlaryngology, Pusan National University Hospital. 50 normal males voice data, 50 voices of males with benign laryngeal diseases and 105 voices of males laryngeal cancer. In addition, the experiment also included 11 voices data of males with laryngeal cancer that cannot be analyzed by the MDVP, Only monosyllabic vowel /a/ was used as voice data. Since there were only 11 voices of laryngeal cancer patients that cannot be analyzed by the MDVP, those voices were used only for discrimination. This study examined the linear predictive cepstral coefficients (LPCC) and the met-frequency cepstral coefficients (MFCC) that are the two major cepstrum analysis methods in the area of acoustic recognition. Results : The results showed that this met frequency scaling process was effective in acoustic recognition but not useful for laryngeal cancer discrimination. Accordingly, the linear frequency cepstral coefficients (LFCC) that excluded the met frequency scaling from the MFCC was introduced. The LFCC showed more excellent discrimination activity rather than the MFCC in predictability of laryngeal cancer. Conclusion : In conclusion, the parameters applied in this study could discriminate accurately even the terminal laryngeal cancer whose periodicity is disturbed. Also it is thought that future studies on various classification algorithms and parameters representing pathophysiology of vocal cords will make it possible to discriminate benign laryngeal diseases as well, in addition to laryngeal cancer.
PDF

Voice quality of normal elderly people after a 3oz water-swallow test: An acoustic analysis (3온스 물 삼킴검사 이후 정상 노년층의 음질 변화: 음향학적 분석)

Lee, Sol Hee;Choi, Hong-Shik;Choi, Seong-Hee;Kim, HyangHee
- Phonetics and Speech Sciences
- /
- v.10 no.2
- /
- pp.69-76
- /
- 2018
The elderly are at increased risk of developing dysphagia due to aging and illnesses. The aim of the current study was to analyze, via an acoustic study, the change in the voice quality of normal elderly people after a 3oz water-swallow test. Subjects included a group of 60 normal elderly people (age: $mean{\pm}SD=76.9{\pm}6.66$) and 60 healthy young adults (age: $mean{\pm}SD=25.1{\pm}2.36$). Every participant produced a five-second /a/ phonation pre- and post-swallowing, and the fractioned two-second sections were analyzed using the MDVP (multi dimensional voice program) analysis. The elderly group demonstrated a post-swallowing increase in the following related acoustic parameters: fundamental frequency, fundamental frequency variation, amplitude-variation, and noise in both two-second sections. However, the younger group showed an increase only in frequency related acoustic parameters (i.e., STD ) in the first two-second section. The significant changes in values in the post-swallowing parameters might indicate temporary irregularities in pitch and amplitude along with higher amounts of noise in the voice. The results could be attributed to water residues in the vocal fold and vocal tract, as well as a deterioration of the motor and sensory functions caused by anatomical and physiological changes that result from aging.
https://doi.org/10.13064/KSSS.2018.10.2.069 인용 PDF KSCI

Age and Sex Differences in Acoustic Parameter of Middle Age and Elderly Adult Voice (장.노년기 성인 음성의 성별과 연령에 따른 음향음성학적 특성 비교)

Lee, Hyo-Jin;Kim, Soo-Jin
- MALSORI
- /
- no.60
- /
- pp.13-28
- /
- 2006
This study focused on comparing the following acoustic changes according to age and sex in adulthood: Fo, Jitter, Shimmer, and NHR. One hundred twenty Korean adults were divided into three age groups (20's, 50's, and 70's) and two sex groups (male and female). The subjects of this study performed three tasks: (1) sustained three vowels; (2) read on paragraph of 'Taking a Walk' (3) explained a picture. The data was analyzed using the MDVP of Multi-Speech. In the parameter of Fo, sex and age were influential factors. In the parameters of Jitter, Shimmer and NHR, the effect of sex and age was different in all three parameters. When the groups organized by sex were analyzed by age, the 20's group showed a statistical difference in all four parameters (Fo, Jitter, Shimmer, and NKR), when compared to the other two age ranges of 50's and 70's. We need to consider our standard parameter for the normal voice in the Korean elderly because the 50's and 70's age normal groups in our study are out of the current range of normal in MDVP.
PDF

A Study of Extracting Acoustic Parameters for Individual Speakers (개별화자의 음성파라미터 추출에 관한 연구: 음성파라미터의 상관관계를 중심으로)

Ko, Do-Heung
- Speech Sciences
- /
- v.10 no.2
- /
- pp.129-143
- /
- 2003
Fundamental frequency (Fo), jitter, shimmer, and harmonics-to-noise ratio (NHR) have been measured to see their interactions between the parameters using Multi-Dimensional Voice Program (MDVP). 100 Korean normal adults (50 males and 50 females) ranging from their early 20's to their early 30's produced the eight sustained vowels including /a/, /i/, /u/, /c/, /e/,/$\varepsilon$/, /i/, and /e/. The subjects were asked to read the above vowels five times in isolation with the interval of five seconds, respectively. Male voices, on the average, showed 130.7 Hz in Fo, 0.6696% in jitter, 1.8151% in shimmer, and 0.12 in NHR, while female voices showed 232.8 Hz in Fo, 0.9222% in jitter, 1.9199% in shimmer, and 0.1098 in NHR. As to the correlation coefficient, it was found that for male speakers jitter vs. shimmer, shimmer vs. NHR, Fo vs. shimmer, and Fo vs. NHR are statistically significant. It was found that for female subjects jitter vs. shimmer and Fo vs. shimmer are statistically significant. However, it is concluded that the correlation coefficient in females are not meaningful in a practical way though they are all statistically significant.
PDF

A Comparison of Voice Analysis of Children with Cochlear Implant and with Normal Hearing (인공와우이식 아동과 건청 아동의 음성 분석 비교)

Yoon, Misun;Choi, Eunah;Sung, Youngju
- Phonetics and Speech Sciences
- /
- v.5 no.4
- /
- pp.71-78
- /
- 2013
The purpose of this study was to compare the acoustic voice outcomes of children with cochlear implant to those of children with normal hearing. Participants were 41 children using unilateral cochlear implant (18 males and 23 females), and children with normal hearing from the same age and sex. Mean age of implantation was approximately 3 years old, mean duration of implant use was 4 years in CI group. Acoustic analyses were performed using MDVP of CSL. Speech samples were 3 sustained vowels, /a, i, u/. 9 parameters (F0, Fhi, Flo, Jitter, Shimmer, vF0, vAm, NHR, and SPI) were analyzed. Children with CI did not show the significant differences in those parameters after the vowel /a/ phonation. Meanwhile, there were significantly different results in F0, Fhi, vF0, and SPI after /i, u/ phonation. These results revealed that differences of voice characteristics in children with CI compare to children with NH persist regarding vowel context. It suggests that high vowels would recommend as speech samples for acoustic evaluation. Futhermore perceptual analysis and speech therapy for phonation control would be necessary for children with CI.
https://doi.org/10.13064/KSSS.2013.5.4.071 인용 PDF

A Study for Acoustic Features of Benign Laryngeal Disease (양성 성대 점막 질환의 음향학적 특성에 관한 연구)

Lee, Jae Seok;Kim, Jin Pyeong;Park, Jeong Je;Kwon, Oh Jin;Woo, Seung Hoon
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.24 no.1
- /
- pp.47-50
- /
- 2013
Background and Objectives:The purpose of this study is to find features in acoustics and to learn useful features of parameters in order to distinguish laryngeal diseases through many acoustic variables. Materials and Methods:The subjects of this study were 125-male patients who had been diagnosed with vocal nodule, vocal polyp, vocal cyst, Reinke's edema, leukoplakia. To research the features of each disease in acoustics, they are measured 34 parameters by using MDVP. Results:It is clear that in order to see a meaning result when distinguishing laryngeal diseases, $F_0$, $MF_0$, $T_0$, Fhi, Flo, PER variables are significant (p<.05). It means that variables related to fundamental frequency are important to anticipate which group will be diagnosed with Reinke's edema and leukoplakia. vAm had an effect on getting a significant result in terms of amplitude perturbation parameters, which is useful to distinguish between laryngeal polyp/cyst and other laryngeal disease (p<.05). ATRI made a significant result in related to tremor parameters, which is useful to distinguish between laryngeal polyp and other laryngeal disease (p<.05). Conclusion:$F_0$, $MF_0$, $T_0$, Fhi, Flo, PER, vAm, ATRI might be meaningful parameters distinguishing pathologic from benign laryngeal diseases. Especially, the vAm and ATRI are an important factor when forecasting which group would be diagnosed with vocal polyp.
PDF

Comparison of Acoustic Parameters According to the Section of Analysis in Sustained Vowel Phonation (모음연장 음성 샘플의 분석 구간에 따른 음향학적 파라미터 비교)

Shin, Yu-Jeong
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.18 no.7
- /
- pp.269-274
- /
- 2017
This study aimed to investigate the acoustic differences that occur in diverse sections of sustained vowel phonation, which is often used in an objective speech analysis of voice disorder patients. The subjects included 17 voice disorder patients (vocal nodules) and 12 normal individuals without any voice disorder. The participants' sustained vowel phonation of /a/ was divided into onset, middle, and offset, and the jitter, shimmer, and NHR in each section were analyzed using the MDVP(Multi-Dimensional Voice Program). The Friedman test and post hoc analysis were used. In the vocal nodules group, the jitter, shimmer and NHR were significantly higher in the off section of sustained vowel phonation than in the middle section, and there were no significant differences between the beginning and middle sections. In contrast, in the group of normal individuals, there were no significant differences between any of the sections. The values of the acoustic parameters according to the section of analysis in the sustained vowel phonation are different and the vocal in the end section is significantly more unstable than that in the middle section. The results of this study will be useful for selecting the sections to be analyzed in sustained vowel phonation and interpreting the results of the analysis.
https://doi.org/10.5762/KAIS.2017.18.7.269 인용 PDF KSCI

The Voice Characteristics of Children with Palatine Tonsil and Adenoid Hypertrophy (구개편도와 아데노이드 비대 아동의 음성 특성)

Song, Yun-Kyung
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.8
- /
- pp.790-795
- /
- 2009
This study evaluated the voice of 68 normal children and 50 children with palatine tonsil and adenoid hypertrophy with MDVP to examine the hypothesis that their mouth breathing makes the vocal folds dry and this condition contributes to lower the level of voice quality. The results showed that children with palatine tonsil and adenoid hypertrophy had statistically significant elevations in Jitt, RAP, PPQ, Shim and APQ parameters, and had the lower level of voice quality. Therefore, the children with palatine tonsil and adenoid hypertrophy need vocal hygiene education.
https://doi.org/10.7776/ASK.2009.28.8.790 인용 PDF KSCI

Acoustic screening test for laryngeal cancer (음성을 이용한 후두암의 집단선별검사)

박헌수
- Korean Journal of Bronchoesophagology
- /
- v.7 no.2
- /
- pp.161-167
- /
- 2001
Background and Objectives： Total laryngectomy is often required for advanced cases. But this operation induced the many inconvenience of basic daily life. Early diagnosis of laryngeal cancer is very important to prevent from this disastrous condition. In this point of view, mass screening test for early detection of laryngeal cancer is necessary. Screening test using voice has many advantages such as simple, less interventional. Voice collection by Automatic Response System(ARS) is comfortable and easy to got acoustic sample. Thus author tried to got the acoustic parameters which can differentiate normal, benign. and malignant laryngeal diseases and also checked the availability of parameters on neural network system. Materials and Methods: Author has evaluated the voice from 17 laryngeal cancer patients and 45 benign laryngeal disease patients who visited at Department of Otolaryngology, Pusan National University Hospital from May 1998 to April 2001, and 15 normal control. Author chose the sir Parameters (Jitt. vFo, Shim, vAm, NHR, SPI) that was thought to be related with voice collected by ARS among thirty-three parameters analysed by a Multi-Dimensional Voice Program (MDVP). Two-step neural network was used for the availability of six parameters. Results: The detection rate of normal voice by ARS voice analysis is 78.5% and detection rate of abnormal voice was 97.1 o/o. Among abnormal voice, the detection rate of benign laryngeal diseases and laryngeal cancers were 82.4 o/o, 70.6% respectively. Conclusion: Author concluded that six parameters and Matlab based neural network software may be effective in development of acoustic screening system for laryngeal cancer and further study should be necessary for development of new acoustic parameters.
PDF

Qualitative Classification of Voice Quality of Normal Speech and Derivation of its Correlation with Speech Features (정상 음성의 목소리 특성의 정성적 분류와 음성 특징과의 상관관계 도출)

Kim, Jungin;Kwon, Chulhong
- Phonetics and Speech Sciences
- /
- v.6 no.1
- /
- pp.71-76
- /
- 2014
In this paper voice quality of normal speech is qualitatively classified by five components of breathy, creaky, rough, nasal, and thin/thick voice. To determine whether a correlation exists between a subjective measure of voice and an objective measure of voice, each voice is perceptually evaluated using the 1/2/3 scale by speech processing specialists and acoustically analyzed using speech analysis tools such as the Praat, MDVP, and VoiceSauce. The speech parameters include features related to speech source and vocal tract filter. Statistical analysis uses a two-independent-samples non-parametric test. Experimental results show that statistical analysis identified a significant correlation between the speech feature parameters and the components of voice quality.
https://doi.org/10.13064/KSSS.2014.6.1.071 인용 PDF KSCI

Search Result 36, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)