• Title/Summary/Keyword: speech analysis

Search Result 1,585, Processing Time 0.027 seconds

Auditory-Perceptual and Acoustic Evaluation in Measuring Dysphonia Severity of Vocal Cord Paralysis (성대마비의 음성장애 측정을 위한 청지각적 및 음향학적 평가)

  • Kim, Geun-Hyo;Lee, Yeon-Woo;Park, Hee-June;Bae, In-Ho;Lee, Byung-Joo;Kwon, Soon-Bok
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.28 no.2
    • /
    • pp.106-111
    • /
    • 2017
  • Background and Objectives : The purpose of this study was to investigate the criterion-related concurrent validity of two standardized auditory-perceptual assessments and the Acoustic Voice Quality Index (AVQI) for measuring dysphonia severity in patients with vocal cord paralysis (VCP). Materials and Methods : Total 210 patients with VCP and 236 normal voice subjects were asked to sustain the vowel [a:] and to read aloud the Korean text "Walk". A 2 second mid-vowel portion of the sustained vowel and two sentences (with 26 syllables) were recorded. And then voice samples were edited, concatenated, and analyzed according to Praat script. Two standardized auditory-perceptual assessment (GRBAS and CAPE-V) were performed by three raters. Results : The VCP group showed higher AVQI, Grade (G) and Overall Severity (OS) values than normal voice group. And the correlation among AVQI, G, and OS ranged from 0.904 to 0.926. In ROC curve analysis, cutoff values of AVQI, G, and OS were <3.79, <0.00, and <30.00, respectively, and the AUC of each analysis was over .89. Conclusion : AVQI and auditory evaluation can improve the early screening ability of VCP voice and help to establish effective diagnosis and treatment plan for VCP-related dysphonia.

  • PDF

Comparison of Pre and Post-operational Phonatory Aerodynamic Parameters in Vocal Polyp and Vocal Cord Palsy Patients (성대마비 및 성대용종 환자의 수술 전과 후의 공기역학적 변수 비교)

  • Lee, Dahye;Kim, Jaeock;Oh, JaeKoon;Choi, Hong-Shik
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.26 no.2
    • /
    • pp.112-116
    • /
    • 2015
  • Background and Objectives : Aerodynamic analysis is an examination which provides information regarding various vocalization measures indicating laryngeal efficiency. Voice evaluation using such examination must be capable of distinguishing between normal to abnormal voice. It also observes variables on aerodynamic characteristics by gender in regards to patients of vocal disorders, especially of vocal cord paralysis and vocal polyp, and compares the conditions before and after surgery. This paper therefore, seeks to build a framework for establishing standard levels of aerodynamical characteristic on vocal disorders. Subjects and Methods : The study was intended for a total number of 20 patients with vocal polyp or unilateral vocal cord paralysis. Those with the vocal polyp underwent laryngomycroscopy surgery and the vocal cord paralysis, vocal fold injection using Restylane. Aerodynamic analysis fulfilled the Maximum sustained Phonation (MXPH) and Voicing Efficiency (VOEF) by using PAS Model 6600 (KayPENTAX, USA). Results : In MXPH, increase in PHOT were evident with vocal polyp after surgery. As for patients with vocal cord paralysis, MAXDB, MEADB, DHODB, PHOT all have increased and MEAP, PEF, MEAF decreased after surgery. In VOEF, patients with vocal cord paralysis who underwent surgery showed increase in MAXDB, MEADB, DHODB, FET100, ARES, but decreases in PEF, TARF. Conclusion : Overall, it can be concluded that patients with the vocal polyp and vocal cord paralysis seemed to get closer to the normal values after than before surgery in majority of measures. This confirms that the function of their vocal cord has improved nearly to normality through operations.

  • PDF

A Study on the Research Trends of the Influential Factors on Multicultural Acceptability of Korean Teens (우리나라 청소년의 다문화 수용성 관련 요인에 관한 연구 동향 분석)

  • Cha, Seulki;Byeon, Haewon
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.3
    • /
    • pp.211-216
    • /
    • 2018
  • The study provided basic data that could be used for future research by identifying trends in the factors that affect the multicultural acceptability of Korean teens from 2008 to 2017. Research methods have searched for papers using keywords from 'Multicultural', 'Youth', 'Middle School Student', 'High School Student' and 'Acceptance' in academic data base. As a result of analysis, journal articles and dissertations soared as of 2012, and the academic field identified the largest number of studies being carried out in the field of social science and pedagogy. Quantitative research and cross-sectional study were the most important methods of previous research. On the other hand, few studies have concurrently analyzed new factors related to multicultural acceptability. The results of this study indicate the need for a combined analysis of the new factors that may affect the multicultural acceptability of adolescents.

A Study on Frequency-Time Plane Analysis of Wavelet (웨이브렛의 주파수-시간 평면 해석에 관한 연구)

  • Bae, Sang-Bum;Ryu, Ji-Goo;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.2
    • /
    • pp.451-454
    • /
    • 2005
  • Recently, many methods to analyze signal have been proposed and representative methods are the Fourier transform and wavelet transform. In these methods, the Fourier transform represents signal with combination cosine and sine at all locations in the frequency domain. However, it doesn't provide time information that particular frequency occurs in signal and depends on only the global feature of the signal. So, to improve these points the wavelet transform which is capable of multiresolution analysis has been applied to many fields such as speech processing, image processing and computer vision. And the wavelet transform, which uses changing window according to scale parameter, presents time-frequency localization. In this paper, we proposed a new approach using a wavelet of cosine and sine type and analyzed features of signal in a limited point of frequency-time plane.

  • PDF

Part-Of-Speech Tagging and the Recognition of the Korean Unknown-words Based on Machine Learning (기계학습에 기반한 한국어 미등록 형태소 인식 및 품사 태깅)

  • Choi, Maeng-Sik;Kim, Hark-Soo
    • The KIPS Transactions:PartB
    • /
    • v.18B no.1
    • /
    • pp.45-50
    • /
    • 2011
  • Unknown morpheme errors in Korean morphological analysis are divided into two types: The one is the errors that a morphological analyzer entirely fails to return any morpheme sequences, and the other is the errors that a morphological analyzer returns incorrect combinations of known morphemes. Most previous unknown morpheme estimation techniques have been focused on only the former errors. This paper proposes a unknown morpheme estimation method which can handle both of the unknown morpheme errors. The proposed method detects Eojeols (Korean spacing units) that may include unknown morpheme errors using SVM (Support Vector Machine). Then, using CRFs (Conditional Random Fields), it segments morphemes from the detected Eojeols and annotates the segmented morphemes with new POS tags. In the experiments, the proposed method outperformed the conventional method based on the longest matching of functional words. Based on the experimental results, we knew that the second type errors should be dealt with in order to increase the performance of Korean morphological analysis.

Performance analysis of multistage interference cancellation schemes for a DS/CDMA system subject to delay constraint (CD/CDMA 시스템에서의 제한된 처리 지연 시간을 고려한 단단계 간섭 제거 방식에 대한 성능 분석)

  • 황선한;강충구
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.22 no.12
    • /
    • pp.2653-2663
    • /
    • 1997
  • The successive and parallel interference cancellation schemes are two well-known types of multi-stage interference cancellation schemes using the conventional correlator receivers as a basic building block, which has been known to significantly improve the performance of DS/CDMA system in the multiple access communication. Performance comparison between these two schemes is made strictly based on the analytical and it has been shown that the successive interference cancellation (SIC) scheme is more resistant to fading than the parallel interference cancellation (PIC) scheme. We further investigate the performance of the successive IC scheme subject to the delay constraint, which may be imposed typically on most of service applications with a real-time transmission requirement, including speech and video applications. Our analysis demonstrates that the performance may be significantly improved by the groupwise successive interference cancellation (GSIC) scheme, which can be properly optimized to meet the given delay constraint.

  • PDF

Database Interface System with Dialog (대화를 통한 데이타베이스 인터페이스 시스템)

  • Woo, Yo-Seop;Kang, Seok-Hoon
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.3
    • /
    • pp.417-428
    • /
    • 1996
  • In this paper, a database interface system with natural language dialogue is designed and implemented. The system is made up of language analysis, context processing, dialogue processing and DB processing unit. The method for classifying and processing an undefined word in language analysis is proposed. It reduces the dictionary size, which gives difficulties in DB Interface. And the current DB Interfaces dealt with an input utterance independently. But the system in this paper provides a user with the interface environment in which he or she can have a continuous conversation with the system and retrieve DB information. Thus in this paper, speech acts which include user's inattentions well as propositional contents are defined, and user action hierarchical model for library DB retrieval is constructed. And the system uses the defined knowledge to recognize-user's plan, effectively understanding and managing the ongoing dialogue. And the system is implemented in the domain of library database in order to prove the proposed methods in this paper.

  • PDF

Effect of Voice Reinforcement Method for Treatment of Vocal Nodules: Preliminary Study (음성강화기법의 성대결절 치료 효과)

  • Kim, Ji-Sung;Lee, Dong-Wook
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.31 no.1
    • /
    • pp.13-18
    • /
    • 2020
  • Background and Objective The purpose of this study is to report the effect of voice therapy using the voice reinforcement method (VRM) in patients with vocal nodules. It is one of the holistic voice therapy methods for improving vocal mechanisms. VRM includes not only direct and indirect voice therapy, but also trial therapy and self-practice. Composed of four stages: vocal hygiene education, relaxation, reinforcement, and generalization. Materials and Methods The subjects were 13 patients who were diagnosed with vocal nodules. Acoustic analysis, auditory perceptual assessment, K-VHI-10 and nodules size were compared before and after voice therapy. Voice therapy was conducted by speech-language pathologist and the mean number was 4.2. Results In acoustic analysis, Jitter, vF0, vAm, Shimmer, NHR, and VTI were significantly decreased. F0 was increased after voice therapy for women. 'Grade', 'Rough,' and 'Breathy' were significantly decreased in the GRBAS scale after voice therapy. In addition, K-VHI-10 and nodules size were significantly decreased. Conclusion VRM seems to be an effective voice therapy method in vocal nodules treatment. In VRM, especially, trial therapy is given motivation for vocal nodules treatments and self-practice has a continuous therapeutic effect in everyday life. VRM can be also applied to the voice therapy for other hyper-functional dysphonia.

Analysis of Treatment Result of Arachnoid Cyst (뇌지주막 낭종의 치료결과 분석)

  • Lee, Jeong-Hwan;Kim, Oh-Lyong;Kim, Seong-Ho;Bae, Jang-Ho;Choi, Byung-Yon;Cho, Soo-Ho
    • Journal of Korean Neurosurgical Society
    • /
    • v.30 no.sup2
    • /
    • pp.211-215
    • /
    • 2001
  • Objective : The present study was performed to analyze treatment results for 22 cases of arachnoid cyst and to have appropriate surgical method in our department. Material and Methods : We performed a retrospective study in 22 cases in 11 years between 1989 to 2000 that could be followed up. The analysis was based on the results of patients age, sex distribution, developed area, clinical symptom, treatment method, and complication. Results : The age range of cyst development was between 7 months to 60 years with the average age of 21 years. As for sex distribution, 20 were male and 2 were female, with significantly more cyst development in males than females. Thirteen cases were developed in the sylvian fissure, 3 cases in the posterior fossa, 4 cases in the cerebral convexity of the supratentorial area, 1 case in the suprasella and 1 case in interhemiphere. Those cases with the sylvian fissure involvement included 6 cases of Type I, 4 cases of Type II, and 3 cases of Type III. As for the distribution according to hemisphere, more arachnoidal cysts were seen in the right hemisphere. The most common clinical symptom was headache, followed by seizure and speech disturbance. As for the treatment method in 22 cases, surgery was performed in 17 cases and conservative treatment in 5 cases. Fenestration was performed in 14 cases. 13 cases of them showed good outcome, and 1 case with delayed development showed no improvement. Cyst-peritoneal shunt was done in 2 cases. Both fenestration and cyst-peritoneal shunt were done in 1 case. Conclusion : Patients who perforemed fenestration were showed good outcome with few complication. We concluded that fenestration is the most appropriate surgical method for arachnoid cyst.

  • PDF

Monophthong Recognition Optimizing Muscle Mixing Based on Facial Surface EMG Signals (안면근육 표면근전도 신호기반 근육 조합 최적화를 통한 단모음인식)

  • Lee, Byeong-Hyeon;Ryu, Jae-Hwan;Lee, Mi-Ran;Kim, Deok-Hwan
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.3
    • /
    • pp.143-150
    • /
    • 2016
  • In this paper, we propose Korean monophthong recognition method optimizing muscle mixing based on facial surface EMG signals. We observed that EMG signal patterns and muscle activity may vary according to Korean monophthong pronunciation. We use RMS, VAR, MMAV1, MMAV2 which were shown high recognition accuracy in previous study and Cepstral Coefficients as feature extraction algorithm. And we classify Korean monophthong by QDA(Quadratic Discriminant Analysis) and HMM(Hidden Markov Model). Muscle mixing optimized using input data in training phase, optimized result is applied in recognition phase. Then New data are input, finally Korean monophthong are recognized. Experimental results show that the average recognition accuracy is 85.7% in QDA, 75.1% in HMM.