• 제목/요약/키워드: speech quality

검색결과 807건 처리시간 0.021초

SMV와 G.723.1 음성부호화기를 위한 파라미터 직접 변환 방식의 상호부호화 알고리듬 (Transcoding Algorithm for SMV and G.723.1 Vocoders via Direct Parameter Transformation)

  • 서성호;장달원;이선일;유창동
    • 대한전자공학회논문지SP
    • /
    • 제40권6호
    • /
    • pp.61-70
    • /
    • 2003
  • 본 논문에서는 SMV와 G.723.1 음성부호화기를 위한 파라미터 직접 변환 방식의 상호부호화 알고리듬을 제안한다. 상호부호화를 위하여 부가적인 복호화, 부호화 과정을 거쳐야하는 Tandem 방식과 달리 제안된 방식에서는 양 음성부호화기가 음성을 부호화하는데 공통적으로 사용되는 파라미터들을 직접 변환한다. 제안된 알고리듬은 파라미터 복호화, LSP 변환, 피치 지연 변환, 여기신호 변환 그리고 비트율 결정으로 이루어진다. 제안된 알고리듬을 다양한 방법으로 평가해 본 결과 계산량과 지연시간을 줄이면서 tandem 방식과 동등한 수준의 음질을 구현함을 확인할 수 있었다.

한국 정상 노인층의 삼킴장애지수와 후두 기능에 따른 삼킴 특성 (Dysphagia Handicap Index and Swallowing Characteristics based on Laryngeal Functions in Korean Elderly)

  • 김근희;최성희;이경재;최철희
    • 말소리와 음성과학
    • /
    • 제6권3호
    • /
    • pp.3-12
    • /
    • 2014
  • Larynx plays an important role in phonation and protection of the respiratory tract during swallowing. The reduced anatomical and physiological function in elevation of larynx and glottis closure can cause problems in voice and swallowing. The present study investigated the Korean version of handicap index of dysphagia in elderly Koreans. Therefore, 60 normal elderly Koreans ranged from 65 to 95 and 20 normal Korean young adults aged from 20 to 25 were participated in this study to compare total (T), physical (P), functional (F), and emotional (E) index scores between two groups as well as among sub groups (60s, 70s, 80s) in elderly. For swallowing, total and sub dysphagia handicap index (DHI) scores, voice quality during /a/phonation following swallowing (saliva and water), intensity of coughing, and L-DDK were measured. The results showed that functional (P), physical (P), emotional (E) scores as well as total (T) score were significantly different between young adults and old adults in DHI(p<.05). Additionally, there was a negative correlation between total DHI score and intensity of coughing (r=-.51) as well as L-DDK (r=-.70). These findings suggest that a slow rate in vocal fold adduction and reduced intensity of coughing in the elderly affect swallowing function. Thus, recently translated Korean version of DHI may be useful as supplement in evaluating the swallowing problems in elderly people.

아동의 음성문제와 음성 관련 행동특성에 대한 부모 및 담임교사의 인식 (The awareness of parents and teachers in the psycho- and voice behavioral characteristics related to children's voice problems)

  • 송경화;김재옥
    • 말소리와 음성과학
    • /
    • 제8권2호
    • /
    • pp.49-56
    • /
    • 2016
  • The study examined that parents and teachers were aware of what extents behavioral characteristics were related to the children's voice problems. The voice samples of 89 children in the ages of 3 to 5 were collected and their voice quality were graded by G scale of GRBAS. The parents and teachers of the children were asked to complete the questionnaire composed of the pediatric Voice Handicap Index (pVHI) and the psycho- and voice behavioral characteristics of their children. The results are as follows. First, there were no significant differences in both pVHI and behavioral characteristics of their children by G scale. However, significant differences were shown in the behavioral characteristics between parents and teachers, but no difference in pVHI between them. In addition, there was a significant correlation between the psycho-behavioral characteristics and the voice behavioral characteristics in both parents and teachers. These results represent that parents and teachers are not aware of the presence of their children's voice problems and such voice problems are affected by behavioral characteristics associated with the use of voice.

성대접촉이완훈련이 성대결절아동의 음성개선에 미치는 효과 (The Effects of Vocal Relaxation Training on Voice Improvement of Children with Vocal Nodules)

  • 한지은;성철재
    • 말소리와 음성과학
    • /
    • 제4권4호
    • /
    • pp.147-154
    • /
    • 2012
  • The purpose of this study is to examine the effect of voice improvement when vocal training, which relaxes the vocal contact, is applied to children with vocal nodules. Subjects included 20 5- to 12-year-old boys with vocal nodules in Otolaryngology and for whom voice therapy had been advised. The vocal therapy was conducted for 40 minutes per a week for a total of eight times. Results were evaluated by videostroboscopy, auditory-perceptual evaluation of GRBAS Scale, aerodynamic test, and acoustic analysis before and after therapy. As a result, first, the size of vocal nodules was reduced and the unstable pattern of vocal contact was improved. Glottic closure was increased and Phase symmetry was decreased during vocal vibration. Mucosal wave was increased and muscle tension of the larynx was reduced. Second, auditory-perceptual evaluation showed that subjects' overall quality of voice improved. GRBAS Scale Evaluation showed that the characteristics of the subjects' voice which were rough, breathy, and strained and breathy were reduced after therapy. Third, the measurements of acoustic parameters showed a statistically significant improvement. The fundamental frequency of the subejects' voice was increased and values of Jitter and Shimmer, NHR, [H1-H2] decreased. Fourth, the maximum phonation time of children was increased. These results imply that vocal relaxation training conducted in this study has a very positive effect to improve the voice of children with vocal nodules.

The Comparisons of GRBAS Perceptual Judgments according to Levels of Utterances

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • 음성과학
    • /
    • 제8권1호
    • /
    • pp.135-142
    • /
    • 2001
  • The present study was performed to investigate adequate levels of utterances which can give essential as well as useful information about the patients' voice, by examining the degrees of correlation between the levels of utterances (vowels, words, and phrase paragraph reading) and the entire utterance including all of the levels. For this purpose, a total of 10 individual utterance samples (5 vowels, 3 words, 1 phrase, 1 paragraph reading) were collected from each of the 30 subjects with voice disorder patients, and four experienced voice therapists evaluated them using GRBAS. The results showed that four therapists highly agreed upon on 'G' parameter. The coefficient of the correlation between each level of utterance and entire utterance tended to be above 0.70. Judgements of the vowel /$\varepsilon$/ as well as /o/ highly correlated with the judgement of the entire utterance. Regardless of severity, the judgement of the entire utterance highly correlated with the judgements of the vowel /u/ and the paragraph reading. These results suggest that experienced voice therapists can precisely evaluate patients' voice quality with only one sustained vowel in the clinic field, as is done with the entire utterance evaluation.

  • PDF

Speaker-Dependent Emotion Recognition For Audio Document Indexing

  • Hung LE Xuan;QUENOT Georges;CASTELLI Eric
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2004년도 ICEIC The International Conference on Electronics Informations and Communications
    • /
    • pp.92-96
    • /
    • 2004
  • The researches of the emotions are currently great interest in speech processing as well as in human-machine interaction domain. In the recent years, more and more of researches relating to emotion synthesis or emotion recognition are developed for the different purposes. Each approach uses its methods and its various parameters measured on the speech signal. In this paper, we proposed using a short-time parameter: MFCC coefficients (Mel­Frequency Cepstrum Coefficients) and a simple but efficient classifying method: Vector Quantification (VQ) for speaker-dependent emotion recognition. Many other features: energy, pitch, zero crossing, phonetic rate, LPC... and their derivatives are also tested and combined with MFCC coefficients in order to find the best combination. The other models: GMM and HMM (Discrete and Continuous Hidden Markov Model) are studied as well in the hope that the usage of continuous distribution and the temporal behaviour of this set of features will improve the quality of emotion recognition. The maximum accuracy recognizing five different emotions exceeds $88\%$ by using only MFCC coefficients with VQ model. This is a simple but efficient approach, the result is even much better than those obtained with the same database in human evaluation by listening and judging without returning permission nor comparison between sentences [8]; And this result is positively comparable with the other approaches.

  • PDF

내전형 경련성발성장애의 호흡압력과 공기역학적 특징 (The Aerodynamic & Respiratory Muscle Pressure Aspects of Patients with Adductor Spasmodic Dysphonia)

  • 남도현;최성희;최재남;최홍식
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.203-213
    • /
    • 2005
  • This study was conducted to investigate the respiratory and aerodynamic function of adductor spasmodic dysphonia (ADSD) patients. Participants were (1) 18 females SD patients with non- Botulinum toxin injection (2) 14 females SD patients who had taken treatment of Botulinum toxin injection. (3) 14 age- and sex- matched normal female controls. Spirometer and phonatory function analyzer were used for respiratory muscle pressure (MIP: Maximum inspiratory pressure), MEP: Maximum expiratory pressure)& MPT(Maximum phonation time) and aerodynamic(F0:Fundamental frequency, intensity, MFR: Mean flow late, Psub: Subglottal pressure) measurement. The results were as follows: (1) Normal group was significantly higher in MIP, MEP, MPT than two SD groups (p < .05); (2) MPT was significantly lower in SD with non-Botulinum toxin injection group than SD with the treatment experience of Botulinum toxin injection (p < .05); (3) All aerodynamic parameters, F0, intensity, MFR, Psub, were not significantly different among three groups(p > .05).The reason of short MPT in ADSD may use lower respiratory pressure than normal group as strategy to decrease their tremulous voice quality. Moreover respiratory muscle pressure was lower than normal group regardless of botulinum toxin injection treatment.

  • PDF

파킨슨병 환자와 정상 노인의 음성비교 (A Comparison of the Voice Differences of Patients with Idiopathic Parkinson's Disease and a Normal-Aging Group)

  • 강영애;김용덕;반재천;성철재
    • 말소리와 음성과학
    • /
    • 제1권1호
    • /
    • pp.99-107
    • /
    • 2009
  • In view of the hypothesis that the effects of Parkinson disease on voice production can be detected before pharmacological intervention, the voice differences of patients with Idiopathic Parkinson's disease and a healthy aging group were diagnostically analyzed with the long term object of establishing, for clinical purposes, early disease-progression biomarkers. Fifteen patients with Idopathic Parkinson's disease (prior to pharmacological intervention) and a healthy control group of 15 were selected and every voice was recorded three times using praat (ver. 5022) with a headset mic. Relevant parameters - acoustic measure of /a/ phonation, F0 related parameters, MPT related parameters, articulatory ratio, VOT - were then analyzed by MANOVA. Significant differences were found in the F0 related (low F0, high F0, F0 range) and MPT related parameters. There were also significant differences in acoustic measurements (intensity, shimmer, HNR, jitter), AMR (/$t{\Lambda}$/,/$k{\Lambda}$/) and VOT (/ta/), The findings indicated that the voice production of patients with Idiopathic Parkinson's disease have normal pitch but bad quality. In particular, with slow articulatory ratios and VOT values, the tongue tip functioning of patients was lower than for the healthy group.

  • PDF

Mieko Han의 한국어 음성학 연구 (Mieko Han and her Works on Korean Phonetics)

  • 고도흥
    • 음성과학
    • /
    • 제1권
    • /
    • pp.213-223
    • /
    • 1997
  • This paper deals with a general review of Mieko S. Han, who made a significant contribution to the studies of Korean phonetics during the 1960' s and early 1970' s. As both a single and joint author, Dr. Han published important papers in both quantity and quality, which have been cited among Korean phoneticians until today. Before Dr. M. Han' s work, professor of USC in the department of East Asian Languages & Cultures, there were only a few phonetics-related publications in Korea, most of which are papers or books based on non-experimental traditional approach. It is known that there was coexistence between traditionalism and structuralism in the field of Korean linguistics. It was, however, fortunate that we had two important phoneticians (M. Han and Chin-W Kim) abroad at that time. Mieko Han' s concern was to investigate experimental characteristics of the system of Korean vowels and consonants using a Spectrograph, which was the single most important tool for analysing phonetic data at that time. Dr. Han conducted her experimental studies on Korean phonetics, mostly funded by the Office of Naval Research, in terms of duration, fundamental frequency, Voice Onset Time (VOT), intensity, and so on. This paper aims to re-appreciate Dr. Han's specific contribution to the study of Korean phonetics since she played an important role as a pioneer of early Korean phonetics. Further, it is highly recommended that Dr. Han's works can be extremely useful for a graduate student, who seriously would like to specialize in Korean phonetics in the first step.

  • PDF

노인성 음성장애의 음성치료 효과 (The Effects of Voice Therapy in Age-related Dysphonia)

  • 김성태
    • 말소리와 음성과학
    • /
    • 제2권2호
    • /
    • pp.117-121
    • /
    • 2010
  • The This study aimed to evaluate the effects of the voice therapy we operated to the patients with age-related dysphonia. Thirty four participants who were diagnosed as age-related dysphonia in laryngoscopic finding from January, 2009 to December, 2009 completed the study. The participants were aged from 60 to 82 years old with a mean age of 70.6. All participants had received the abdominal breath technique, SKHPIP with laughter, and basic vocal training with description of their problem, the length of which ranged from four sessions to twelve sessions. We executed the videostroboscopy to compare the aspect of voicing change and the perceptual assessment, voice range profile, acoustic and aerodynamic measures to identify change of voice. Participants had glottal gap due to incomplete glottic closure during voicing on the pretest. After they took the voice therapy, the glottic gap became narrow and rough and breathy voice was reduced. There were significant difference in acoustic and aerodynamic measures. Jitter, Shimmer, MFR were reduced and MPT, Psub were increased(p<.05). Participants' pitch range and intensity range were increased on the posttest performance after taking voice therapy. Especially, most of them were showed that pitch range was increased significantly in high frequency area. The results of this investigation indicate that the voice therapy using abdominal breath, SKHPIP, and exercise together is effective for the patients who have age-related dysphonia to improve their voice quality. We recommend to apply this technique to functional voice disorders who are showed glottal gap.

  • PDF