Search | Korea Science

Improved Text-to-Speech Synthesis System Using Articulatory Synthesis and Concatenative Synthesis (조음 합성과 연결 합성 방식을 결합한 개선된 문서-음성 합성 시스템)

이근희;김동주;홍광석
- Proceedings of the IEEK Conference / 2002.06d / pp.369-372 / 2002
In this paper, we present an improved TTS synthesis system that combines articulatory synthesis and concatenative synthesis. In concatenative synthesis, segments of speech are excised from spoken utterances and connected to form the desired speech signal. We use LPC coefficients as the speech parameters, VQ to reduce memory requirements, and TD-PSOLA to improve naturalness.
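As a rough illustration of how VQ can reduce storage, the sketch below replaces each parameter vector with the index of its nearest codeword. The codebook, the frame values, and the function name `nearest_codeword` are hypothetical and are not taken from the paper; a real system would train the codebook on LPC-derived vectors.

```python
def nearest_codeword(vector, codebook):
    """Return the index of the codeword closest to `vector`
    under squared Euclidean distance."""
    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: sqdist(vector, codebook[i]))


# Hypothetical 2-D codebook and frames (illustrative values only).
codebook = [(0.0, 0.0), (1.0, 1.0), (-1.0, 0.5)]
frames = [(0.9, 1.2), (-0.8, 0.4), (0.1, -0.1)]

# Each frame is stored as one small integer instead of a full vector.
indices = [nearest_codeword(f, codebook) for f in frames]

# Decoding looks the codeword up again (lossy reconstruction).
decoded = [codebook[i] for i in indices]
```

Only the indices and the shared codebook need to be stored, which is where the memory saving would come from.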

Development of Speech Training Aids Using Vocal Tract Profile (조음도를 이용한 발음훈련기기의 개발)

박상희;김동준;이재혁;윤태성
- The Transactions of the Korean Institute of Electrical Engineers / v.41 no.2 / pp.209-216 / 1992
Deaf people train their articulation by observing a tutor's mouth, by feeling the motions of the vocal organs by touch, or by using speech training aids. Existing speech training aids for the deaf can measure only a single speech parameter, or display only frequency spectra as pseudo-color histograms. This study develops a speech training aid that displays the subject's articulation as a cross section of the vocal organs, together with other speech parameters, in a single system, so that the subject can see where to correct. To this end, the speech production mechanism is first assumed to be an AR model, so that the articulatory motions of the vocal organs can be estimated from the speech signal. Next, a vocal tract profile model based on LP analysis is constructed. Using this model, the articulatory motions for Korean vowels are estimated and displayed as vocal tract profile graphics.
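To make the AR-model assumption concrete, the following is a minimal sketch of LP analysis by the autocorrelation method with the Levinson-Durbin recursion. It is a generic textbook formulation, not the authors' implementation; the function names and the toy autocorrelation values are assumptions for illustration.

```python
def autocorrelation(signal, max_lag):
    """Biased autocorrelation r[0..max_lag] of a finite frame."""
    n = len(signal)
    return [sum(signal[t] * signal[t + lag] for t in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations from autocorrelation values r.

    Returns (a, k, err): a[0] == 1 and the prediction is
    x[t] ~ -sum(a[j] * x[t - j] for j in 1..order); k holds the
    reflection coefficients; err is the final prediction error.
    Assumes r[0] > 0 (a non-silent frame).
    """
    a = [1.0] + [0.0] * order
    k = []
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        ki = -acc / err
        k.append(ki)
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + ki * a[i - j]
        new_a[i] = ki
        a = new_a
        err *= (1.0 - ki * ki)
    return a, k, err


# Toy autocorrelation of a first-order AR process with pole 0.5.
r = [1.0, 0.5, 0.25]
a, k, err = levinson_durbin(r, 2)
```

For this toy input the recursion recovers a first-order predictor (the second coefficient is zero), which is the expected behavior when the data really follow a low-order AR model.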

A Comparison of Parameters of Acoustic Vowel Space in Patients with Parkinson's Disease (파킨슨병 환자의 음향 모음 공간 파라미터 비교)

Kang, Young-Ae;Yoon, Kyu-Chul;Lee, Hak-Seung;Seong, Cheol-Jae
- Phonetics and Speech Sciences / v.2 no.4 / pp.185-192 / 2010
The acoustic vowel space has been used as an acoustic parameter in dysarthric speech. The aim of this work was to examine mathematical formulae for acoustic vowel space and to apply these to Korean speakers with idiopathic Parkinson's disease (IPD). Five acoustic parameters were chosen from earlier works and one new parameter was proposed, the pentagonal vowel space. The six parameters included triangular vowel space (3 area), irregular quadrilateral vowel space (4 area), irregular pentagonal vowel space (5 area), vowel articulatory index (VAI), formant centralization ratio (FCR) and F2i/F1u ratio (F2 ratio). An experimental group of 32 IPD patients (male:female = 16:16) and a control group of 20 healthy people (male:female = 8:12) participated in the study and repeated the vowels (/a-i-u-e-o/) three times. A correlation analysis was performed among the six parameters, a 2-way ANOVA was done with gender and group as independent factors, and an independent-samples t-test was conducted between the male and female groups as a post hoc comparison. All parameters were highly correlated with each other, and only the FCR showed a high negative correlation with the others. The ANOVA results showed a significant difference in F2 ratio, 3 area, 4 area and 5 area between genders, and in 4 area and 5 area between groups. For the male members of the two groups, statistically significant differences were found in all parameters, whereas no such differences were found for the female members. These findings indicated that the vowel space of the female group was wider than the vowel space of the male group. These differences may have been caused by gender-specific speech styles rather than by patho-physiological mechanisms. We also claim that the pentagonal vowel space is better than the other vowel spaces at representing disordered speech in natural speech situations.
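The abstract does not give the formulae, so the sketch below uses the definitions commonly cited in the literature: polygon area by the shoelace formula in the (F1, F2) plane, VAI = (F2i + F1a) / (F1i + F1u + F2u + F2a), and FCR as its reciprocal. The formant values are made up for illustration and are not data from the study; the function names are hypothetical.

```python
def polygon_area(points):
    """Shoelace formula. `points` are (F1, F2) vertices listed in
    order around the polygon (3, 4 or 5 corner vowels)."""
    n = len(points)
    twice = sum(points[i][0] * points[(i + 1) % n][1]
                - points[(i + 1) % n][0] * points[i][1]
                for i in range(n))
    return abs(twice) / 2.0


def vai(f):
    """Vowel articulatory index from corner vowels /a/, /i/, /u/.
    f maps a vowel label to its (F1, F2) pair in Hz."""
    return (f["i"][1] + f["a"][0]) / (
        f["i"][0] + f["u"][0] + f["u"][1] + f["a"][1])


def fcr(f):
    """Formant centralization ratio, the reciprocal of the VAI."""
    return (f["u"][1] + f["a"][1] + f["i"][0] + f["u"][0]) / (
        f["i"][1] + f["a"][0])


# Illustrative (F1, F2) values in Hz; not taken from the paper.
formants = {"a": (800.0, 1300.0), "i": (300.0, 2300.0), "u": (350.0, 800.0)}

triangle = polygon_area([formants["a"], formants["i"], formants["u"]])
vai_value = vai(formants)
fcr_value = fcr(formants)
```

Because FCR is the inverse of VAI under these definitions, a negative correlation between FCR and the area-type measures, as the abstract reports, is what one would expect.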

A Study on Speech Recognition using Vocal Tract Area Function (성도 면적 함수를 이용한 음성 인식에 관한 연구)

송제혁;김동준
- Journal of Biomedical Engineering Research / v.16 no.3 / pp.345-352 / 1995
The LPC cepstrum coefficients, which are acoustic features of the speech signal, have been widely used as the feature parameters for various speech recognition systems and have shown good performance. The vocal tract area function is an articulatory feature, which is related to the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. Linear predictive analysis using the Burg algorithm and vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are carried out using the conventional LPC cepstrum coefficients and the vocal tract area function. Recognition using the area function gave slightly better results than recognition using the conventional LPC cepstrum coefficients.
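One common way to obtain an area function from LP analysis is the lossless-tube model, in which each reflection coefficient fixes the ratio of adjacent tube-section areas. The sketch below assumes the convention A[i+1] = A[i] * (1 - k[i]) / (1 + k[i]) with an arbitrary reference area; sign and direction conventions differ between texts, and the abstract does not say which one the paper uses. The function name and example values are hypothetical.

```python
def area_function(reflection, reference_area=1.0):
    """Convert reflection coefficients to relative tube-section areas.

    Assumed convention: A[i+1] = A[i] * (1 - k[i]) / (1 + k[i]),
    starting from `reference_area`. Only area ratios are meaningful;
    the absolute scale is arbitrary.
    """
    areas = [reference_area]
    for k in reflection:
        if not -1.0 < k < 1.0:
            raise ValueError("reflection coefficients must lie in (-1, 1)")
        areas.append(areas[-1] * (1.0 - k) / (1.0 + k))
    return areas


# Illustrative reflection coefficients (e.g. as produced by Burg's method).
areas = area_function([0.0, 0.5, -0.5])
```

A zero coefficient leaves the area unchanged, while coefficients of opposite sign narrow or widen the next section, so the resulting vector can serve as a fixed-length feature in the same way as a cepstrum vector.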

A Study on Combining Bimodal Sensors for Robust Speech Recognition (강인한 음성인식을 위한 이중모드 센서의 결합방식에 관한 연구)

이철우;계영철;고인선
- The Journal of the Acoustical Society of Korea / v.20 no.6 / pp.51-56 / 2001
Recent research has focused on jointly using lip motion and speech for reliable speech recognition in noisy environments. To this end, this paper proposes a method of combining a visual speech recognizer and a conventional speech recognizer, with each output properly weighted. In particular, we propose a method of determining the weights autonomously, depending on the amount of noise in the speech. The correlations between adjacent speech samples and the residual errors of the LPC analysis are used for this determination. Simulation results show that the speech recognizer combined in this way achieves a recognition rate of 83% even in severely noisy environments.
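The weighting idea can be sketched as a late fusion of per-class scores, with the audio weight derived from a simple noise indicator. The abstract does not specify the mapping from noise measures to weights, so the clamped lag-1 correlation used here is an assumption for illustration only, as are the function names and score values.

```python
def lag1_correlation(samples):
    """Normalized correlation between adjacent samples. Clean voiced
    speech tends to give values near 1; white noise pulls it toward 0."""
    energy = sum(x * x for x in samples)
    if energy == 0.0:
        return 0.0
    cross = sum(samples[t] * samples[t + 1] for t in range(len(samples) - 1))
    return cross / energy


def audio_weight(samples):
    """Hypothetical mapping: clamp the lag-1 correlation to [0, 1]."""
    return min(1.0, max(0.0, lag1_correlation(samples)))


def fuse(audio_scores, visual_scores, w):
    """Return the label maximizing w * audio + (1 - w) * visual.
    Scores are assumed to be log-likelihoods over the same labels."""
    combined = {label: w * audio_scores[label] + (1.0 - w) * visual_scores[label]
                for label in audio_scores}
    return max(combined, key=combined.get)


# Illustrative log-likelihoods from the two recognizers.
audio = {"yes": -4.0, "no": -3.5}
visual = {"yes": -2.0, "no": -5.0}

clean_decision = fuse(audio, visual, 0.9)  # audio trusted
noisy_decision = fuse(audio, visual, 0.2)  # visual trusted
```

With a high audio weight the decision follows the acoustic recognizer, and with a low weight it follows the lip-reading recognizer, which is the behavior the weighting scheme is meant to produce as noise increases.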
