통합 검색 | Korea Science

조음 합성과 연결 합성 방식을 결합한 개선된 문서-음성 합성 시스템 (Improved Text-to-Speech Synthesis System Using Articulatory Synthesis and Concatenative Synthesis)

이근희;김동주;홍광석
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
- /
- pp.369-372
- /
- 2002
In this paper, we present an improved TTS synthesis system using articulatory synthesis and concatenative synthesis. In concatenative synthesis, segments of speech are excised from spoken utterances and connected to form the desired speech signal. We adopt LPC as a parameter, VQ to reduce the memory capacity, and TD-PSOLA to solve the naturalness problem.
PDF

조음도를 이용한 발음훈련기기의 개발 (Development of Speech Training Aids Using Vocal Tract Profile)

박상희;김동준;이재혁;윤태성
- 대한전기학회논문지
- /
- 제41권2호
- /
- pp.209-216
- /
- 1992
Deafs train articulation by observing mouth of a tutor, sensing tactually the motions of the vocal organs, or using speech training aids. Present speech training aids for deafs can measure only single speech parameter, or display only frequency spectra in histogram of pseudo-color. In this study, a speech training aids that can display subject's articulation in the form of a cross section of the vocal organs and other speech parameters together in a single system is to be developed and this system makes a subject know where to correct. For our objective, first, speech production mechanism is assumed to be AR model in order to estimate articulatory motions of the vocal organs from speech signal. Next, a vocal tract profile model using LP analysis is made up. And using this model, articulatory motions for Korean vowels are estimated and displayed in the vocal tract profile graphics.
PDF

파킨슨병 환자의 음향 모음 공간 파라미터 비교 (A Comparison of Parameters of Acoustic Vowel Space in Patients with Parkinson's Disease)

강영애;윤규철;이학승;성철재
- 말소리와 음성과학
- /
- 제2권4호
- /
- pp.185-192
- /
- 2010
The acoustic vowel space has been used as an acoustic parameter in dysarthric speech. The aim of this work was to examine mathematical formulae for acoustic vowel space and to apply these to Korean speakers with idiopathic Parkinson's disease(IPD). Five acoustic parameters were chosen from earlier works and one new parameter was proposed, the pentagonal vowel space. The six parameters included triangular vowel space (3 area), irregular quadrilateral vowel space (4 area), irregular pentagonal vowel space (5 area), vowel articulatory index (VAI), formant centralization ratio (FCR) and F2i/F1u ratio (F2 ratio). An experimental group of 32 IPD patients(male:female=16:16) and a control group of twenty healthy people (male:female=8:12) participated in the study and repeated vowels (/a-i-u-e-o/) three times. A correlation analysis was performed among the six parameters, 2-way ANOVA was done with gender and groups as independent factors, and an independent sample t-test was conducted between the male and the female group as post hoc comparison. All parameters were highly correlated with each other and only the FCR showed a high negative correlation with the others. The results of ANOVA showed a significant difference in F2 ratio, 3 area, 4 area and 5 area between gender and in 4 area and 5 area between groups. For the male members of the two groups, significant statistical differences were found in all parameters whereas no such differences were found for the female members. These findings indicated that the vowel space of the female group was wider than the vowel space of the male group. These differences may have been caused by gender-specific speech styles rather than by patho-physiological mechanisms. We also claim that the pentagonal vowel space is better than the other vowel spaces at representing the disordered speech in natural speech situations.
PDF

성도 면적 함수를 이용한 음성 인식에 관한 연구 (A Study on Speech Recognition using Vocal Tract Area Function)

송제혁;김동준
- 대한의용생체공학회:의공학회지
- /
- 제16권3호
- /
- pp.345-352
- /
- 1995
The LPC cepstrum coefficients, which are an acoustic features of speech signal, have been widely used as the feature parameter for various speech recognition systems and showed good performance. The vocal tract area function is a kind of articulatory feature, which is related with the physiological mechanism of speech production. This paper proposes the vocal tract area function as an alternative feature parameter for speech recognition. The linear predictive analysis using Burg algorithm and the vector quantization are performed. Then, recognition experiments for 5 Korean vowels and 10 digits are executed using the conventional LPC cepstrum coefficients and the vocal tract area function. The recognitions using the area function showed the slightly better results than those using the conventional LPC cepstrum coefficients.
PDF

강인한 음성인식을 위한 이중모드 센서의 결합방식에 관한 연구 (A Study on Combining Bimodal Sensors for Robust Speech Recognition)

이철우;계영철;고인선
- 한국음향학회지
- /
- 제20권6호
- /
- pp.51-56
- /
- 2001
최근 잡음이 심한 환경에서 음성인식을 신뢰성있게 하기 위하여 입모양의 움직임과 음성을 같이 사용하는 방법이 활발히 연구되고 있다 본 논문에서도 이러한 목적으로 영상언어인식기와 음성인식기의 결과에 각각 가중치를 주어 결합하는 방법을 제안한다. 특히 가중치를 입력음성의 잡음의 정도에 따라 자동적으로 결정하는 방법을 제안한다. 가중치의 결정을 위하여 입력샘플간의 상관도와 LPC분석의 잔여 오차를 이용한다. 모의실험 결과, 이런 방식으로 결합된 인식기는 잡음이 심한 환경에서도 약 83%의 인식성능을 보이고 있다.
PDF

검색결과 5건 처리시간 0.017초

조음 합성과 연결 합성 방식을 결합한 개선된 문서-음성 합성 시스템 (Improved Text-to-Speech Synthesis System Using Articulatory Synthesis and Concatenative Synthesis)

조음도를 이용한 발음훈련기기의 개발 (Development of Speech Training Aids Using Vocal Tract Profile)

파킨슨병 환자의 음향 모음 공간 파라미터 비교 (A Comparison of Parameters of Acoustic Vowel Space in Patients with Parkinson's Disease)

성도 면적 함수를 이용한 음성 인식에 관한 연구 (A Study on Speech Recognition using Vocal Tract Area Function)

강인한 음성인식을 위한 이중모드 센서의 결합방식에 관한 연구 (A Study on Combining Bimodal Sensors for Robust Speech Recognition)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)