Search | Korea Science

Noise Robust Speech Recognition Based on Noisy Speech Acoustic Model Adaptation (잡음음성 음향모델 적응에 기반한 잡음에 강인한 음성인식)

Chung, Yongjoo
- Phonetics and Speech Sciences
- /
- v.6 no.2
- /
- pp.29-34
- /
- 2014
In the Vector Taylor Series (VTS)-based noisy speech recognition methods, Hidden Markov Models (HMM) are usually trained with clean speech. However, better performance is expected by training the HMM with noisy speech. In a previous study, we could find that Minimum Mean Square Error (MMSE) estimation of the training noisy speech in the log-spectrum domain produce improved recognition results, but since the proposed algorithm was done in the log-spectrum domain, it could not be used for the HMM adaptation. In this paper, we modify the previous algorithm to derive a novel mathematical relation between test and training noisy speech in the cepstrum domain and the mean and covariance of the Multi-condition TRaining (MTR) trained noisy speech HMM are adapted. In the noisy speech recognition experiments on the Aurora 2 database, the proposed method produced 10.6% of relative improvement in Word Error Rates (WERs) over the MTR method while the previous MMSE estimation of the training noisy speech produced 4.3% of relative improvement, which shows the superiority of the proposed method.
https://doi.org/10.13064/KSSS.2014.6.2.029 인용 PDF KSCI

A Study on the Effects of Speech Training for Adults Focusing on the Analysis of Voices Before and After Speech Training (성인 스피치교육 전후 효과에 관한 목소리변화스펙트로그램 비교 연구)

Chung, Eun-Ee;Lee, Sang-Ho
- Journal of Digital Contents Society
- /
- v.18 no.6
- /
- pp.1049-1056
- /
- 2017
This study focused on the changes in the voices in determining the effects of speech training. This study aimed to make more visible and scientific evaluation of the changes in the voices among the substantial effects obtained from speech training. As a result, some objective differences from before the speech training could be found in the voice of every learner. Each learner showed gradual technical improvement in a variety of vocal elements, including resonance and timbre, accuracy of pronunciation, pause; that is, the voice became more powerful, more accurate pronounced, more pausing and more stable than before the speech training. This study determined if speech training could change a voice and the results are expected to help speech learners participate actively in speech training and see their speech ability improved.
https://doi.org/10.9728/dcs.2017.18.6.1049 인용 PDF KSCI

Development of Speech Training Aids Using Vocal Tract Profile (조음도를 이용한 발음훈련기기의 개발)

박상희;김동준;이재혁;윤태성
- The Transactions of the Korean Institute of Electrical Engineers
- /
- v.41 no.2
- /
- pp.209-216
- /
- 1992
Deafs train articulation by observing mouth of a tutor, sensing tactually the motions of the vocal organs, or using speech training aids. Present speech training aids for deafs can measure only single speech parameter, or display only frequency spectra in histogram of pseudo-color. In this study, a speech training aids that can display subject's articulation in the form of a cross section of the vocal organs and other speech parameters together in a single system is to be developed and this system makes a subject know where to correct. For our objective, first, speech production mechanism is assumed to be AR model in order to estimate articulatory motions of the vocal organs from speech signal. Next, a vocal tract profile model using LP analysis is made up. And using this model, articulatory motions for Korean vowels are estimated and displayed in the vocal tract profile graphics.
PDF

Design and Implementation of Speech-Training System for Voice Disorders (발성장애아동을 위한 발성훈련시스템 설계 및 구현)

정은순;김봉완;양옥렬;이용주
- Journal of Internet Computing and Services
- /
- v.2 no.1
- /
- pp.97-106
- /
- 2001
In this paper, we design and implement complement based speech training system for voice disorder. The system consists of three level of training: precedent training, training for speech apprehension and training for speech enhancement. To analyze speech of voice disorder, we extracted speech features as loudness, amplitude, pitch using digital signal processing technique. Extracted features are converted to graphic interface for visual feedback of speech by the system.
PDF

Korean Speakers' Pronunciation and Pronunciation Training of English Stops (한국인의 영어 폐쇄음 발화와 발화 훈련)

Kim, Ji-Eun
- Phonetics and Speech Sciences
- /
- v.2 no.3
- /
- pp.29-36
- /
- 2010
The purposes of this study are (1) to see if language transfer effect is found in Korean speakers' pronunciation of English stops and to correct them and (2) to investigate the effectiveness of mimicry training and Speech Analyzer training on subjects' pronunciation of English stops. For these purposes, 20 Korean speakers' VOT values of English stops were measured using Speech Analyzer and their post-training production was compared with their pre-training production. The result shows that Korean speakers have no difficulty in correcting pronunciation errors of English voiceless stops and voiced stops and such a result indicates that language transfer effect is not noticed as expected. In addition, the result of pronunciation training shows that the training using Speech Analyzer is more effective than mimicry training.
PDF

MCE Training Algorithm for a Speech Recognizer Detecting Mispronunciation of a Foreign Language (외국어 발음오류 검출 음성인식기를 위한 MCE 학습 알고리즘)

Bae, Min-Young;Chung, Yong-Joo;Kwon, Chul-Hong
- Speech Sciences
- /
- v.11 no.4
- /
- pp.43-52
- /
- 2004
Model parameters in HMM based speech recognition systems are normally estimated using Maximum Likelihood Estimation(MLE). The MLE method is based mainly on the principle of statistical data fitting in terms of increasing the HMM likelihood. The optimality of this training criterion is conditioned on the availability of infinite amount of training data and the correct choice of model. However, in practice, neither of these conditions is satisfied. In this paper, we propose a training algorithm, MCE(Minimum Classification Error), to improve the performance of a speech recognizer detecting mispronunciation of a foreign language. During the conventional MLE(Maximum Likelihood Estimation) training, the model parameters are adjusted to increase the likelihood of the word strings corresponding to the training utterances without taking account of the probability of other possible word strings. In contrast to MLE, the MCE training scheme takes account of possible competing word hypotheses and tries to reduce the probability of incorrect hypotheses. The discriminant training method using MCE shows better recognition results than the MLE method does.
PDF

Speech training aids for deafs (청각 장애자용 발음 훈련 기기의 개발)

김동준;윤태성;박상희
- 제어로봇시스템학회:학술대회논문집
- /
- 1991.10a
- /
- pp.746-751
- /
- 1991
Deafs train articulation by observing mouth of a tutor. sensing tactually the notions of the vocal organs, or using speech training aids. Present speech training aids for deafs can measure only single speech ter, or display only frequency spectra in histogrm or pseudo-color. In this study, a speech training aids that can display subject's articulation in the form of a cross section of the vocal organs and other speech parameters together in a single system Is aimed to develop and this system makes a subject to know where to correct. For our objective, first, speech production mechanism is assumed to be AR model in order to estimate articulatory notions of the vocal tract from speech signal. Next, a vocal tract profile mode using LPC analysis is made up. And using this model, articulatory notions for Korean vowels are estimated and displayed in the vocal tract profile graphics.
PDF

Effect of Carnatic Music Listening Training on Speech in Noise Performance in Adults

Amemane, Raksha;Gundmi, Archana;Mohan, Kishan Madikeri
- Journal of Audiology & Otology
- /
- v.25 no.1
- /
- pp.22-26
- /
- 2021
Background and Objectives: Music listening has a concomitant effect on structural and functional organization of the brain. It helps in relaxation, mind training and neural strengthening. In relation to it, the present study was aimed to find the effect of Carnatic music listening training (MLT) on speech in noise performance in adults. Subjects and Methods: A total of 28 participants (40-70 years) were recruited in the study. Based on randomized control trial, they were divided into intervention and control group. Intervention group underwent a short-term MLT. Quick Speech-in-Noise in Kannada was used as an outcome measure. Results: Results were analysed using mixed method analysis of variance (ANOVA) and repeated measures ANOVA. There was a significant difference between intervention and control group post MLT. The results of the second continuum revealed no statistically significant difference between post training and follow-up scores in both the groups. Conclusions: In conclusion short-term MLT resulted in betterment of speech in noise performance. MLT can be hence used as a viable tool in formal auditory training for better prognosis.
https://doi.org/10.7874/jao.2020.00255 인용

Effect of Carnatic Music Listening Training on Speech in Noise Performance in Adults

Amemane, Raksha;Gundmi, Archana;Mohan, Kishan Madikeri
- Korean Journal of Audiology
- /
- v.25 no.1
- /
- pp.22-26
- /
- 2021
Background and Objectives: Music listening has a concomitant effect on structural and functional organization of the brain. It helps in relaxation, mind training and neural strengthening. In relation to it, the present study was aimed to find the effect of Carnatic music listening training (MLT) on speech in noise performance in adults. Subjects and Methods: A total of 28 participants (40-70 years) were recruited in the study. Based on randomized control trial, they were divided into intervention and control group. Intervention group underwent a short-term MLT. Quick Speech-in-Noise in Kannada was used as an outcome measure. Results: Results were analysed using mixed method analysis of variance (ANOVA) and repeated measures ANOVA. There was a significant difference between intervention and control group post MLT. The results of the second continuum revealed no statistically significant difference between post training and follow-up scores in both the groups. Conclusions: In conclusion short-term MLT resulted in betterment of speech in noise performance. MLT can be hence used as a viable tool in formal auditory training for better prognosis.
https://doi.org/10.7874/jao.2020.00255 인용

The Effects of Voice and Speech Intelligibility Improvements in Parkinson Disease by Training Loudness and Pitch: A Case Study (강도 및 음도 조절을 이용한 훈련이 파킨슨병 환자의 음성 및 발화명료도 개선에 미치는 효과: 사례연구)

Lee, Ok-Bun;Jeong, Ok-Ran;Ko, Do-Heung
- Speech Sciences
- /
- v.8 no.3
- /
- pp.173-184
- /
- 2001
The purpose of this study was to examine the effects of manipulating loudness and pitch in terms of speech intelligibility and voice of a patient with Parkinson's Disease. The subject, who was diagnosed as a patient with Parkinson's disease 11 years ago, demonstrated a severely breath voice with low intensity. The accuracy of articulation in consonants was intelligible only at the single word level, and the overall intelligibility in continuous speech was low. The results showed that the subject's articulation accuracy and speech intelligibility was significantly improved after having loudness and pitch training. Habitual Fo, Jitter, Shimmer, Fo tremor, Amp tremor were decreased after training. In addition, the value of HNR also increased after training. It was shown that the changes of these acoustic parameters were closely related to the decrease of breathiness in Parkinson's voice, and this decrease of breathiness affected speech intelligibility considerably. Based on the experimental results, it was claimed that the vocal training by manipulating the loudness and pitch could be highly effective in improving the voice quality and speech intelligibility in Parkinson's Disease.
PDF

Search Result 580, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)