A Study on the Algorithm Development for Speech Recognition of Korean and Japanese

Lee, Sung-Hwa;Kim, Hyung-Lae;

Journal of IKEEE (전기전자학회논문지)

Volume 2 Issue 1 Serial No. 2
/
Pages.61-67
/
1998
/
1226-7244(pISSN)
/
2288-243X(eISSN)

Institute of Korean Electrical and Electronics Engineers (한국전기전자학회)

A Study on the Algorithm Development for Speech Recognition of Korean and Japanese

한국어와 일본어의 음성 인식을 위한 알고리즘 개발에 관한 연구

Lee, Sung-Hwa (Dept. of Electronic Eng. KonKuk Univ.) ;
Kim, Hyung-Lae (Dept. of Electronic Eng. KonKuk Univ.)

이성화 (건국대학교 전자공학과) ;
김병래 (건국대학교 전자공학과)

Published : 1998.08.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

In this thesis, experiment have performed with the speaker recognition using multilayer feedforward neural network(MFNN) model using Korean and Japanese digits . The 5 adult males and 5 adult females pronounciate form 0 to 9 digits of Korean, Japanese 7 times. And then, they are extracted characteristics coefficient through Pitch deletion algorithm, LPC analysis, and LPC Cepstral analysis to generate input pattern of MFNN. 5 times among them are used to train a neural network, and 2 times is used to measure the performance of neural network. Both Korean and Japanese, Pitch coefficients is about 4%t more enhanced than LPC or LPC Cepstral coefficients.

본 연구에서는 다층 순방향 신경망(MFNN) 모델을 이용해서 한국어 및 일본어 숫자음 인식 실험을 수행하였다. 각각 5명의 한국인 남성 및 여성 화자가 0부터 9까지의 10개의 숫자를 7회 발음토록 하였고, 그중 2회 발음한 것을 인식 실험에 사용하였다. 이들 음성 데이터로부터 각각 추출된 피치 계수, 선형 예측 계수, 선형 예측 켑스트럼 계수들을 신경망의 입력 패턴으로 입력시켜 인식 성능을 측정하였다. 한국어를 사용한 실험과 일본어를 사용한 실험 모두에서 피치 계수를 사용하는 것이 다른 계수를 사용하는 것보다 약 4% 정도 우수한 성능을 나타내었다.

Keywords

References

デイジタル音聲處理古井貞熙
Text-Independent Speaker Identification Gish, H.;Schmidt, M.
Proc. of the IEEE v.63 no.4 Linear Prediction : A Tutorial Review Marhoul, J.
IEEE Trans. ASSP no.Apr. Direct Relations Between Cepstrum and Predictor Coefficients Schroeder, M.R.
Discrete-time Processing of Speech Signals Deller, J.R.;Proakis, J.G.;Hansen, J.H.
J. Acoust. Soc. Am. v.46 no.2 Parallel Processing Techniques for Estimating Pitch Periods of Speech in Time Domain Gold, B.;Rabbiner, L.R.
Digital Neural Networks Kung, S.Y.
Neural Networks in C++ Blum, Adam
Eurospeech Speaker Recognition Experiments in Estonian using Multi Layer Feed-Forward Neural Nets Altosaar, T.;Meister, E.

Journal of IKEEE (전기전자학회논문지)

A Study on the Algorithm Development for Speech Recognition of Korean and Japanese

한국어와 일본어의 음성 인식을 위한 알고리즘 개발에 관한 연구

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)