Continuous Digit Recognition Using the Weight Initialization and LR Parser

Choi, Ki-Hoon;Lee, Seong-Kwon;Kim, Soon-Hyob;

The Journal of the Acoustical Society of Korea

Volume 15, Issue 2E / Pages 14-23 / 1996 / 1225-4428 (pISSN)

The Acoustical Society of Korea (한국음향학회)


Choi, Ki-Hoon (Computer Engineering Department, Kwang-Woon University) ;
Lee, Seong-Kwon (Computer Engineering Department, Kwang-Woon University) ;
Kim, Soon-Hyob (Computer Engineering Department, Kwang-Woon University)

Published : 1996.06.01


Abstract

This paper presents a study on a neural network for recognizing phonemes, weight initialization to reduce learning time, and an LR parser for continuous speech recognition. The neural network spots the phonemes in continuous speech, and the LR parser parses the output of the neural network. The full set of phonemes to be recognized is divided into several groups according to phoneme similarity, and each group is handled by its own neural network. Each group's network consists of a network that recognizes the phonemes of its own group and a VGNN (Verify Group Neural Network), which judges whether the input belongs to that group. The weights of the neural network are not initialized with random values but are initialized from the learning data to reduce learning time. The LR parsing method applied in this paper does not trace a unique path; it traces several possible paths, because the output of the neural network is not accurate. The parser processes continuous speech frame by frame, accumulating the output of the neural network along each of the possible paths. If the accumulated path value drops below a threshold value, the path is deleted from the set of possible parsing paths. This paper applies the continuous speech recognition system to continuous Korean digit recognition. The recognition rate of isolated digits is 97% in speaker dependent, and 75% in speaker dependent. The recognition rate of continuous digits is 74% in speaker dependent.
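The frame-by-frame path tracking with threshold pruning described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a simple table of allowed phoneme successors stands in for the LR parsing table, the accumulated value is assumed to be a sum of log scores (the abstract does not say how outputs are accumulated), and all names (`track_paths`, `successors`, the example phonemes and scores) are hypothetical.

```python
import math
from typing import Dict, List, Sequence, Tuple

# A path is (phoneme sequence, accumulated log-score). Assumption: scores are
# accumulated as log values, so a path's value can fall below a threshold.
Path = Tuple[Tuple[str, ...], float]


def track_paths(
    frames: Sequence[Dict[str, float]],
    successors: Dict[str, List[str]],
    start_phonemes: Sequence[str],
    threshold: float,
) -> List[Path]:
    """Follow several candidate phoneme paths frame by frame and drop any
    path whose accumulated value falls below `threshold`."""
    paths: List[Path] = []
    for t, frame in enumerate(frames):
        if t == 0:
            extensions = [((p,), 0.0) for p in start_phonemes]
        else:
            extensions = []
            for seq, value in paths:
                extensions.append((seq, value))  # stay in the same phoneme
                for nxt in successors.get(seq[-1], []):
                    extensions.append((seq + (nxt,), value))  # move on
        survivors: Dict[Tuple[str, ...], float] = {}
        for seq, value in extensions:
            score = frame.get(seq[-1], 0.0)
            if score <= 0.0:
                continue  # no support for this phoneme in this frame
            new_value = value + math.log(score)
            if new_value < threshold:
                continue  # prune: accumulated value dropped below threshold
            # Keep only the best value for each distinct phoneme sequence.
            if new_value > survivors.get(seq, -math.inf):
                survivors[seq] = new_value
        paths = list(survivors.items())
    return sorted(paths, key=lambda p: p[1], reverse=True)


# Hypothetical per-frame phoneme scores standing in for network outputs.
example_frames = [
    {"i": 0.9, "l": 0.1},
    {"i": 0.8, "l": 0.2},
    {"i": 0.1, "l": 0.9},
]
example_result = track_paths(
    example_frames, successors={"i": ["l"]}, start_phonemes=["i"], threshold=-2.0
)
print(example_result)  # the only surviving path is ('i', 'l')
```

In this toy run the path that stays in "i" for all three frames is deleted at the last frame, while the path that moves from "i" to "l" survives. A real system would replace the successor table with the LR grammar's parsing actions.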
