
A Real-Time Embedded Speech Recognition System  

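남상엽 (Department of Information and Communication, Kyungmoon College)
전은희 (Department of Electronics Engineering, Dankook University)
박인정 (Department of Electronics Engineering, Dankook University)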
Abstract
In this study, we implemented a real-time embedded speech recognition system that minimizes the memory required for the speech recognition engine and its database. The vocabulary consists of 40 commands used on a PCS phone and the 10 digits. Speech data spoken by 15 male and 15 female speakers were recorded and analyzed with a short-time analysis method using a window size of 256. Before analysis, the speech data were pre-emphasized to remove the DC component and emphasize the high-frequency band. The LPC parameters of each frame were computed with the Levinson-Durbin algorithm and then transformed into cepstrum parameters. The Baum-Welch re-estimation algorithm was used to train the HMMs. In the test phase, the recognition rate was obtained using the likelihood method. We implemented the embedded system by porting the speech recognition engine to an ARM core evaluation board. The overall recognition rate of the system was 95%; the rate was 96% on the 40 commands and 94% on the 10 digits.
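The front end described in the abstract (pre-emphasis, 256-sample short-time frames, LPC via Levinson-Durbin, conversion to cepstrum) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the pre-emphasis coefficient (0.95), the Hamming window, the non-overlapping frames, the LPC order (10), the number of cepstral coefficients (12), and all function names are assumptions that the abstract does not state.

```python
import math

def pre_emphasize(signal, alpha=0.95):
    """First-order filter y[n] = x[n] - alpha * x[n-1] (alpha is an assumed value).
    It attenuates DC and boosts high frequencies."""
    return signal[:1] + [signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))]

def hamming(frame):
    """Apply a Hamming window (an assumed choice; the abstract names no window)."""
    n = len(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1))) for i, x in enumerate(frame)]

def autocorrelation(frame, max_lag):
    """Autocorrelation r[0..max_lag] of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag)) for lag in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LPC normal equations recursively.
    Returns (a, err), where x[n] is predicted as sum_k a[k-1] * x[n-k]
    and err is the final prediction-error energy."""
    a = [0.0] * (order + 1)  # a[0] is unused; a[j] holds coefficient j
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient for stage i
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err

def lpc_to_cepstrum(a, n_ceps):
    """Convert predictor coefficients to cepstral coefficients c[1..n_ceps]
    with the standard recursion c_m = a_m + sum_k (k/m) * c_k * a_(m-k)."""
    p = len(a)
    c = [0.0] * (n_ceps + 1)
    for m in range(1, n_ceps + 1):
        acc = a[m - 1] if m <= p else 0.0
        for k in range(max(1, m - p), m):
            acc += (k / m) * c[k] * a[m - k - 1]
        c[m] = acc
    return c[1:]

def frame_cepstra(signal, frame_len=256, order=10, n_ceps=12):
    """Return one LPC-cepstrum vector per non-overlapping frame of the signal."""
    emphasized = pre_emphasize(signal)
    features = []
    for start in range(0, len(emphasized) - frame_len + 1, frame_len):
        frame = hamming(emphasized[start:start + frame_len])
        r = autocorrelation(frame, order)
        if r[0] <= 0.0:
            continue  # skip all-zero (silent) frames to avoid dividing by zero
        a, _ = levinson_durbin(r, order)
        features.append(lpc_to_cepstrum(a, n_ceps))
    return features
```

In a recognizer of the kind described, each word's HMM would be trained on sequences of such vectors, and a test utterance would be assigned to the model with the highest likelihood.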
Keywords
Embedded system; linear prediction coefficients; speech recognition