Search | Korea Science

Real-time Handwriting Recognizer based on Partial Learning Applicable to Embedded Devices (임베디드 디바이스에 적용 가능한 부분학습 기반의 실시간 손글씨 인식기)

Kim, Young-Joo;Kim, Taeho
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.24 no.5
- /
- pp.591-599
- /
- 2020
Deep learning is widely utilized to classify or recognize objects of real-world. An abundance of data is trained on high-performance computers and a trained model is generated, and then the model is loaded in an inferencer. The inferencer is used in various environments, so that it may cause unrecognized objects or low-accuracy objects. To solve this problem, real-world objects are collected and they are trained periodically. However, not only is it difficult to immediately improve the recognition rate, but is not easy to learn an inferencer on embedded devices. We propose a real-time handwriting recognizer based on partial learning on embedded devices. The recognizer provides a training environment which partially learn on embedded devices at every user request, and its trained model is updated in real time. As this can improve intelligence of the recognizer automatically, recognition rate of unrecognized handwriting increases. We experimentally prove that learning and reasoning are possible for 22 numbers and letters on RK3399 devices.
https://doi.org/10.6109/jkiice.2020.24.5.591 인용 PDF KSCI

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

Bae Hyojoon;Jung Sungyun;Bae Keunsung
- MALSORI
- /
- no.52
- /
- pp.111-120
- /
- 2004
This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle several hundred words of vocabulary size as well as speaker independency. First, we develop an HMM-based speech recognition system on the PC that operates on the frame basis with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Many techniques such as linear discriminant analysis, state-based Gaussian selection, and phonetic tied mixture model are employed for reduction of computational burden and memory size. The system is then properly optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486kbytes of memory for data and acoustic models, and 24.5 kbytes for program code. Maximum required time of 29.2 ms for processing a frame of 32 ms of speech validates real-time operation of the implemented system.
PDF

Implementation of HMM-Based Speech Recognizer Using TMS320C6711 DSP

Bae Hyojoon;Jung Sungyun;Son Jongmok;Kwon Hongseok;Kim Siho;Bae Keunsung
- Proceedings of the IEEK Conference
- /
- summer
- /
- pp.391-394
- /
- 2004
This paper focuses on the DSP implementation of an HMM-based speech recognizer that can handle several hundred words of vocabulary size as well as speaker independency. First, we develop an HMM-based speech recognition system on the PC that operates on the frame basis with parallel processing of feature extraction and Viterbi decoding to make the processing delay as small as possible. Many techniques such as linear discriminant analysis, state-based Gaussian selection, and phonetic tied mixture model are employed for reduction of computational burden and memory size. The system is then properly optimized and compiled on the TMS320C6711 DSP for real-time operation. The implemented system uses 486kbytes of memory for data and acoustic models, and 24.5kbytes for program code. Maximum required time of 29.2ms for processing a frame of 32ms of speech validates real-time operation of the implemented system.
PDF

Implementation of HMM Based Speech Recognizer with Medium Vocabulary Size Using TMS320C6201 DSP (TMS320C6201 DSP를 이용한 HMM 기반의 음성인식기 구현)

Jung, Sung-Yun;Son, Jong-Mok;Bae, Keun-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.1E
- /
- pp.20-24
- /
- 2006
In this paper, we focused on the real time implementation of a speech recognition system with medium size of vocabulary considering its application to a mobile phone. First, we developed the PC based variable vocabulary word recognizer having the size of program memory and total acoustic models as small as possible. To reduce the memory size of acoustic models, linear discriminant analysis and phonetic tied mixture were applied in the feature selection process and training HMMs, respectively. In addition, state based Gaussian selection method with the real time cepstral normalization was used for reduction of computational load and robust recognition. Then, we verified the real-time operation of the implemented recognition system on the TMS320C6201 EVM board. The implemented recognition system uses memory size of about 610 kbytes including both program memory and data memory. The recognition rate was 95.86% for ETRI 445DB, and 96.4%, 97.92%, 87.04% for three kinds of name databases collected through the mobile phones.
PDF KSCI

Design and Implementation of Hand Gesture Recognizer Based on Artificial Neural Network (인공신경망 기반 손동작 인식기의 설계 및 구현)

Kim, Minwoo;Jeong, Woojae;Cho, Jaechan;Jung, Yunho
- Journal of Advanced Navigation Technology
- /
- v.22 no.6
- /
- pp.675-680
- /
- 2018
In this paper, we propose a hand gesture recognizer using restricted coulomb energy (RCE) neural network, and present hardware implementation results for real-time learning and recognition. Since RCE-NN has a flexible network architecture and real-time learning process with low complexity, it is suitable for hand recognition applications. The 3D number dataset was created using an FPGA-based test platform and the designed hand gesture recognizer showed 98.8% recognition accuracy for the 3D number dataset. The proposed hand gesture recognizer is implemented in Intel-Altera cyclone IV FPGA and confirmed that it can be implemented with 26,702 logic elements and 258Kbit memory. In addition, real-time learning and recognition verification were performed at an operating frequency of 70MHz.
https://doi.org/10.12673/jant.2018.22.6.675 인용 PDF KSCI HTML

Implementation of Speech Recognizer using DSP(Digital Signal Processor) (DSP를 이용한 음성인식기 구현)

임창환;문철홍;전경남
- Proceedings of the IEEK Conference
- /
- 2000.11d
- /
- pp.187-190
- /
- 2000
In this paper, implementation of speech Recognizer system, Separated from Personal computer. By using DSP, this intends to extend the voice recognizing, limited into PC because of amount of data and calculations. For this performance The thesis uses the real time End point detector and organizes no additional device between human and the system, characteristic vector are that detects End point and voice from absolute energy and ZCR, that uses 12 difference Cepstrum from LPC, that uses the method to compensate the process of pattern separating and pre-calculated standard pattern limitation.
PDF

Development of a Speech Recognizer on PDAs (PDA 기반 음성 인식기 개발)

Koo Myoung-Wan;Park Sung-Joon;Son Dan-Young;Han Ki-Soo
- Proceedings of the KSPS conference
- /
- 2006.05a
- /
- pp.33-36
- /
- 2006
This paper describes a speech recognizer implemented on PDAs. The recognizer consists of feature extraction module, search module and utterance verification module. It can recognize 37 words that can be used in the telematics application and fixed-point operation is performed for real-time processing. Simulation results show that recognition accuracy is 94.5% for the in-vocabulary words and 56.8% for the out-of-task words.
PDF

Robust Speech Segmentation Method in Noise Environment for Speech Recognizer (음성인식기 구현을 위한 잡음에 강인한 음성구간 검출기법)

김창근;박정원;권호민;허강인
- Journal of the Institute of Convergence Signal Processing
- /
- v.4 no.2
- /
- pp.18-24
- /
- 2003
One of the most important subjects in the implementation of real time speech recognizer is to design both reliable VAD(Voice Activity Detection) and suitable speech feature vector. But, because it is difficult to calculate reliable VAD in the environment having surrounding noise, designed suitable speech feature vector may not be obtained. Solving this problem, in this paper, we implement not only short time power spectrum which is generally used but also two additive parameters, the comparison measure of spectrum density having robust property in noise and linear discriminant function using linear regression, then perform VAD by using the combination of each parameter having apt weight in other magnitudes of surrounding noise and confirm that proposed parameters show a robust characteristic in circumstances having surrounding noise by using DTW(Dynamic Time Waning) in recognition experiment.
PDF

Implementation of Speaker Independent Speech Recognition System Using Independent Component Analysis based on DSP (독립성분분석을 이용한 DSP 기반의 화자 독립 음성 인식 시스템의 구현)

김창근;박진영;박정원;이광석;허강인
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.8 no.2
- /
- pp.359-364
- /
- 2004
In this paper, we implemented real-time speaker undependent speech recognizer that is robust in noise environment using DSP(Digital Signal Processor). Implemented system is composed of TMS320C32 that is floating-point DSP of Texas Instrument Inc. and CODEC for real-time speech input. Speech feature parameter of the speech recognizer used robust feature parameter in noise environment that is transformed feature space of MFCC(met frequency cepstral coefficient) using ICA(Independent Component Analysis) on behalf of MFCC. In recognition result in noise environment, we hew that recognition performance of ICA feature parameter is superior than that of MFCC.
PDF KSCI

An Implementation of the Real Time Speech Recognition for the Automatic Switching System (자동 교환 시스템을 위한 실시간 음성 인식 구현)

박익현;이재성;김현아;함정표;유승균;강해익;박성현
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.4
- /
- pp.31-36
- /
- 2000
This paper describes the implementation and the evaluation of the speech recognition automatic exchange system. The system provides government or public offices, companies, educational institutions that are composed of large number of members and parts with exchange service using speech recognition technology. The recognizer of the system is a Speaker-Independent, Isolated-word, Flexible-Vocabulary recognizer based on SCHMM(Semi-Continuous Hidden Markov Model). For real-time implementation, DSP TMS320C32 made in Texas Instrument Inc. is used. The system operating terminal including the diagnosis of speech recognition DSP and the alternation of speech recognition candidates makes operation easy. In this experiment, 8 speakers pronounced words of 1,300 vocabulary related to automatic exchange system over wire telephone network and the recognition system achieved 91.5% of word accuracy.
PDF

Search Result 32, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)