Search | Korea Science

Distortion in Visual Memory for Wide-angle Image (광각 이미지에 대한 시각적 기억의 왜곡)

Jang, Phil-Sik
- Journal of the Ergonomics Society of Korea / v.26 no.3 / pp.11-16 / 2007
Viewers remember seeing more of the scene than was present in the physical input: an illusion known as boundary extension. This study examined aspects of this distortion by presenting 69 subjects with wide-angle views of four scenes. Results of the recognition and reproduction tests showed that boundary extension is not a unidirectional phenomenon. On the contrary, boundary restriction and foreground extension were observed with extreme wide-angle views of scenes. The results support the hypothesis that boundary restriction and foreground extension were mediated by the activation of a memory schema during picture perception.
https://doi.org/10.5143/JESK.2007.26.3.011

Implementation of a Speaker-independent Speech Recognizer Using the TMS320F28335 DSP (TMS320F28335 DSP를 이용한 화자독립 음성인식기 구현)

Chung, Ik-Joo
- Journal of Industrial Technology / v.29 no.A / pp.95-100 / 2009
In this paper, we implemented a speaker-independent speech recognizer using the TMS320F28335 DSP, which is optimized for control applications. For this implementation, we used a small commercial DSP module and developed a peripheral board including a codec, signal conditioning circuits and I/O interfaces. The speech signal digitized by the TLV320AIC23 codec is analyzed using the MFCC feature extraction method and recognized using a continuous-density HMM. Thanks to the internal SRAM and flash memory on the TMS320F28335 DSP, we did not need any external memory devices. The internal flash memory contains ADPCM data for voice response as well as HMM data. Since the TMS320F28335 DSP is optimized for control applications, the recognizer may be well suited to voice-activated control, in that it can integrate speech recognition capability and inherent control functions into a single DSP.

Implementation of Symmetric Three Layered Network for Large Capacity Optical Associative Memory (대용량 광 연상기억을 위한 대칭 삼층구조의 구현)

서호형;이상수
- Korean Journal of Optics and Photonics / v.3 no.3 / pp.191-197 / 1992
We have developed a new optical associative memory system based on the symmetric three-layered neural network model, using two holograms and an LCLV. In the experiment, four Korean alphabet letters (ㄹ, ㅅ, ㅇ, ㅈ) are used as memory patterns. The results are compared with those of the two-layered network and the Hopfield models. The results show that a recognition ability of more than 95% is obtained for inputs with an error rate of less than 12%.

Compressed Ensemble of Deep Convolutional Neural Networks with Global and Local Facial Features for Improved Face Recognition (얼굴인식 성능 향상을 위한 얼굴 전역 및 지역 특징 기반 앙상블 압축 심층합성곱신경망 모델 제안)

Yoon, Kyung Shin;Choi, Jae Young
- Journal of Korea Multimedia Society / v.23 no.8 / pp.1019-1029 / 2020
In this paper, we propose a novel knowledge distillation algorithm to create a compressed deep ensemble network that combines local and global features of face images. To transfer the high-level recognition performance of the ensemble deep networks to a single deep network, the class prediction probability, which is the softmax output of the ensemble network, is used as the soft target for training the single deep network. By applying the knowledge distillation algorithm, the local feature information obtained by training the deep ensemble network on facial subregions of the face image is transferred to a single deep network to create a so-called compressed ensemble DCNN. The experimental results demonstrate that the proposed compressed ensemble deep network maintains the recognition performance of the complex ensemble deep networks and outperforms a single deep network. In addition, the proposed method can significantly reduce the storage (memory) space and execution time compared to conventional ensemble deep networks developed for face recognition.
https://doi.org/10.9717/kmms.2020.23.8.1019
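
The soft-target idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: averaging the members' softmax outputs, the `temperature` parameter, and all function names are assumptions introduced here for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax; `temperature` is an assumed knob, not from the abstract."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_soft_target(member_logits, temperature=1.0):
    """Average the softmax outputs of the ensemble members (assumed combination rule)."""
    probs = [softmax(logits, temperature) for logits in member_logits]
    n_members = len(probs)
    n_classes = len(probs[0])
    return [sum(p[k] for p in probs) / n_members for k in range(n_classes)]


def distillation_loss(student_logits, soft_target, temperature=1.0):
    """Cross-entropy between the ensemble soft target and the student's softmax output."""
    student_probs = softmax(student_logits, temperature)
    return -sum(t * math.log(q) for t, q in zip(soft_target, student_probs) if t > 0)


# Hypothetical logits from two ensemble members for a three-class problem.
target = ensemble_soft_target([[2.0, 0.5, 0.1], [1.8, 0.7, 0.2]])
loss_close = distillation_loss([2.0, 0.6, 0.1], target)  # student agrees with the ensemble
loss_far = distillation_loss([0.1, 0.6, 2.0], target)    # student disagrees
```

A student whose output distribution is closer to the ensemble's soft target receives the lower loss, which is the signal a training loop would minimize.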

Online Character Recognition Technique Using PCA (PCA를 이용한 온라인 문자인식 기법)

Yoo Jae-Man;Kim Woo-Saeng;Han Jeong-Hoon
- Journal of Korea Multimedia Society / v.9 no.4 / pp.414-420 / 2006
Online character recognition techniques have been applied in many new fields, such as PDAs and Tablet PCs. However, the recognition techniques cannot yet make natural use of such advanced technologies. The Hidden Markov Model (HMM), which has been widely used recently, requires large memory space and complex computation because it compares the input data with all of the standard patterns. In this paper we propose a method to recognize online characters more efficiently. First, we create chain codes of the learning data and recognition data in the preprocessing phase; then we reduce the dimensionality of the data using Principal Component Analysis (PCA) and recognize characters from the compressed data in the recognition phase. The validity of the proposed method is verified by experimental results.
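
As a rough illustration of the chain-code preprocessing step mentioned in this abstract, the sketch below quantizes the direction of each pen movement into eight sectors (a Freeman-style chain code). The abstract does not specify the exact coding scheme, so the eight-direction choice, the mathematical y-up axis convention, and the function name `chain_code` are assumptions; the PCA step is not shown.

```python
import math


def chain_code(points, directions=8):
    """Quantize each pen movement between consecutive points into a direction code.

    Code 0 points along +x and codes increase counter-clockwise (y-up convention assumed).
    """
    sector = 2 * math.pi / directions
    codes = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = x1 - x0, y1 - y0
        if dx == 0 and dy == 0:
            continue  # repeated sample: no movement, so no direction to encode
        angle = math.atan2(dy, dx) % (2 * math.pi)
        # Shift by half a sector so each code is centered on its direction.
        codes.append(int((angle + sector / 2) // sector) % directions)
    return codes


# Hypothetical stroke tracing a unit square counter-clockwise.
square = [(0, 0), (1, 0), (1, 1), (0, 1), (0, 0)]
square_codes = chain_code(square)
```

For the square stroke, the four moves (right, up, left, down) map to codes 0, 2, 4 and 6.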

Monophone and Biphone Compound Unit for Korean Vocabulary Speech Recognition (한국어 어휘 인식을 위한 혼합형 음성 인식 단위)

이기정;이상운;홍재근
- Journal of the Korea Computer Industry Society / v.2 no.6 / pp.867-874 / 2001
In this paper, considering the pronunciation characteristics of Korean, we suggest recognition units that can shorten the recognition time and reflect the coarticulation effect at the same time. These units are composed of monophone and biphone units. Monophone units are applied to vowels, which have stable characteristics. Biphone units are used for consonants, which vary according to the adjacent vowel. In a word recognition experiment on the PBW445 database, the compound units yield comparable recognition accuracy with a 57% speed-up compared with triphone units, and better recognition accuracy at similar speed. In addition, the memory size can be reduced because fewer units are needed.

Korean continuous digit speech recognition by multilayer perceptron using KL transformation (KL 변환을 이용한 multilayer perceptron에 의한 한국어 연속 숫자음 인식)

박정선;권장우;권정상;이응혁;홍승홍
- Journal of the Korean Institute of Telematics and Electronics B / v.33B no.8 / pp.105-113 / 1996
In this paper, a new Korean digit speech recognition technique using a multilayer perceptron (MLP) is proposed. In spite of its weakness in dynamic signal recognition, the MLP was adopted for this model because Korean syllables can provide static features. The MLP was used in the suggested system because it is simple in structure and fast to compute. The MLP's input vectors were transformed using the Karhunen-Loeve transformation (KLT), which compresses the signal successfully without losing its separability, although its physical properties are changed. Because the suggested technique can extract static features without being affected by changes in syllable length, it is effective for a Korean numeric recognition system. Using the KLT, we can save computation time and memory without decreasing classification rates. The proposed feature extraction technique extracts features of the same size from two parts of a syllable, the front and the end. This technique forms the frames from which features are extracted using windows of a single size. It could be applied to continuous speech recognition, which has not been easy for conventional neural network recognition systems.

Study on Fast-Changing Mixed-Modulation Recognition Based on Neural Network Algorithms

Jing, Qingfeng;Wang, Huaxia;Yang, Liming
- KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.12 / pp.4664-4681 / 2020
Modulation recognition (MR) plays a key role in cognitive radar, cognitive radio, and some other civilian and military fields. While existing methods can identify the signal modulation type by extracting the signal characteristics, the quality of feature extraction has a serious impact on the recognition results. In this paper, an end-to-end MR method based on long short-term memory (LSTM) and the gated recurrent unit (GRU) is put forward, which can directly predict the modulation type from a sampled signal. Additionally, the sliding window method is applied to fast-changing mixed-modulation signals for which the signal modulation type changes over time. The recognition accuracy on training datasets in different SNR ranges and the proportion of each modulation method in misclassified samples are analyzed, and it is found to be reasonable to select the evenly-distributed and full range of SNR data as the training data. With the improvement of the SNR, the recognition accuracy increases rapidly. When the length of the training dataset increases, the neural network recognition effect is better. The loss function value of the neural network decreases with the increase of the training dataset length, and then tends to be stable. Moreover, when the fast-changing period is less than 20ms, the error rate is as high as 50%. As the fast-changing period is increased to 30ms, the error rates of the GRU and LSTM neural networks are less than 5%.
https://doi.org/10.3837/tiis.2020.12.003
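
The sliding window method named in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the window length, the step, and the `classify` callable (a stand-in for a trained LSTM or GRU model) are hypothetical and do not come from the paper.

```python
def sliding_windows(samples, window, step):
    """Yield (start_index, segment) pairs of fixed-length windows over the signal."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    for start in range(0, len(samples) - window + 1, step):
        yield start, samples[start:start + window]


def label_over_time(samples, window, step, classify):
    """Apply a per-window classifier so the predicted label can change over time."""
    return [(start, classify(segment))
            for start, segment in sliding_windows(samples, window, step)]


def toy_classifier(segment):
    """Hypothetical stand-in for a trained network: thresholds the window mean."""
    return "A" if sum(segment) / len(segment) < 5 else "B"


# Toy signal whose "modulation type" changes partway through.
signal = list(range(10))
labels = label_over_time(signal, window=4, step=2, classify=toy_classifier)
```

Each window is classified independently, so a change of modulation type inside the signal shows up as a change of label between neighboring windows.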

Activity recognition of stroke-affected people using wearable sensor

Anusha David;Rajavel Ramadoss;Amutha Ramachandran;Shoba Sivapatham
- ETRI Journal / v.45 no.6 / pp.1079-1089 / 2023
Stroke is one of the leading causes of long-term disability worldwide, placing huge burdens on individuals and society. Further, automatic human activity recognition is a challenging task that is vital to the future of healthcare and physical therapy. Using a baseline long short-term memory recurrent neural network, this study provides a novel dataset of stretching, upward stretching, flinging motions, hand-to-mouth movements, swiping gestures, and pouring motions for improved model training and testing of stroke-affected patients. A MATLAB application is used to output textual and audible prediction results. A wearable sensor with a triaxial accelerometer is used to collect preprocessed real-time data. The model is trained with features extracted from the actual patient to recognize new actions, and the recognition accuracy provided by multiple datasets is compared based on the same baseline model. When training and testing using the new dataset, the baseline model shows recognition accuracy that is 11% higher than the Activity Daily Living dataset, 22% higher than the Activity Recognition Single Chest-Mounted Accelerometer dataset, and 10% higher than another real-world dataset.
https://doi.org/10.4218/etrij.2022-0242

Bi-directional LSTM-CNN-CRF for Korean Named Entity Recognition System with Feature Augmentation (자질 보강과 양방향 LSTM-CNN-CRF 기반의 한국어 개체명 인식 모델)

Lee, DongYub;Yu, Wonhee;Lim, HeuiSeok
- Journal of the Korea Convergence Society / v.8 no.12 / pp.55-62 / 2017
A Named Entity Recognition system recognizes words or phrases in a document that carry object names, such as personal names (PS), place names (LC), and group names (OG), and labels them with the corresponding object names. Traditional approaches to named entity recognition include statistical models trained on hand-crafted features. Recently, it has been proposed to construct the features that represent a sentence using deep-learning models such as Recurrent Neural Networks (RNN) and long short-term memory (LSTM) to solve the sequence labeling problem. In this research, to improve the performance of the Korean named entity recognition system, we used hand-crafted features, part-of-speech tagging information, and pre-built lexicon information to augment the features that represent a sentence. Experimental results show that the proposed method improves the performance of the Korean named entity recognition system. The results of this study are made available on GitHub for future collaborative research with researchers studying Korean Natural Language Processing (NLP) and named entity recognition systems.
https://doi.org/10.15207/JKCS.2017.8.12.055

