Search | Korea Science

Improvement of Gesture Recognition using 2-stage HMM (2단계 히든마코프 모델을 이용한 제스쳐의 성능향상 연구)

Jung, Hwon-Jae;Park, Hyeonjun;Kim, Donghan
- Journal of Institute of Control, Robotics and Systems / v.21 no.11 / pp.1034-1037 / 2015
In recent years, the field of robotics has developed various methods for building a closer relationship between people and robots. These methods include speech, vision, and biometric recognition as well as gesture-based interaction, and they are used for convenience in wearable devices, smartphones, and other electronic devices. Among these technologies, gesture recognition is the most commonly used and the most appropriate for wearable devices. Gesture recognition can be classified as contact or noncontact. This paper proposes contact gesture recognition with IMU and EMG sensors that applies the hidden Markov model (HMM) twice. In the first stage, several simple behaviors are combined into main gestures; this stage is the standard HMM process that is well known in pattern recognition. In the second stage, the sequence of main gestures produced by the first stage is combined into higher-order gestures. In this way, more natural and intelligent gestures can be built from simple ones. This two-stage process can play a larger role in gesture recognition-based UX for many wearable and smart devices.
https://doi.org/10.5302/J.ICROS.2015.15.0089
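The two-stage idea in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes discrete observation symbols (for example, quantized sensor codes), and the gesture names, model parameters, and helper functions (`forward_loglik`, `classify`, `two_stage`) are all hypothetical. Stage one labels each segment with the main gesture whose HMM scores it highest; stage two treats the resulting label sequence as a new observation sequence and classifies it with a second set of HMMs.

```python
import math

def forward_loglik(obs, start, trans, emit):
    """Log-likelihood of a discrete observation sequence under one HMM
    (forward algorithm with per-step scaling to avoid underflow)."""
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][obs[t]]
                     for s in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return loglik

def classify(obs, models):
    """Return the label of the HMM that gives obs the highest likelihood."""
    return max(models, key=lambda name: forward_loglik(obs, *models[name]))

def two_stage(segments, stage1, label_ids, stage2):
    """Stage 1: label each segment. Stage 2: classify the label sequence."""
    main = [classify(seg, stage1) for seg in segments]
    symbols = [label_ids[m] for m in main]
    return main, classify(symbols, stage2)

# Toy left-to-right models (assumed values): (start, transition, emission).
START = [1.0, 0.0]
TRANS = [[0.6, 0.4], [0.0, 1.0]]
ZERO_THEN_ONE = (START, TRANS, [[0.9, 0.1], [0.1, 0.9]])
ONE_THEN_ZERO = (START, TRANS, [[0.1, 0.9], [0.9, 0.1]])

stage1 = {"swipe": ZERO_THEN_ONE, "tap": ONE_THEN_ZERO}
label_ids = {"swipe": 0, "tap": 1}
stage2 = {"swipe_then_tap": ZERO_THEN_ONE, "tap_then_swipe": ONE_THEN_ZERO}

segments = [[0, 0, 1, 1], [0, 0, 0, 1], [1, 1, 0, 0]]
main_gestures, higher_order = two_stage(segments, stage1, label_ids, stage2)
print(main_gestures, higher_order)
```

In practice the model parameters would be trained (for example with Baum-Welch) rather than written by hand, and IMU/EMG features would typically need continuous emission densities or a quantization step.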

Speech emotion recognition using attention mechanism-based deep neural networks (주목 메커니즘 기반의 심층신경망을 이용한 음성 감정인식)

Ko, Sang-Sun;Cho, Hye-Seung;Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea / v.36 no.6 / pp.407-412 / 2017
In this paper, we propose a speech emotion recognition method using a deep neural network based on the attention mechanism. The proposed method combines a CNN (Convolutional Neural Network), a GRU (Gated Recurrent Unit), a DNN (Deep Neural Network), and an attention mechanism. The spectrogram of a speech signal contains characteristic patterns that depend on the emotion. We therefore modeled these patterns by using tuned Gabor filters as the convolutional filters of a typical CNN. In addition, we applied the attention mechanism with a CNN and an FC (Fully-Connected) layer to obtain attention weights that take into account the context of the extracted features, and used them for emotion recognition. To verify the proposed method, we conducted emotion recognition experiments on six emotions. The experimental results show that the proposed method achieves higher performance in speech emotion recognition than conventional methods.
https://doi.org/10.7776/ASK.2017.36.6.407

Deep Learning Based 3D Gesture Recognition Using Spatio-Temporal Normalization (시 공간 정규화를 통한 딥 러닝 기반의 3D 제스처 인식)

Chae, Ji Hun;Gang, Su Myung;Kim, Hae Sung;Lee, Joon Jae
- Journal of Korea Multimedia Society / v.21 no.5 / pp.626-637 / 2018
Humans exchange information not only through words but also through body and hand gestures, which can be used to build effective interfaces for mobile, virtual reality, and augmented reality applications. Earlier 2D gesture recognition research suffered from information loss caused by projecting 3D information onto 2D. Because recognition in 3D covers a wider range than in 2D, the complexity of gesture recognition increases. In this paper, we propose a real-time deep learning model and application for gesture recognition in 3D space. First, to recognize gestures in 3D space, data are constructed and collected using the Unity game engine. Second, the input vectors are normalized for training the deep learning-based 3D gesture recognition model. Third, the SELU (Scaled Exponential Linear Unit) function is used as the activation function of the neural network for faster learning and better recognition performance. The proposed system is expected to be applicable to fields such as rehabilitation care, game applications, and virtual reality.
https://doi.org/10.9717/kmms.2018.21.5.626
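For reference, the SELU activation named in this abstract can be written in a few lines. The sketch below uses the standard published constants for SELU; the function name `selu` and the scalar (non-vectorized) form are choices made here for illustration and say nothing about how the paper's network is implemented.

```python
import math

# Standard SELU constants (fixed so that activations self-normalize).
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805

def selu(x):
    """Scaled Exponential Linear Unit for a single float."""
    if x > 0:
        return SELU_SCALE * x
    # expm1(x) = exp(x) - 1, computed accurately for x near zero.
    return SELU_SCALE * SELU_ALPHA * math.expm1(x)

print([round(selu(v), 4) for v in (-2.0, 0.0, 2.0)])
```

For positive inputs SELU is a scaled identity; for large negative inputs it saturates at about -1.7581 (the product of the two constants).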

Retrieve System for Performance support of Vocabulary Clustering Model In Continuous Vocabulary Recognition System (연속 어휘 인식 시스템에서 어휘 클러스터링 모델의 성능 지원을 위한 검색 시스템)

Oh, Sang Yeob
- Journal of Digital Convergence / v.10 no.9 / pp.339-344 / 2012
Existing continuous vocabulary recognition systems improved the recognition rate by using a decision tree-based tying modeling method. However, because the system model cannot support retrieval of phoneme data, it is hard to secure accuracy. To address this problem, we remodeled the system so that it can retrieve the probabilistic model from the continuous vocabulary clustering model at the phoneme-unit level. The resulting system showed a recognition rate of 95.88%.
https://doi.org/10.14400/JDPM.2012.10.9.339

The Korean Word Length Effect on Auditory Word Recognition (청각단어 재인에서 나타난 한국어 단어 길이 효과)

Choi Wonil;Nam Kichun
- MALSORI / no.44 / pp.33-46 / 2002
This study was conducted to examine the effect of word length on auditory word recognition. Word length can be defined by several sublexical units, such as letters, phonemes, and syllables. To find out which sublexical units are influential in auditory word recognition, the auditory lexical decision task was used. In Experiment 1, we examined the partial correlation between reaction time and the number of sublexical units, and in Experiment 2, we ran an ANOVA to find out which sublexical length variable was influential. Through these two experiments, we concluded that syllable length was the most important variable in auditory word recognition.

Deep Neural Network-based Jellyfish Distribution Recognition System Using a UAV (무인기를 이용한 심층 신경망 기반 해파리 분포 인식 시스템)

Koo, Jungmo;Myung, Hyun
- The Journal of Korea Robotics Society / v.12 no.4 / pp.432-440 / 2017
In this paper, we propose a jellyfish distribution recognition and monitoring system using a UAV (unmanned aerial vehicle). The UAV was designed to satisfy the requirements for flight in an ocean environment. The target jellyfish, Aurelia aurita, is recognized through a convolutional neural network, and its distribution is calculated. A modified deep neural network architecture was developed to achieve reliable recognition accuracy and fast operation. By using a lightweight network architecture, recognition is about 400 times faster than with GoogLeNet. We also introduce a method for selecting candidates to be used as inputs to the proposed network. Recognition accuracy is improved by removing the probability of the meaningless class from the probability vector of the evaluated input image and re-evaluating the vector after normalization. The jellyfish distribution is calculated from the recognized unit jellyfish images. The distribution level is defined by using the novel concept of a distribution map buffer.
https://doi.org/10.7746/jkros.2017.12.4.432
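The re-evaluation step described in this abstract (drop the meaningless class, then normalize) can be illustrated with a short sketch. The function name `drop_and_renormalize`, the class order, and the example probabilities are assumptions made here; the abstract does not give the actual class set or values.

```python
def drop_and_renormalize(probs, drop_index):
    """Remove one class from a probability vector and rescale the rest to sum to 1."""
    kept = [p for i, p in enumerate(probs) if i != drop_index]
    total = sum(kept)
    if total == 0:
        raise ValueError("remaining probabilities sum to zero")
    return [p / total for p in kept]

# Hypothetical classifier output: index 0 is the "meaningless" class.
raw = [0.5, 0.3, 0.2]
print(drop_and_renormalize(raw, drop_index=0))
```

After the meaningless class is removed, the remaining classes keep their relative ratios, so the decision among them is made on a proper probability vector.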

Gesture Recognition Using Higher Correlation Feature Information and PCA

Kim, Jong-Min;Lee, Kee-Jun
- Journal of Integrative Natural Science / v.5 no.2 / pp.120-126 / 2012
This paper describes an algorithm that lowers the dimension, maintains gesture recognition, and significantly reduces the eigenspace configuration time by combining higher correlation feature information with Principal Component Analysis (PCA). Experiments show that, because the suggested method requires less computation than methods that use geometric information or stereo images, it is well suited to building a real-time system. In addition, because the existing point-to-point method, a simple distance calculation, produces many errors, this paper reduces recognition errors and improves the recognition rate by using several successive input images as the unit of recognition with K-Nearest Neighbor, an improved class-to-class method.
https://doi.org/10.13160/ricns.2012.5.2.120

A Study for the Improvement of the Fault Decision Capability of FRTU using Discrete Wavelet Transform and Neural Network (이산 웨이블릿 변환과 신경회로망을 이용한 FRTU의 고장판단 능력 개선에 관한 연구)

Hong, Dae-Seung;Ko, Yoon-Seok;Kang, Tae-Ku;Park, Hak-Yeol;Yim, Hwa-Young
- The Transactions of The Korean Institute of Electrical Engineers / v.56 no.7 / pp.1183-1190 / 2007
This paper proposes an improved fault decision algorithm using the DWT (Discrete Wavelet Transform) and ANNs for the FRTU (Feeder Remote Terminal Unit) on feeders in the power distribution system. Generally, the FRTU has a fault decision scheme that detects phase faults and ground faults. In particular, the FRTU has an inrush restraint function lasting 2000 ms, which keeps the FI (Fault Indicator) from operating on the inrush current generated at switching time. Its drawback is that the FI cannot be operated by a real fault current during the inrush restraint time, so the fault zone cannot be found from the FI information. An improved fault recognition algorithm is therefore needed. DWT analysis gives frequency and time-scale information. The neural network used for fault recognition was trained by a gradient descent method to distinguish inrush current from the fault status. In this paper, the fault recognition algorithm is improved by using a voltage monitoring system, the DWT, and a neural network. All data were measured in an actual 22.9 kV power distribution system.
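To make the DWT step in this abstract concrete, here is a minimal one-level transform. The abstract does not say which wavelet the authors used, so the Haar wavelet is an assumption chosen for brevity, and `haar_dwt` is a hypothetical helper name. The approximation coefficients carry the slowly varying part of the signal, and the detail coefficients respond to abrupt changes, which is the kind of time-localized information a classifier could use to separate transients.

```python
import math

def haar_dwt(signal):
    """One level of the orthonormal Haar DWT.

    Returns (approximation, detail) coefficient lists, each half the input length.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

# A flat signal has no detail; a sudden jump shows up in the detail coefficients.
flat_approx, flat_detail = haar_dwt([1.0, 1.0, 1.0, 1.0])
jump_approx, jump_detail = haar_dwt([0.0, 0.0, 0.0, 4.0])
print(flat_detail, jump_detail)
```

Applying the function again to the approximation coefficients gives the next, coarser level of the decomposition.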

An Analysis of Recognition and Preference for the View in an Apartment Unit (아파트 단위세대에서 보이는 경관에 대한 인지 및 선호 특성)

Moon, Ji-Won;Ha, Jae-Myung
- Journal of the Korean housing association / v.18 no.1 / pp.83-93 / 2007
Building on previous studies, this study explores methods for qualitative assessment of the view from apartment units. It first supplemented and analyzed the attributes of landscape elements, then set up questionnaire items based on these attributes to identify tendencies in apartment residents' recognition of landscape elements, and finally conducted a preference assessment on test cases sampled from pictures and other data collected in the previous studies, to identify how preference for the view from apartment units varies with landscape elements. The following results were derived. First, the landscape elements seen from apartment units may be classified into a total of sixteen categories, and the overall ratio of natural elements to artificial ones is approximately one to three. Second, apartment dwellers tend to prefer natural landscape elements over artificial ones, and preferences for the distance to and location of landscape elements vary with the type of element. Third, the analysis of the preference for landscape elements revealed that the types of landscape elements, their make-up and diversity, and the perceived distance to them, as well as the resulting feeling of openness, all affect preference tendencies.

Implementation of A Morphological Analyzer Based on Pseudo-morpheme for Large Vocabulary Speech Recognizing (대어휘 음성인식을 위한 의사형태소 분석 시스템의 구현)

양승원
- Journal of Korea Society of Industrial Information Systems / v.4 no.2 / pp.102-108 / 1999
Deciding the processing unit is important in a large vocabulary speech recognition system. We propose the Pseudo-Morpheme as the recognition unit to resolve the problems of recognition systems that use the phrase or the general morpheme. We implement a morphological analysis system and tagger for the Pseudo-Morpheme. A speech processing system using this pseudo-morpheme can get better results than systems using the phrase or the general morpheme, so the quality of the whole spoken language translation system can be improved. The analysis ratio of our implemented system is similar to that of common morphological analysis systems.

Search Result 515, Processing Time 0.025 seconds
