통합 검색 | Korea Science

얼굴영상과 음성을 이용한 멀티모달 감정인식 (Multimodal Emotion Recognition using Face Image and Speech)

이현구;김동주
- 디지털산업정보학회논문지
- /
- 제8권1호
- /
- pp.29-40
- /
- 2012
A challenging research issue that has been one of growing importance to those working in human-computer interaction are to endow a machine with an emotional intelligence. Thus, emotion recognition technology plays an important role in the research area of human-computer interaction, and it allows a more natural and more human-like communication between human and computer. In this paper, we propose the multimodal emotion recognition system using face and speech to improve recognition performance. The distance measurement of the face-based emotion recognition is calculated by 2D-PCA of MCS-LBP image and nearest neighbor classifier, and also the likelihood measurement is obtained by Gaussian mixture model algorithm based on pitch and mel-frequency cepstral coefficient features in speech-based emotion recognition. The individual matching scores obtained from face and speech are combined using a weighted-summation operation, and the fused-score is utilized to classify the human emotion. Through experimental results, the proposed method exhibits improved recognition accuracy of about 11.25% to 19.75% when compared to the most uni-modal approach. From these results, we confirmed that the proposed approach achieved a significant performance improvement and the proposed method was very effective.
KSCI

Human Action Recognition Via Multi-modality Information

Gao, Zan;Song, Jian-Ming;Zhang, Hua;Liu, An-An;Xue, Yan-Bing;Xu, Guang-Ping
- Journal of Electrical Engineering and Technology
- /
- 제9권2호
- /
- pp.739-748
- /
- 2014
In this paper, we propose pyramid appearance and global structure action descriptors on both RGB and depth motion history images and a model-free method for human action recognition. In proposed algorithm, we firstly construct motion history image for both RGB and depth channels, at the same time, depth information is employed to filter RGB information, after that, different action descriptors are extracted from depth and RGB MHIs to represent these actions, and then multimodality information collaborative representation and recognition model, in which multi-modality information are put into object function naturally, and information fusion and action recognition also be done together, is proposed to classify human actions. To demonstrate the superiority of the proposed method, we evaluate it on MSR Action3D and DHA datasets, the well-known dataset for human action recognition. Large scale experiment shows our descriptors are robust, stable and efficient, when comparing with the-state-of-the-art algorithms, the performances of our descriptors are better than that of them, further, the performance of combined descriptors is much better than just using sole descriptor. What is more, our proposed model outperforms the state-of-the-art methods on both MSR Action3D and DHA datasets.
https://doi.org/10.5370/JEET.2014.9.2.739 인용 PDF KSCI KPUBS HTML

A Multi-Scale Parallel Convolutional Neural Network Based Intelligent Human Identification Using Face Information

Li, Chen;Liang, Mengti;Song, Wei;Xiao, Ke
- Journal of Information Processing Systems
- /
- 제14권6호
- /
- pp.1494-1507
- /
- 2018
Intelligent human identification using face information has been the research hotspot ranging from Internet of Things (IoT) application, intelligent self-service bank, intelligent surveillance to public safety and intelligent access control. Since 2D face images are usually captured from a long distance in an unconstrained environment, to fully exploit this advantage and make human recognition appropriate for wider intelligent applications with higher security and convenience, the key difficulties here include gray scale change caused by illumination variance, occlusion caused by glasses, hair or scarf, self-occlusion and deformation caused by pose or expression variation. To conquer these, many solutions have been proposed. However, most of them only improve recognition performance under one influence factor, which still cannot meet the real face recognition scenario. In this paper we propose a multi-scale parallel convolutional neural network architecture to extract deep robust facial features with high discriminative ability. Abundant experiments are conducted on CMU-PIE, extended FERET and AR database. And the experiment results show that the proposed algorithm exhibits excellent discriminative ability compared with other existing algorithms.
https://doi.org/10.3745/JIPS.02.0103 인용 PDF KSCI HTML

스마트폰 기반 행동인식 기술 동향 (Trends in Activity Recognition Using Smartphone Sensors)

김무섭;정치윤;손종무;임지연;정승은;정현태;신형철
- 전자통신동향분석
- /
- 제33권3호
- /
- pp.89-99
- /
- 2018
Human activity recognition (HAR) is a technology that aims to offer an automatic recognition of what a person is doing with respect to their body motion and gestures. HAR is essential in many applications such as human-computer interaction, health care, rehabilitation engineering, video surveillance, and artificial intelligence. Smartphones are becoming the most popular platform for activity recognition owing to their convenience, portability, and ease of use. The noticeable change in smartphone-based activity recognition is the adoption of a deep learning algorithm leading to successful learning outcomes. In this article, we analyze the technology trend of activity recognition using smartphone sensors, challenging issues for future development, and a strategy change in terms of the generation of a activity recognition dataset.
https://doi.org/10.22648/ETRI.2018.J.330310 인용 PDF

Human Action Recognition Based on 3D Human Modeling and Cyclic HMMs

Ke, Shian-Ru;Thuc, Hoang Le Uyen;Hwang, Jenq-Neng;Yoo, Jang-Hee;Choi, Kyoung-Ho
- ETRI Journal
- /
- 제36권4호
- /
- pp.662-672
- /
- 2014
Human action recognition is used in areas such as surveillance, entertainment, and healthcare. This paper proposes a system to recognize both single and continuous human actions from monocular video sequences, based on 3D human modeling and cyclic hidden Markov models (CHMMs). First, for each frame in a monocular video sequence, the 3D coordinates of joints belonging to a human object, through actions of multiple cycles, are extracted using 3D human modeling techniques. The 3D coordinates are then converted into a set of geometrical relational features (GRFs) for dimensionality reduction and discrimination increase. For further dimensionality reduction, k-means clustering is applied to the GRFs to generate clustered feature vectors. These vectors are used to train CHMMs separately for different types of actions, based on the Baum-Welch re-estimation algorithm. For recognition of continuous actions that are concatenated from several distinct types of actions, a designed graphical model is used to systematically concatenate different separately trained CHMMs. The experimental results show the effective performance of our proposed system in both single and continuous action recognition problems.
https://doi.org/10.4218/etrij.14.0113.0647 인용 PDF KSCI KPUBS

Emotion Recognition Method for Driver Services

Kim, Ho-Duck;Sim, Kwee-Bo
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- 제7권4호
- /
- pp.256-261
- /
- 2007
Electroencephalographic(EEG) is used to record activities of human brain in the area of psychology for many years. As technology developed, neural basis of functional areas of emotion processing is revealed gradually. So we measure fundamental areas of human brain that controls emotion of human by using EEG. Hands gestures such as shaking and head gesture such as nodding are often used as human body languages for communication with each other, and their recognition is important that it is a useful communication medium between human and computers. Research methods about gesture recognition are used of computer vision. Many researchers study Emotion Recognition method which uses one of EEG signals and Gestures in the existing research. In this paper, we use together EEG signals and Gestures for Emotion Recognition of human. And we select the driver emotion as a specific target. The experimental result shows that using of both EEG signals and gestures gets high recognition rates better than using EEG signals or gestures. Both EEG signals and gestures use Interactive Feature Selection(IFS) for the feature selection whose method is based on the reinforcement learning.
https://doi.org/10.5391/IJFIS.2007.7.4.256 인용 PDF KSCI

Intention Recognition Using Case-base Learning in Human Vehicle

Yamaguchi, Toru;Dayaong, Chen;Takeda, Yasuhiro;Jing, Jianping
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 2003년도 ISIS 2003
- /
- pp.110-113
- /
- 2003
Most traffic accidents are caused by drivers' carelessness and lack of information on the surrounding objects. In this paper we proposed a model of human intention recognition through case-base learning and to build up an experiment system. The system can help us recognize object's intention (e.g. turn left, turn right or straight) by using detected data about human's motion, speed of the car and the distance between the car and the intersection. Furthermore, we included an example using case-base learning in this paper to improve the precision of recognition as well as an example to explain the use of the system. PC can be used to predict the driving reaction beforehand and send a warning signal to the driver in time if there is any danger.
PDF

Representing Human Motions in an Eigenspace Based on Surrounding Cameras

Houman, Satoshi;Rahman, M. Masudur;Tan, Joo Kooi;Ishikawa, Seiji
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2004년도 ICCAS
- /
- pp.1808-1813
- /
- 2004
Recognition of human motions using their 2-D images has various applications. An eigenspace method is employed in this paper for representing and recognizing human motions. An eigenspace is created from the images taken by multiple cameras that surround a human in motion. Image streams obtained from the cameras compose the same number of curved lines in the eigenspace and they are used for recognizing a human motion in a video image. Performance of the proposed technique is shown experimentally.
PDF

Spatio-Temporal Analysis of Trajectory for Pedestrian Activity Recognition

Kim, Young-Nam;Park, Jin-Hee;Kim, Moon-Hyun
- Journal of Electrical Engineering and Technology
- /
- 제13권2호
- /
- pp.961-968
- /
- 2018
Recently, researches on automatic recognition of human activities have been actively carried out with the emergence of various intelligent systems. Since a large amount of visual data can be secured through Closed Circuit Television, it is required to recognize human behavior in a dynamic situation rather than a static situation. In this paper, we propose new intelligent human activity recognition model using the trajectory information extracted from the video sequence. The proposed model consists of three steps: segmentation and partitioning of trajectory step, feature extraction step, and behavioral learning step. First, the entire trajectory is fuzzy partitioned according to the motion characteristics, and then temporal features and spatial features are extracted. Using the extracted features, four pedestrian behaviors were modeled by decision tree learning algorithm and performance evaluation was performed. The experiments in this paper were conducted using Caviar data sets. Experimental results show that trajectory provides good activity recognition accuracy by extracting instantaneous property and distinctive regional property.
https://doi.org/10.5370/JEET.2018.13.2.961 인용 PDF KSCI

Vector space based augmented structural kinematic feature descriptor for human activity recognition in videos

Dharmalingam, Sowmiya;Palanisamy, Anandhakumar
- ETRI Journal
- /
- 제40권4호
- /
- pp.499-510
- /
- 2018
A vector space based augmented structural kinematic (VSASK) feature descriptor is proposed for human activity recognition. An action descriptor is built by integrating the structural and kinematic properties of the actor using vector space based augmented matrix representation. Using the local or global information separately may not provide sufficient action characteristics. The proposed action descriptor combines both the local (pose) and global (position and velocity) features using augmented matrix schema and thereby increases the robustness of the descriptor. A multiclass support vector machine (SVM) is used to learn each action descriptor for the corresponding activity classification and understanding. The performance of the proposed descriptor is experimentally analyzed using the Weizmann and KTH datasets. The average recognition rate for the Weizmann and KTH datasets is 100% and 99.89%, respectively. The computational time for the proposed descriptor learning is 0.003 seconds, which is an improvement of approximately 1.4% over the existing methods.
https://doi.org/10.4218/etrij.2018-0102 인용 PDF KSCI

검색결과 755건 처리시간 0.032초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)