Search | Korea Science

Speech Emotion Recognition using Feature Selection and Fusion Method (특징 선택과 융합 방법을 이용한 음성 감정 인식)

Kim, Weon-Goo
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.66 no.8
- /
- pp.1265-1271
- /
- 2017
In this paper, the speech parameter fusion method is studied to improve the performance of the conventional emotion recognition system. For this purpose, the combination of the parameters that show the best performance by combining the cepstrum parameters and the various pitch parameters used in the conventional emotion recognition system are selected. Various pitch parameters were generated using numerical and statistical methods using pitch of speech. Performance evaluation was performed on the emotion recognition system using Gaussian mixture model(GMM) to select the pitch parameters that showed the best performance in combination with cepstrum parameters. As a parameter selection method, sequential feature selection method was used. In the experiment to distinguish the four emotions of normal, joy, sadness and angry, fifteen of the total 56 pitch parameters were selected and showed the best recognition performance when fused with cepstrum and delta cepstrum coefficients. This is a 48.9% reduction in the error of emotion recognition system using only pitch parameters.
https://doi.org/10.5370/KIEE.2017.66.8.1265 인용 PDF KSCI

Emotion Recognition and Expression System of User using Multi-Modal Sensor Fusion Algorithm (다중 센서 융합 알고리즘을 이용한 사용자의 감정 인식 및 표현 시스템)

Yeom, Hong-Gi;Joo, Jong-Tae;Sim, Kwee-Bo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.18 no.1
- /
- pp.20-26
- /
- 2008
As they have more and more intelligence robots or computers these days, so the interaction between intelligence robot(computer) - human is getting more and more important also the emotion recognition and expression are indispensable for interaction between intelligence robot(computer) - human. In this paper, firstly we extract emotional features at speech signal and facial image. Secondly we apply both BL(Bayesian Learning) and PCA(Principal Component Analysis), lastly we classify five emotions patterns(normal, happy, anger, surprise and sad) also, we experiment with decision fusion and feature fusion to enhance emotion recognition rate. The decision fusion method experiment on emotion recognition that result values of each recognition system apply Fuzzy membership function and the feature fusion method selects superior features through SFS(Sequential Forward Selection) method and superior features are applied to Neural Networks based on MLP(Multi Layer Perceptron) for classifying five emotions patterns. and recognized result apply to 2D facial shape for express emotion.
https://doi.org/10.5391/JKIIS.2008.18.1.020 인용 PDF KSCI

Gait Recognition Algorithm Based on Feature Fusion of GEI Dynamic Region and Gabor Wavelets

Huang, Jun;Wang, Xiuhui;Wang, Jun
- Journal of Information Processing Systems
- /
- v.14 no.4
- /
- pp.892-903
- /
- 2018
The paper proposes a novel gait recognition algorithm based on feature fusion of gait energy image (GEI) dynamic region and Gabor, which consists of four steps. First, the gait contour images are extracted through the object detection, binarization and morphological process. Secondly, features of GEI at different angles and Gabor features with multiple orientations are extracted from the dynamic part of GEI, respectively. Then averaging method is adopted to fuse features of GEI dynamic region with features of Gabor wavelets on feature layer and the feature space dimension is reduced by an improved Kernel Principal Component Analysis (KPCA). Finally, the vectors of feature fusion are input into the support vector machine (SVM) based on multi classification to realize the classification and recognition of gait. The primary contributions of the paper are: a novel gait recognition algorithm based on based on feature fusion of GEI and Gabor is proposed; an improved KPCA method is used to reduce the feature matrix dimension; a SVM is employed to identify the gait sequences. The experimental results suggest that the proposed algorithm yields over 90% of correct classification rate, which testify that the method can identify better different human gait and get better recognized effect than other existing algorithms.
https://doi.org/10.3745/JIPS.02.0088 인용 PDF KSCI

Ensemble convolutional neural networks for automatic fusion recognition of multi-platform radar emitters

Zhou, Zhiwen;Huang, Gaoming;Wang, Xuebao
- ETRI Journal
- /
- v.41 no.6
- /
- pp.750-759
- /
- 2019
Presently, the extraction of hand-crafted features is still the dominant method in radar emitter recognition. To solve the complicated problems of selection and updation of empirical features, we present a novel automatic feature extraction structure based on deep learning. In particular, a convolutional neural network (CNN) is adopted to extract high-level abstract representations from the time-frequency images of emitter signals. Thus, the redundant process of designing discriminative features can be avoided. Furthermore, to address the performance degradation of a single platform, we propose the construction of an ensemble learning-based architecture for multi-platform fusion recognition. Experimental results indicate that the proposed algorithms are feasible and effective, and they outperform other typical feature extraction and fusion recognition methods in terms of accuracy. Moreover, the proposed structure could be extended to other prevalent ensemble learning alternatives.
https://doi.org/10.4218/etrij.2017-0327 인용 PDF KSCI

MSFM: Multi-view Semantic Feature Fusion Model for Chinese Named Entity Recognition

Liu, Jingxin;Cheng, Jieren;Peng, Xin;Zhao, Zeli;Tang, Xiangyan;Sheng, Victor S.
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.16 no.6
- /
- pp.1833-1848
- /
- 2022
Named entity recognition (NER) is an important basic task in the field of Natural Language Processing (NLP). Recently deep learning approaches by extracting word segmentation or character features have been proved to be effective for Chinese Named Entity Recognition (CNER). However, since this method of extracting features only focuses on extracting some of the features, it lacks textual information mining from multiple perspectives and dimensions, resulting in the model not being able to fully capture semantic features. To tackle this problem, we propose a novel Multi-view Semantic Feature Fusion Model (MSFM). The proposed model mainly consists of two core components, that is, Multi-view Semantic Feature Fusion Embedding Module (MFEM) and Multi-head Self-Attention Mechanism Module (MSAM). Specifically, the MFEM extracts character features, word boundary features, radical features, and pinyin features of Chinese characters. The acquired font shape, font sound, and font meaning features are fused to enhance the semantic information of Chinese characters with different granularities. Moreover, the MSAM is used to capture the dependencies between characters in a multi-dimensional subspace to better understand the semantic features of the context. Extensive experimental results on four benchmark datasets show that our method improves the overall performance of the CNER model.
https://doi.org/10.3837/tiis.2022.06.004 인용 PDF KSCI HTML

Multimodal Attention-Based Fusion Model for Context-Aware Emotion Recognition

Vo, Minh-Cong;Lee, Guee-Sang
- International Journal of Contents
- /
- v.18 no.3
- /
- pp.11-20
- /
- 2022
Human Emotion Recognition is an exciting topic that has been attracting many researchers for a lengthy time. In recent years, there has been an increasing interest in exploiting contextual information on emotion recognition. Some previous explorations in psychology show that emotional perception is impacted by facial expressions, as well as contextual information from the scene, such as human activities, interactions, and body poses. Those explorations initialize a trend in computer vision in exploring the critical role of contexts, by considering them as modalities to infer predicted emotion along with facial expressions. However, the contextual information has not been fully exploited. The scene emotion created by the surrounding environment, can shape how people perceive emotion. Besides, additive fusion in multimodal training fashion is not practical, because the contributions of each modality are not equal to the final prediction. The purpose of this paper was to contribute to this growing area of research, by exploring the effectiveness of the emotional scene gist in the input image, to infer the emotional state of the primary target. The emotional scene gist includes emotion, emotional feelings, and actions or events that directly trigger emotional reactions in the input image. We also present an attention-based fusion network, to combine multimodal features based on their impacts on the target emotional state. We demonstrate the effectiveness of the method, through a significant improvement on the EMOTIC dataset.
https://doi.org/10.5392/IJoC.2022.18.3.011 인용 PDF KSCI HTML

Video Expression Recognition Method Based on Spatiotemporal Recurrent Neural Network and Feature Fusion

Zhou, Xuan
- Journal of Information Processing Systems
- /
- v.17 no.2
- /
- pp.337-351
- /
- 2021
Automatically recognizing facial expressions in video sequences is a challenging task because there is little direct correlation between facial features and subjective emotions in video. To overcome the problem, a video facial expression recognition method using spatiotemporal recurrent neural network and feature fusion is proposed. Firstly, the video is preprocessed. Then, the double-layer cascade structure is used to detect a face in a video image. In addition, two deep convolutional neural networks are used to extract the time-domain and airspace facial features in the video. The spatial convolutional neural network is used to extract the spatial information features from each frame of the static expression images in the video. The temporal convolutional neural network is used to extract the dynamic information features from the optical flow information from multiple frames of expression images in the video. A multiplication fusion is performed with the spatiotemporal features learned by the two deep convolutional neural networks. Finally, the fused features are input to the support vector machine to realize the facial expression classification task. The experimental results on cNTERFACE, RML, and AFEW6.0 datasets show that the recognition rates obtained by the proposed method are as high as 88.67%, 70.32%, and 63.84%, respectively. Comparative experiments show that the proposed method obtains higher recognition accuracy than other recently reported methods.
https://doi.org/10.3745/JIPS.01.0067 인용 PDF KSCI

Novel function of stabilin-2 in myoblast fusion: the recognition of extracellular phosphatidylserine as a "fuse-me" signal

Kim, Go-Woon;Park, Seung-Yoon;Kim, In-San
- BMB Reports
- /
- v.49 no.6
- /
- pp.303-304
- /
- 2016
Myoblast fusion is important for skeletal muscle formation. Even though the knowledge of myoblast fusion mechanism has accumulated over the years, the initial signal of fusion is yet to be elucidated. Our study reveals the novel function of a phosphatidylserine (PS) receptor, stabilin-2 (Stab2), in the modulation of myoblast fusion, through the recognition of PS exposed on myoblasts. During differentiation of myoblasts, Stab2 expression is higher than other PS receptors and is controlled by calcineurin/NFAT signaling on myoblasts. The forced expression of Stab2 results in an increase in myoblast fusion; genetic ablation of Stab2 in mice causes a reduction in muscle size, as a result of impaired myoblast fusion. After muscle injury, muscle regeneration is impaired in Stab2-deficient mice, resulting in small myofibers with fewer nuclei, which is due to reduction of fusion rather than defection of myoblast differentiation. The fusion-promoting role of Stab2 is dependent on its PS-binding motif, and the blocking of PS-Stab2 binding impairs cell-cell fusion on myoblasts. Given our previous finding that Stab2 recognizes PS exposed on apoptotic cells for sensing as an "eat-me" signal, we propose that PS-Stab2 binding is required for sensing of a "fuse-me" signal as the initial signal of myoblast fusion.
https://doi.org/10.5483/BMBRep.2016.49.6.078 인용 PDF KSCI KPUBS

Bayesian Fusion of Confidence Measures for Confidence Scoring (베이시안 신뢰도 융합을 이용한 신뢰도 측정)

김태윤;고한석
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.5
- /
- pp.410-419
- /
- 2004
In this paper. we propose a method of confidence measure fusion under Bayesian framework for speech recognition. Centralized and distributed schemes are considered for confidence measure fusion. Centralized fusion is feature level fusion which combines the values of individual confidence scores and makes a final decision. In contrast. distributed fusion is decision level fusion which combines the individual decision makings made by each individual confidence measuring method. Optimal Bayesian fusion rules for centralized and distributed cases are presented. In isolated word Out-of-Vocabulary (OOV) rejection experiments. centralized Bayesian fusion shows over 13% relative equal error rate (EER) reduction compared with the individual confidence measure methods. In contrast. the distributed Bayesian fusion shows no significant performance increase.
PDF KSCI

Multi-classifier Fusion Based Facial Expression Recognition Approach

Jia, Xibin;Zhang, Yanhua;Powers, David;Ali, Humayra Binte
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.1
- /
- pp.196-212
- /
- 2014
Facial expression recognition is an important part in emotional interaction between human and machine. This paper proposes a facial expression recognition approach based on multi-classifier fusion with stacking algorithm. The kappa-error diagram is employed in base-level classifiers selection, which gains insights about which individual classifier has the better recognition performance and how diverse among them to help improve the recognition accuracy rate by fusing the complementary functions. In order to avoid the influence of the chance factor caused by guessing in algorithm evaluation and get more reliable awareness of algorithm performance, kappa and informedness besides accuracy are utilized as measure criteria in the comparison experiments. To verify the effectiveness of our approach, two public databases are used in the experiments. The experiment results show that compared with individual classifier and two other typical ensemble methods, our proposed stacked ensemble system does recognize facial expression more accurately with less standard deviation. It overcomes the individual classifier's bias and achieves more reliable recognition results.
https://doi.org/10.3837/tiis.2014.01.012 인용 PDF KSCI KPUBS

Search Result 363, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)