Search | Korea Science

Multimodal Parametric Fusion for Emotion Recognition

Kim, Jonghwa
- International journal of advanced smart convergence
- /
- v.9 no.1
- /
- pp.193-201
- /
- 2020
The main objective of this study is to investigate the impact of additional modalities on the performance of emotion recognition using speech, facial expression and physiological measurements. In order to compare different approaches, we designed a feature-based recognition system as a benchmark which carries out linear supervised classification followed by the leave-one-out cross-validation. For the classification of four emotions, it turned out that bimodal fusion in our experiment improves recognition accuracy of unimodal approach, while the performance of trimodal fusion varies strongly depending on the individual. Furthermore, we experienced extremely high disparity between single class recognition rates, while we could not observe a best performing single modality in our experiment. Based on these observations, we developed a novel fusion method, called parametric decision fusion (PDF), which lies in building emotion-specific classifiers and exploits advantage of a parametrized decision process. By using the PDF scheme we achieved 16% improvement in accuracy of subject-dependent recognition and 10% for subject-independent recognition compared to the best unimodal results.
https://doi.org/10.7236/IJASC.2020.9.1.193 인용 PDF KSCI

Smart Phone Road Signs Recognition Model Using Image Segmentation Algorithm

Huang, Ying;Song, Jeong-Young
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2012.10a
- /
- pp.887-890
- /
- 2012
Image recognition is one of the most important research directions of pattern recognition. Image based road automatic identification technology is widely used in current society, the intelligence has become the trend of the times. This paper studied the image segmentation algorithm theory and its application in road signs recognition system. With the help of image processing technique, respectively, on road signs automatic recognition algorithm of three main parts, namely, image segmentation, character segmentation, image and character recognition, made a systematic study and algorithm. The experimental results show that: the image segmentation algorithm to establish road signs recognition model, can make effective use of smart phone system and application.
PDF

An Efficient Binarization Method for Vehicle License Plate Character Recognition

Yang, Xue-Ya;Kim, Kyung-Lok;Hwang, Byung-Kon
- Journal of Korea Multimedia Society
- /
- v.11 no.12
- /
- pp.1649-1657
- /
- 2008
In this paper, to overcome the failure of binarization for the characters suffered from low contrast and non-uniform illumination in license plate character recognition system, we improved the binarization method by combining local thresholding with global thresholding and edge detection. Firstly, apply the local thresholding method to locate the characters in the license plate image and then get the threshold value for the character based on edge detector. This method solves the problem of local low contrast and non-uniform illumination. Finally, back-propagation Neural Network is selected as a powerful tool to perform the recognition process. The results of the experiments i1lustrate that the proposed binarization method works well and the selected classifier saves the processing time. Besides, the character recognition system performed better recognition accuracy 95.7%, and the recognition speed is controlled within 0.3 seconds.
PDF

Face Recognition System for Multimedia Application (멀티미디어 응용을 위한 얼굴 인식시스템)

Park, Sang-Gyou;Seong, Hyeon-Kyeong;Han, Young-Hwan
- Journal of IKEEE
- /
- v.6 no.2 s.11
- /
- pp.152-160
- /
- 2002
This paper is the realization of the face recognition system for multimedia application. This system is focused on the design concerning the improvement of recognition rate and the reduction of processing time for face recognition. The non-modificated application of typical RGB color system enables the reduction of time required for color system transform. The neural network and the application of algorithm using face characteristic improves the recognition rate. After mosaicking an image, a face-color block has been selected through the color analysis of mosaic block. The characteristic of the face removes the mis-checked face-color candidate block. Finally, from the face color block, four special values are obtained. These values are processed to the neural network using the back propagation algorithm. The output values are the touchstone to decide the genuineness of face field. The realized system showed 90% of face recognition rate with less than 0.1 second of processing time. This result can be understood as sufficient processing time and recognition rate to find out the face block for multimedia application in dynamic image.
PDF

A Training Method for Emotionally Robust Speech Recognition using Frequency Warping (주파수 와핑을 이용한 감정에 강인한 음성 인식 학습 방법)

Kim, Weon-Goo
- Journal of the Korean Institute of Intelligent Systems
- /
- v.20 no.4
- /
- pp.528-533
- /
- 2010
This paper studied the training methods less affected by the emotional variation for the development of the robust speech recognition system. For this purpose, the effect of emotional variation on the speech signal and the speech recognition system were studied using speech database containing various emotions. The performance of the speech recognition system trained by using the speech signal containing no emotion is deteriorated if the test speech signal contains the emotions because of the emotional difference between the test and training data. In this study, it is observed that vocal tract length of the speaker is affected by the emotional variation and this effect is one of the reasons that makes the performance of the speech recognition system worse. In this paper, a training method that cover the speech variations is proposed to develop the emotionally robust speech recognition system. Experimental results from the isolated word recognition using HMM showed that propose method reduced the error rate of the conventional recognition system by 28.4% when emotional test data was used.
https://doi.org/10.5391/JKIIS.2010.20.4.528 인용 PDF KSCI

Steganography based Multi-modal Biometrics System

Go, Hyoun-Joo;Chun, Myung-Geun
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.7 no.2
- /
- pp.148-153
- /
- 2007
This paper deals with implementing a steganography based multi-modal biometric system. For this purpose, we construct a multi-biometrics system based on the face and iris recognition. Here, the feature vector of iris pattern is hidden in the face image. The recognition system is designed by the fuzzy-based Linear Discriminant Analysis(LDA), which is an expanded approach of the LDA method combined by the theory of fuzzy sets. Furthermore, we present a watermarking method that can embed iris information into face images. Finally, we show the advantages of the proposed watermarking scheme by computing the ROC curves and make some comparisons recognition rates of watermarked face images with those of original ones. From various experiments, we found that our proposed scheme could be used for establishing efficient and secure multi-modal biometric systems.
https://doi.org/10.5391/IJFIS.2007.7.2.148 인용 PDF KSCI

Classification and recognition of electrical tracking signal by means of LabVIEW (LabVIEW에 의한 Tracking 신호 분류 및 인식)

Kim, Dae-Bok;Kim, Jung-Tae;Oh, Sung-Kwun
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.59 no.4
- /
- pp.779-787
- /
- 2010
In this paper, We introduce electrical tracking generated from surface activity associated with flow of leakage current on insulator under wet and contaminated conditions and design electrical tracking pattern recognition system by using LabVIEW. We measure the leaking current of contaminated wire by using LabVIEW software and the NI-c-DAQ 9172 and NI-9239 hardware. As pattern recognition algorithm and optimization algorithm for electrical tracking system, neural networks, Radial Basis Function Neural Networks(RBFNNs) and particle swarm optimization are exploited. The designed electrical tracking recognition system consists of two parts such as the hardware part of electrical tracking generator, the NI-c-DAQ 9172 and NI-9239 hardware and the software part of LabVIEW block diagram, LabVIEW front panel and pattern recognition-related application software. The electrical tracking system decides whether electrical tracking generate or not on electrical wire.
https://doi.org/10.5370/KIEE.2010.59.4.779 인용 PDF KSCI

User-customized Interaction using both Speech and Face Recognition (음성인식과 얼굴인식을 사용한 사용자 환경의 상호작용)

Kim, Sung-Ill;Oh, Se-Jin;Lee, Sang-Yong;Hwang, Seung-Gook
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2007.04a
- /
- pp.397-400
- /
- 2007
In this paper, we discuss the user-customized interaction for intelligent home environments. The interactive system is based upon the integrated techniques using both speech and face recognition. For essential modules, the speech recognition and synthesis were basically used for a virtual interaction between user and proposed system. In experiments, particularly, the real-time speech recognizer based on the HM-Net(Hidden Markov Network) was incorporated into the integrated system. Besides, the face identification was adopted to customize home environments for a specific user. In evaluation, the results showed that the proposed system was easy to use for intelligent home environments, even though the performance of the speech recognizer did not show a satisfactory results owing to the noisy environments.
PDF

Wavelet-based Feature Extraction Algorithm for an Iris Recognition System

Panganiban, Ayra;Linsangan, Noel;Caluyo, Felicito
- Journal of Information Processing Systems
- /
- v.7 no.3
- /
- pp.425-434
- /
- 2011
The success of iris recognition depends mainly on two factors: image acquisition and an iris recognition algorithm. In this study, we present a system that considers both factors and focuses on the latter. The proposed algorithm aims to find out the most efficient wavelet family and its coefficients for encoding the iris template of the experiment samples. The algorithm implemented in software performs segmentation, normalization, feature encoding, data storage, and matching. By using the Haar and Biorthogonal wavelet families at various levels feature encoding is performed by decomposing the normalized iris image. The vertical coefficient is encoded into the iris template and is stored in the database. The performance of the system is evaluated by using the number of degrees of freedom, False Reject Rate (FRR), False Accept Rate (FAR), and Equal Error Rate (EER) and the metrics show that the proposed algorithm can be employed for an iris recognition system.
https://doi.org/10.3745/JIPS.2011.7.3.425 인용 PDF KSCI

Speech Parameters for the Robust Emotional Speech Recognition (감정에 강인한 음성 인식을 위한 음성 파라메터)

Kim, Weon-Goo
- Journal of Institute of Control, Robotics and Systems
- /
- v.16 no.12
- /
- pp.1137-1142
- /
- 2010
This paper studied the speech parameters less affected by the human emotion for the development of the robust speech recognition system. For this purpose, the effect of emotion on the speech recognition system and robust speech parameters of speech recognition system were studied using speech database containing various emotions. In this study, mel-cepstral coefficient, delta-cepstral coefficient, RASTA mel-cepstral coefficient and frequency warped mel-cepstral coefficient were used as feature parameters. And CMS (Cepstral Mean Subtraction) method were used as a signal bias removal technique. Experimental results showed that the HMM based speaker independent word recognizer using vocal tract length normalized mel-cepstral coefficient, its derivatives and CMS as a signal bias removal showed the best performance of 0.78% word error rate. This corresponds to about a 50% word error reduction as compare to the performance of baseline system using mel-cepstral coefficient, its derivatives and CMS.
https://doi.org/10.5302/J.ICROS.2010.16.12.1137 인용 PDF KSCI

Search Result 6,982, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)