• 제목/요약/키워드: Feature recognition technology

검색결과 552건 처리시간 0.025초

Emotion Recognition Method for Driver Services

  • Kim, Ho-Duck;Sim, Kwee-Bo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제7권4호
    • /
    • pp.256-261
    • /
    • 2007
  • Electroencephalographic(EEG) is used to record activities of human brain in the area of psychology for many years. As technology developed, neural basis of functional areas of emotion processing is revealed gradually. So we measure fundamental areas of human brain that controls emotion of human by using EEG. Hands gestures such as shaking and head gesture such as nodding are often used as human body languages for communication with each other, and their recognition is important that it is a useful communication medium between human and computers. Research methods about gesture recognition are used of computer vision. Many researchers study Emotion Recognition method which uses one of EEG signals and Gestures in the existing research. In this paper, we use together EEG signals and Gestures for Emotion Recognition of human. And we select the driver emotion as a specific target. The experimental result shows that using of both EEG signals and gestures gets high recognition rates better than using EEG signals or gestures. Both EEG signals and gestures use Interactive Feature Selection(IFS) for the feature selection whose method is based on the reinforcement learning.

Improved Bimodal Speech Recognition Study Based on Product Hidden Markov Model

  • Xi, Su Mei;Cho, Young Im
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제13권3호
    • /
    • pp.164-170
    • /
    • 2013
  • Recent years have been higher demands for automatic speech recognition (ASR) systems that are able to operate robustly in an acoustically noisy environment. This paper proposes an improved product hidden markov model (HMM) used for bimodal speech recognition. A two-dimensional training model is built based on dependently trained audio-HMM and visual-HMM, reflecting the asynchronous characteristics of the audio and video streams. A weight coefficient is introduced to adjust the weight of the video and audio streams automatically according to differences in the noise environment. Experimental results show that compared with other bimodal speech recognition approaches, this approach obtains better speech recognition performance.

한글 자소의 획 정보에 의한 멀티미디어 단말기에서의 온라인 한글 문자 인식 (On-Line Korean Character Recognition by the Stroke Information of Korean Phoneme in Multimedia Terminal)

  • 오준택;정모문;이우범;김욱현
    • 융합신호처리학회논문지
    • /
    • 제1권1호
    • /
    • pp.64-73
    • /
    • 2000
  • 멀티미디어 단말기에서 사용자 인터페이스를 위한 한글 문자 인식기술은 빠른 처리시간과 높은 인식률을 요구한다. 따라서 본 논문에서는 특징점, 특징벡터, 가상벡터, 획간의 위치관계와 한글의 특성정보를 이용한 자소와 문자 인식기술을 제안한다. 그리고 사용자의 다양한 필체 유형에 따른 자소와 문자 인식을 위해서 한글의 특성정보와 다양한 획 정보로 구성된 한글데이터 베이스를 구축한다. 또한, 복잡한 자소 분리와 처리과정의 단순화를 위해서 획간의 위치관계에 의한 순차적 처리와 각 자소들이 가지는 획 수의 변경에 의한 백트래킹 처리를 사용한다. 제안된 온라인 한글 문자 인식기는 상용 1,200단어 중 10명이 필기한 총 600문자를 대상으로 실험한 결과 $95^{\circ}C$이상의 인식률과 13msec의 평균문자처리시간을 얻었다.

  • PDF

Hybrid Feature Selection Using Genetic Algorithm and Information Theory

  • Cho, Jae Hoon;Lee, Dae-Jong;Park, Jin-Il;Chun, Myung-Geun
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제13권1호
    • /
    • pp.73-82
    • /
    • 2013
  • In pattern classification, feature selection is an important factor in the performance of classifiers. In particular, when classifying a large number of features or variables, the accuracy and computational time of the classifier can be improved by using the relevant feature subset to remove the irrelevant, redundant, or noisy data. The proposed method consists of two parts: a wrapper part with an improved genetic algorithm(GA) using a new reproduction method and a filter part using mutual information. We also considered feature selection methods based on mutual information(MI) to improve computational complexity. Experimental results show that this method can achieve better performance in pattern recognition problems than other conventional solutions.

WLDF: Effective Statistical Shape Feature for Cracked Tongue Recognition

  • Li, Xiao-qiang;Wang, Dan;Cui, Qing
    • Journal of Electrical Engineering and Technology
    • /
    • 제12권1호
    • /
    • pp.420-427
    • /
    • 2017
  • This paper proposes a new method using Wide Line Detector based statistical shape Feature (WLDF) to identify whether or not a tongue is cracked; a cracked tongue is one of the most frequently used visible features for diagnosis in traditional Chinese Medicine (TCM). We first detected a wide line in the tongue image, and then extracted WLDF, such as the maximum length of each detected region, and the ratio between maximum length and the area of the detected region. We trained a binary support vector machine (SVM) based on the WLDF to build a classifier for cracked tongues. We conducted an experiment based on our proposed scheme, using 196 samples of cracked tongues and 245 samples of non-cracked tongues. The results of the experiment indicate that the recognition accuracy of the proposed method is greater than 95%. In addition, we provide an analysis of the results of this experiment with different parameters, demonstrating the feasibility and effectiveness of the proposed scheme.

RFID Tag Protection using Face Feature

  • Park, Sung-Hyun;Rhee, Sang-Burm
    • 반도체디스플레이기술학회지
    • /
    • 제6권2호
    • /
    • pp.59-63
    • /
    • 2007
  • Radio Frequency Identification (RFID) is a common term for technologies using micro chips that are able to communicate over short-range radio and that can be used for identifying physical objects. RFID technology already has several application areas and more are being envisioned all the time. While it has the potential of becoming a really ubiquitous part of the information society over time, there are many security and privacy concerns related to RFID that need to be solved. This paper proposes a method which could protect private information and ensure RFID's identification effectively storing face feature information on RFID tag. This method improved linear discriminant analysis has reduced the dimension of feature information which has large size of data. Therefore, face feature information can be stored in small memory field of RFID tag. The proposed algorithm in comparison with other previous methods shows better stability and elevated detection rate and also can be applied to the entrance control management system, digital identification card and others.

  • PDF

Design of Prototype-Based Emotion Recognizer Using Physiological Signals

  • Park, Byoung-Jun;Jang, Eun-Hye;Chung, Myung-Ae;Kim, Sang-Hyeob
    • ETRI Journal
    • /
    • 제35권5호
    • /
    • pp.869-879
    • /
    • 2013
  • This study is related to the acquisition of physiological signals of human emotions and the recognition of human emotions using such physiological signals. To acquire physiological signals, seven emotions are evoked through stimuli. Regarding the induced emotions, the results of skin temperature, photoplethysmography, electrodermal activity, and an electrocardiogram are recorded and analyzed as physiological signals. The suitability and effectiveness of the stimuli are evaluated by the subjects themselves. To address the problem of the emotions not being recognized, we introduce a methodology for a recognizer using prototype-based learning and particle swarm optimization (PSO). The design involves two main phases: i) PSO selects the P% of the patterns to be treated as prototypes of the seven emotions; ii) PSO is instrumental in the formation of the core set of features. The experiments show that a suitable selection of prototypes and a substantial reduction of the feature space can be accomplished, and the recognizer formed in this manner is characterized by high recognition accuracy for the seven emotions using physiological signals.

Robust Histogram Equalization Using Compensated Probability Distribution

  • Kim, Sung-Tak;Kim, Hoi-Rin
    • 대한음성학회지:말소리
    • /
    • 제55권
    • /
    • pp.131-142
    • /
    • 2005
  • A mismatch between the training and the test conditions often causes a drastic decrease in the performance of the speech recognition systems. In this paper, non-linear transformation techniques based on histogram equalization in the acoustic feature space are studied for reducing the mismatched condition. The purpose of histogram equalization(HEQ) is to convert the probability distribution of test speech into the probability distribution of training speech. While conventional histogram equalization methods consider only the probability distribution of a test speech, for noise-corrupted test speech, its probability distribution is also distorted. The transformation function obtained by this distorted probability distribution maybe bring about miss-transformation of feature vectors, and this causes the performance of histogram equalization to decrease. Therefore, this paper proposes a new method of calculating noise-removed probability distribution by using assumption that the CDF of noisy speech feature vectors consists of component of speech feature vectors and component of noise feature vectors, and this compensated probability distribution is used in HEQ process. In the AURORA-2 framework, the proposed method reduced the error rate by over $44\%$ in clean training condition compared to the baseline system. For multi training condition, the proposed methods are also better than the baseline system.

  • PDF

Parking Space Recognition for Autonomous Valet Parking Using Height and Salient-Line Probability Maps

  • Han, Seung-Jun;Choi, Jeongdan
    • ETRI Journal
    • /
    • 제37권6호
    • /
    • pp.1220-1230
    • /
    • 2015
  • An autonomous valet parking (AVP) system is designed to locate a vacant parking space and park the vehicle in which it resides on behalf of the driver, once the driver has left the vehicle. In addition, the AVP is able to direct the vehicle to a location desired by the driver when requested. In this paper, for an AVP system, we introduce technology to recognize a parking space using image sensors. The proposed technology is mainly divided into three parts. First, spatial analysis is carried out using a height map that is based on dense motion stereo. Second, modelling of road markings is conducted using a probability map with a new salient-line feature extractor. Finally, parking space recognition is based on a Bayesian classifier. The experimental results show an execution time of up to 10 ms and a recognition rate of over 99%. Also, the performance and properties of the proposed technology were evaluated with a variety of data. Our algorithms, which are part of the proposed technology, are expected to apply to various research areas regarding autonomous vehicles, such as map generation, road marking recognition, localization, and environment recognition.

Speaker Adaptation Using ICA-Based Feature Transformation

  • Jung, Ho-Young;Park, Man-Soo;Kim, Hoi-Rin;Hahn, Min-Soo
    • ETRI Journal
    • /
    • 제24권6호
    • /
    • pp.469-472
    • /
    • 2002
  • Speaker adaptation techniques are generally used to reduce speaker differences in speech recognition. In this work, we focus on the features fitted to a linear regression-based speaker adaptation. These are obtained by feature transformation based on independent component analysis (ICA), and the feature transformation matrices are estimated from the training data and adaptation data. Since the adaptation data is not sufficient to reliably estimate the ICA-based feature transformation matrix, it is necessary to adjust the ICA-based feature transformation matrix estimated from a new speaker utterance. To cope with this problem, we propose a smoothing method through a linear interpolation between the speaker-independent (SI) feature transformation matrix and the speaker-dependent (SD) feature transformation matrix. From our experiments, we observed that the proposed method is more effective in the mismatched case. In the mismatched case, the adaptation performance is improved because the smoothed feature transformation matrix makes speaker adaptation using noisy speech more robust.

  • PDF