• 제목/요약/키워드: Feature learning

검색결과 1,924건 처리시간 0.031초

Development of Interactive Feature Selection Algorithm(IFS) for Emotion Recognition

  • Yang, Hyun-Chang;Kim, Ho-Duck;Park, Chang-Hyun;Sim, Kwee-Bo
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제6권4호
    • /
    • pp.282-287
    • /
    • 2006
  • This paper presents an original feature selection method for Emotion Recognition which includes many original elements. Feature selection has some merits regarding pattern recognition performance. Thus, we developed a method called thee 'Interactive Feature Selection' and the results (selected features) of the IFS were applied to an emotion recognition system (ERS), which was also implemented in this research. The innovative feature selection method was based on a Reinforcement Learning Algorithm and since it required responses from human users, it was denoted an 'Interactive Feature Selection'. By performing an IFS, we were able to obtain three top features and apply them to the ERS. Comparing those results from a random selection and Sequential Forward Selection (SFS) and Genetic Algorithm Feature Selection (GAFS), we verified that the top three features were better than the randomly selected feature set.

얼굴 인식률 향상을 위한 멀티 블록 방식의 딥러닝 구조에 관한 연구 (A Study on Deep Learning Structure of Multi-Block Method for Improving Face Recognition)

  • 라승탁;김홍직;이승호
    • 전기전자학회논문지
    • /
    • 제22권4호
    • /
    • pp.933-940
    • /
    • 2018
  • 본 논문에서는 얼굴 인식률 향상을 위한 멀티 블록 방식의 딥러닝 구조를 제안한다. 제안하는 딥러닝의 인식 구조는 입력된 이미지의 멀티 블록화, 특징 수치 분석을 통한 멀티 블록 선정, 선정된 멀티 블록의 딥러닝 수행 등의 3가지 과정으로 구성된다. 첫 번째로 입력된 이미지의 멀티 블록화는 입력된 이미지를 4등분하여 멀티 블록화 시킨다. 두 번째로 특징 수치분석을 통한 멀티 블록 선정에서는 4등분된 멀티 블록들의 특징 수치를 확인하고 특징이 많이 부각되는 블록만을 선정하여 얼굴 인식에 방해가 되는 요소를 사전에 제거한 블록들을 선정한다. 세 번째로 선정된 멀티 블록으로 딥러닝 수행은 선정된 멀티 블록 부위가 학습되어진 딥러닝 모델에 인식을 수행하여 특징 수치가 높은 효율적인 블록으로 얼굴 인식의 결과를 도출한다. 제안된 딥러닝 구조의 성능을 평가하기 위하여 CAS-PEAL 얼굴 데이터베이스를 사용하여 실험 하였다. 실험 결과, 제안하는 멀티 블록 방식의 딥러닝 구조가 기존의 딥러닝 구조보다 평균 약 2.3% 향상된 얼굴 인식률을 나타내어 그 효용성이 입증됨을 확인하였다.

이진 분류를 위하여 거리계산을 이용한 특징 변환 기반의 가중된 최소 자승법 (Weighted Least Squares Based on Feature Transformation using Distance Computation for Binary Classification)

  • 장세인;박충식
    • 한국정보통신학회논문지
    • /
    • 제24권2호
    • /
    • pp.219-224
    • /
    • 2020
  • 이진 분류(binary classification)는 머신러닝(machine learning) 분야에서 많이 다루어진 주제이다. 게다가 이진 분류는 다중 분류로 쉽게 발전될 수 있는 중요한 분야이다. 머신러닝 방법들을 적용할 때에 전처리(preprocessing)이나 특징 추출(feature extraction)과 같은 작업이 필수적이다. 이는 분류기 성능을 향상시키기 위한 중요한 작업이다. 본 논문에서는 가중된 최소 자승법을 기반으로 새로운 머신러닝 방법을 제안한다. 또한, 특징 변환시킬 수 있는 새로운 가중치 계산 방법을 제안한다. 이를 통해 특징 변환과 동시에 학습을 진행할 수 있는 방법을 제안한다. 본 제안을 다섯 개의 머신러닝 데이터베이스에서 실험을 진행하였으며 이 데이터베이스에서 우수한 성능을 얻을 수 있었다.

MIMO-OFDM 시스템에서 에너지 효율성을 위한 기계 학습 기반 적응형 전송 기술 및 Feature Space 연구 (Machine-Learning-Based Link Adaptation for Energy-Efficient MIMO-OFDM Systems)

  • 오명석;김기범;박현철
    • 한국전자파학회논문지
    • /
    • 제27권5호
    • /
    • pp.407-415
    • /
    • 2016
  • 무선 통신의 최근 동향을 살펴보면 에너지 효율적 전송의 중요성이 강조되고 있다. 본 논문은 multiple-input multiple-output orthogonal frequency division multiplexing(MIMO-OFDM) 무선 시스템에서 에너지 효율성을 최대화하기 위해 기계학습 기술을 사용하는 적응형 전송을 고려한다. MIMO-OFDM 시스템의 채널 상태를 효과적으로 나타내기 위한 two- dimensional capacity(2D-CAP) feature space와 classification 기술을 통해 에너지 효율적인 적응형 전송을 수행하는 machine-learning-based bit and power adaptation(ML-BPA) 알고리즘을 제안한다. 모의 실험 결과를 통해 2D-CAP이 본 논문이 고려하는 무선 채널 상태를 정확하게 나타내며, 이를 통해 적응형 전송의 성능을 향상시킴을 확인하였다. 또한, ordered postprocessing signal-to-noise ratio(ordSNR)를 포함한 다른 feature space들과 직접적인 비교를 통해 2D-CAP이 전송 성능이나 복잡도 측면에서 뚜렷한 이득을 가짐을 확인하였다.

Intelligent System for the Prediction of Heart Diseases Using Machine Learning Algorithms with Anew Mixed Feature Creation (MFC) technique

  • Rawia Elarabi;Abdelrahman Elsharif Karrar;Murtada El-mukashfi El-taher
    • International Journal of Computer Science & Network Security
    • /
    • 제23권5호
    • /
    • pp.148-162
    • /
    • 2023
  • Classification systems can significantly assist the medical sector by allowing for the precise and quick diagnosis of diseases. As a result, both doctors and patients will save time. A possible way for identifying risk variables is to use machine learning algorithms. Non-surgical technologies, such as machine learning, are trustworthy and effective in categorizing healthy and heart-disease patients, and they save time and effort. The goal of this study is to create a medical intelligent decision support system based on machine learning for the diagnosis of heart disease. We have used a mixed feature creation (MFC) technique to generate new features from the UCI Cleveland Cardiology dataset. We select the most suitable features by using Least Absolute Shrinkage and Selection Operator (LASSO), Recursive Feature Elimination with Random Forest feature selection (RFE-RF) and the best features of both LASSO RFE-RF (BLR) techniques. Cross-validated and grid-search methods are used to optimize the parameters of the estimator used in applying these algorithms. and classifier performance assessment metrics including classification accuracy, specificity, sensitivity, precision, and F1-Score, of each classification model, along with execution time and RMSE the results are presented independently for comparison. Our proposed work finds the best potential outcome across all available prediction models and improves the system's performance, allowing physicians to diagnose heart patients more accurately.

A Sparse Target Matrix Generation Based Unsupervised Feature Learning Algorithm for Image Classification

  • Zhao, Dan;Guo, Baolong;Yan, Yunyi
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권6호
    • /
    • pp.2806-2825
    • /
    • 2018
  • Unsupervised learning has shown good performance on image, video and audio classification tasks, and much progress has been made so far. It studies how systems can learn to represent particular input patterns in a way that reflects the statistical structure of the overall collection of input patterns. Many promising deep learning systems are commonly trained by the greedy layerwise unsupervised learning manner. The performance of these deep learning architectures benefits from the unsupervised learning ability to disentangling the abstractions and picking out the useful features. However, the existing unsupervised learning algorithms are often difficult to train partly because of the requirement of extensive hyperparameters. The tuning of these hyperparameters is a laborious task that requires expert knowledge, rules of thumb or extensive search. In this paper, we propose a simple and effective unsupervised feature learning algorithm for image classification, which exploits an explicit optimizing way for population and lifetime sparsity. Firstly, a sparse target matrix is built by the competitive rules. Then, the sparse features are optimized by means of minimizing the Euclidean norm ($L_2$) error between the sparse target and the competitive layer outputs. Finally, a classifier is trained using the obtained sparse features. Experimental results show that the proposed method achieves good performance for image classification, and provides discriminative features that generalize well.

Comparison of Machine Learning Techniques for Cyberbullying Detection on YouTube Arabic Comments

  • Alsubait, Tahani;Alfageh, Danyah
    • International Journal of Computer Science & Network Security
    • /
    • 제21권1호
    • /
    • pp.1-5
    • /
    • 2021
  • Cyberbullying is a problem that is faced in many cultures. Due to their popularity and interactive nature, social media platforms have also been affected by cyberbullying. Social media users from Arab countries have also reported being a target of cyberbullying. Machine learning techniques have been a prominent approach used by scientists to detect and battle this phenomenon. In this paper, we compare different machine learning algorithms for their performance in cyberbullying detection based on a labeled dataset of Arabic YouTube comments. Three machine learning models are considered, namely: Multinomial Naïve Bayes (MNB), Complement Naïve Bayes (CNB), and Linear Regression (LR). In addition, we experiment with two feature extraction methods, namely: Count Vectorizer and Tfidf Vectorizer. Our results show that, using count vectroizer feature extraction, the Logistic Regression model can outperform both Multinomial and Complement Naïve Bayes models. However, when using Tfidf vectorizer feature extraction, Complement Naive Bayes model can outperform the other two models.

Study of Machine-Learning Classifier and Feature Set Selection for Intent Classification of Korean Tweets about Food Safety

  • Yeom, Ha-Neul;Hwang, Myunggwon;Hwang, Mi-Nyeong;Jung, Hanmin
    • Journal of Information Science Theory and Practice
    • /
    • 제2권3호
    • /
    • pp.29-39
    • /
    • 2014
  • In recent years, several studies have proposed making use of the Twitter micro-blogging service to track various trends in online media and discussion. In this study, we specifically examine the use of Twitter to track discussions of food safety in the Korean language. Given the irregularity of keyword use in most tweets, we focus on optimistic machine-learning and feature set selection to classify collected tweets. We build the classifier model using Naive Bayes & Naive Bayes Multinomial, Support Vector Machine, and Decision Tree Algorithms, all of which show good performance. To select an optimum feature set, we construct a basic feature set as a standard for performance comparison, so that further test feature sets can be evaluated. Experiments show that precision and F-measure performance are best when using a Naive Bayes Multinomial classifier model with a test feature set defined by extracting Substantive, Predicate, Modifier, and Interjection parts of speech.

CNN-based Android Malware Detection Using Reduced Feature Set

  • Kim, Dong-Min;Lee, Soo-jin
    • 한국컴퓨터정보학회논문지
    • /
    • 제26권10호
    • /
    • pp.19-26
    • /
    • 2021
  • 딥러닝 기반 악성코드 탐지 및 분류모델의 성능은 특성집합을 어떻게 구성하느냐에 따라 크게 좌우된다. 본 논문에서는 CNN 기반의 안드로이드 악성코드 탐지 시 탐지성능을 극대화할 수 있는 최적의 특성집합(feature set)을 선정하는 방법을 제안한다. 특성집합에 포함될 특성은 기계학습 및 딥러닝에서 특성추출을 위해 널리 사용되는 Chi-Square test 알고리즘을 사용하여 선정하였다. CICANDMAL2017 데이터세트를 대상으로 선정된 36개의 특성을 이용하여 CNN 모델을 학습시킨 후 악성코드 탐지성능을 측정한 결과 이진분류에서는 99.99%, 다중분류에서는 98.55%의 Accuracy를 달성하였다.

음각 정보를 이용한 딥러닝 기반의 알약 식별 알고리즘 연구 (Pill Identification Algorithm Based on Deep Learning Using Imprinted Text Feature)

  • 이선민;김영재;김광기
    • 대한의용생체공학회:의공학회지
    • /
    • 제43권6호
    • /
    • pp.441-447
    • /
    • 2022
  • In this paper, we propose a pill identification model using engraved text feature and image feature such as shape and color, and compare it with an identification model that does not use engraved text feature to verify the possibility of improving identification performance by improving recognition rate of the engraved text. The data consisted of 100 classes and used 10 images per class. The engraved text feature was acquired through Keras OCR based on deep learning and 1D CNN, and the image feature was acquired through 2D CNN. According to the identification results, the accuracy of the text recognition model was 90%. The accuracy of the comparative model and the proposed model was 91.9% and 97.6%. The accuracy, precision, recall, and F1-score of the proposed model were better than those of the comparative model in terms of statistical significance. As a result, we confirmed that the expansion of the range of feature improved the performance of the identification model.