• Title/Summary/Keyword: 특징 집합 선택

Search Result 112, Processing Time 0.027 seconds

Combined Feature Set and Hybrid Feature Selection Method for Effective Document Classification (효율적인 문서 분류를 위한 혼합 특징 집합과 하이브리드 특징 선택 기법)

  • In, Joo-Ho;Kim, Jung-Ho;Chae, Soo-Hoan
    • Journal of Internet Computing and Services
    • /
    • v.14 no.5
    • /
    • pp.49-57
    • /
    • 2013
  • A novel approach for the feature selection is proposed, which is the important preprocessing task of on-line document classification. In previous researches, the features based on information from their single population for feature selection task have been selected. In this paper, a mixed feature set is constructed by selecting features from multi-population as well as single population based on various information. The mixed feature set consists of two feature sets: the original feature set that is made up of words on documents and the transformed feature set that is made up of features generated by LSA. The hybrid feature selection method using both filter and wrapper method is used to obtain optimal features set from the mixed feature set. We performed classification experiments using the obtained optimal feature sets. As a result of the experiments, our expectation that our approach makes better performance of classification is verified, which is over 90% accuracy. In particular, it is confirmed that our approach has over 90% recall and precision that have a low deviation between categories.

A Study on Genetic Feature Selection (유전적 특징선택에 관한 연구)

  • Han, Myung-Mook
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2008.04a
    • /
    • pp.292-293
    • /
    • 2008
  • 많은 분야에서 최적의 기준을 바탕으로 특징들의 부분집합을 선택하는 문제들이 핵심 요소로 작용하고 있다. 다양한 특징들의 부분집합 중에서 가능한 한 가장 성능이 우수한 특징들의 부분집합을 선택하기 위해서는 특징선택 방법이 알고리즘과 적용분야들을 고려해야한다. 이 논문에서는 특징선택을 위해서 서로 다른 두 종류의 최적화 문제를 탐색하는 방법을 제안하고, 그 결과를 실험으로 보여준다.

  • PDF

Feature Combination and Selection Using Genetic Algorithm for Character Recognition (유전 알고리즘을 이용한 특징 결합과 선택)

  • Lee Jin-Seon
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.5
    • /
    • pp.152-158
    • /
    • 2005
  • By using a combination of different feature sets extracted from input character patterns, we can improve the character recognition system performance. To reduce the dimensionality of the combined feature vector, we conduct the feature selection. This paper proposes a general framework for the feature combination and selection for character recognition problems. It also presents a specific design for the handwritten numeral recognition. Tn the design, DDD and AGD feature sets are extracted from handwritten numeral patterns, and a genetic algorithm is used for the feature selection. Experimental result showed a significant accuracy improvement by about 0.7% for the CENPARMI handwrittennumeral database.

  • PDF

Feature Subset Selection Algorithm based on Entropy (엔트로피를 기반으로 한 특징 집합 선택 알고리즘)

  • 홍석미;안종일;정태충
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.2
    • /
    • pp.87-94
    • /
    • 2004
  • The feature subset selection is used as a preprocessing step of a teaming algorithm. If collected data are irrelevant or redundant information, we can improve the performance of learning by removing these data before creating of the learning model. The feature subset selection can also reduce the search space and the storage requirement. This paper proposed a new feature subset selection algorithm that is using the heuristic function based on entropy to evaluate the performance of the abstracted feature subset and feature selection. The ACS algorithm was used as a search method. We could decrease a size of learning model and unnecessary calculating time by reducing the dimension of the feature that was used for learning.

Selective Set Operations based on Feature Modeling History (특징형상 모델링 연혁을 바탕으로 한 선택적 집합 연산)

  • Lee, Sang-Hun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06b
    • /
    • pp.280-281
    • /
    • 2011
  • 특징형상기반 다중해상도 모델링 기법은 컴퓨터 그래픽스의 응용분야인 컴퓨터 응용 설계, 해석, 가상생산과 같은 분야에 주목을 받고 있는 새로운 기술이다. 다중해상도 모델을 제공하기 위하여 특징형상을 재배열할 필요가 있는데 이 경우 빼기 더하기 집합연산의 순서가 달라지면 최종형상이 달라질 수 있다. 이러한 문제를 해결하기 위하여 특징형상 모델링 연혁을 고려한 선택적 집합 연산을 개발하였다. 이 작업을 적용하면 최종형상뿐만 아니라 합리저긴 중간단계의 다중해상도 모델도 생성할 수 있다.

A Diagnostic Feature Subset Selection of Breast Tumor Based on Neighborhood Rough Set Model (Neighborhood 러프집합 모델을 활용한 유방 종양의 진단적 특징 선택)

  • Son, Chang-Sik;Choi, Rock-Hyun;Kang, Won-Seok;Lee, Jong-Ha
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.21 no.6
    • /
    • pp.13-21
    • /
    • 2016
  • Feature selection is the one of important issue in the field of data mining and machine learning. It is the technique to find a subset of features which provides the best classification performance, from the source data. We propose a feature subset selection method using the neighborhood rough set model based on information granularity. To demonstrate the effectiveness of proposed method, it was applied to select the useful features associated with breast tumor diagnosis of 298 shape features extracted from 5,252 breast ultrasound images, which include 2,745 benign and 2,507 malignant cases. Experimental results showed that 19 diagnostic features were strong predictors of breast cancer diagnosis and then average classification accuracy was 97.6%.

Features Extraction of Remote Sensed Multispectral Image Data Using Rough Sets Theory (Rough 집합 이론을 이용한 원격 탐사 다중 분광 이미지 데이터의 특징 추출)

  • 원성현;정환묵
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.8 no.3
    • /
    • pp.16-25
    • /
    • 1998
  • In this paper, we propose features extraction method using Rough sets theory for efficient data classifications in hyperspectral environment. First, analyze the properties of multispectral image data, then select the most efficient bands using discemibility of Rough sets theory based on analysis results. The proposed method is applied Landsat TM image data, from this, we verify the equivalence of traditional bands selection method by band features and bands selection method using Rough sets theory that pmposed in this paper. Finally, we present theoretical basis to features extraction in hyperspectral environment.

  • PDF

A Study on Feature selection based the Fuzzy Min-Max Neural Network and Application on Gait Phase recognition using EMG (퍼지 최대-최소 신경망을 이용한 특징 집합 선택에 관한 연구 및 보행 단계인식에의 응용)

  • Lee, Tae-Yeop;Lee, Sang-Wan;Byeon, Jeung-Nam
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.167-171
    • /
    • 2007
  • 본 논문은 패턴 분류 문제에 사용되는 퍼지 최대-최소 신경망 방법을 이용하여 특정 집합으로부터 새로운 특정 집합을 추출해내고 추출된 특정 집합으로부터 의미 있는 특정을 선택해 내는 새로운 방법을 제안한다. 퍼지 최대-최소 신경망은 패턴 분류를 위해 주로 사용이 되어 왔지만, 퍼지 최대-최소 신경망을 이용해 특정 집합의 값들을 패턴 공간내의 초상자의 집합으로 변환하고 변환된 초상자들끼리의 인접성을 척도로 단순한 연산을 통한 빠른 특정 집합을 선택하게 된다. 마지막으로 본 논문의 특정 집합 선택 방법을 하지 근전도 신호를 이용한 보행 패턴 분류에 적용해 보고, 그 결과를 기존 여러 특정 집합 선태 방법들과 비교해 봄으로써 제안한 방법의 타당성 및 적용 가능성을 알아본다.

  • PDF

The Real-Time Face Detection based on Simple Feature (간단한 특징에 기반한 얼굴 검출)

  • 임옥현;이우주;이경일;이배호
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2004.05a
    • /
    • pp.247-250
    • /
    • 2004
  • 본 논문에서는 간단한 사각형 특징과 계층적 분류기를 이용하여 실시간으로 얼굴을 검출하는 방법을 제안하고자 한다. 우리는 다섯 가지 형태의 기본적인 특징 모델을 바탕으로 20*20 크기의 훈련 영상에 적용하여 많은 초기 특징 집합을 구성하였다. AdaBoost(Adaptive Boosting) 알고리즘을 이용한 학습을 통하여 초기 특징 집합 중에서 얼굴 검출하는데 강인한 집합들만을 선택하였다. 제안된 알고리즘을 이용한 실제 실험에서 90% 이상의 높은 검출율을 확인하였고 초당 10프레임의 실시간 검출에도 성공하였다.

  • PDF

A Feature Set Selection Approach Based on Pearson Correlation Coefficient for Real Time Attack Detection (실시간 공격 탐지를 위한 Pearson 상관계수 기반 특징 집합 선택 방법)

  • Kang, Seung-Ho;Jeong, In-Seon;Lim, Hyeong-Seok
    • Convergence Security Journal
    • /
    • v.18 no.5_1
    • /
    • pp.59-66
    • /
    • 2018
  • The performance of a network intrusion detection system using the machine learning method depends heavily on the composition and the size of the feature set. The detection accuracy, such as the detection rate or the false positive rate, of the system relies on the feature composition. And the time it takes to train and detect depends on the size of the feature set. Therefore, in order to enable the system to detect intrusions in real-time, the feature set to beused should have a small size as well as an appropriate composition. In this paper, we show that the size of the feature set can be further reduced without decreasing the detection rate through using Pearson correlation coefficient between features along with the multi-objective genetic algorithm which was used to shorten the size of the feature set in previous work. For the evaluation of the proposed method, the experiments to classify 10 kinds of attacks and benign traffic are performed against NSL_KDD data set.

  • PDF