유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용

Jang, Yeong-Sik;Kim, Jong-U;Heo, Jun;

한국지능정보시스템학회:학술대회논문집 (Proceedings of the Korea Inteligent Information System Society Conference)

한국지능정보시스템학회 (Korea Intelligent Information System Society)

유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용

장영식 (한양대학교 대학원 경영학과) ;
김종우 (한양대학교 경영대학 경영학부) ;
허준

Jang, Yeong-Sik ;
Kim, Jong-U ;
Heo, Jun (SPSS Korea Data Solution Inc.)

발행 : 2007.05.18

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

The data imbalance problem which can be uncounted in data mining classification problems typically means that there are more or less instances in a class than those in other classes. It causes low prediction accuracy of the minority class because classifiers tend to assign instances to major classes and ignore the minor class to reduce overall misclassification rate. In order to solve the data imbalance problem, there has been proposed a number of techniques based on resampling with replacement, adjusting decision thresholds, and adjusting the cost of the different classes. In this paper, we study the feasibility of the combination usage of the techniques previously proposed to deal with the data imbalance problem, and suggest a combination method using genetic algorithm to find the optimal combination ratio of the techniques. To improve the prediction accuracy of a minority class, we determine the combination ratio based on the F-value of the minority class as the fitness function of genetic algorithm. To compare the performance with those of single techniques and the matrix-style combination of random percentage, we performed experiments using four public datasets which has been generally used to compare the performance of methods for the data imbalance problem. From the results of experiments, we can find the usefulness of the proposed method.

한국지능정보시스템학회:학술대회논문집 (Proceedings of the Korea Inteligent Information System Society Conference)

유전자 알고리즘을 활용한 데이터 불균형 해소 기법의 조합적 활용

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)