Design and Implementation of the Ensemble-based Classification Model by Using k-means Clustering

Song, Sung-Yeol; Khil, A-Ra

doi:10.9708/jksci.2015.20.10.031

Journal of the Korea Society of Computer and Information, Vol. 20, No. 10, pp. 31-38, 2015

pISSN 1598-849X / eISSN 2383-9945

Korean Society of Computer Information


Song, Sung-Yeol (Dept. of Computer Science and Engineering, Soongsil University)
Khil, A-Ra (Dept. of Computer Science and Engineering, Soongsil University)

Received: 2015.07.15
Reviewed: 2015.09.22
Published: 2015.10.30

https://doi.org/10.9708/jksci.2015.20.10.031


Abstract

In this paper, we propose an ensemble-based classification model that uses clustering to extract only new data patterns from streaming data and generates new classification models to add to the ensemble, reducing the amount of data that must be labeled while maintaining the accuracy of the existing system. The proposed technique clusters similarly patterned data from the stream and labels each cluster once a certain amount of data has been gathered. It applies the k-NN technique to each classification model unit so that the accuracy of the existing system is maintained even with a small amount of data. Simulation results on benchmarks show that, by using clustering, the proposed technique is efficient, using about 3% less data than the existing technique.
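
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `kmeans`, `KnnUnit`, `StreamEnsemble`, and `oracle`, the distance-based novelty test (`radius`), the choice to label each cluster through its centroid, and the distance-weighted vote are all assumptions made for illustration, since the abstract does not specify these details.

```python
import math
import random
from collections import Counter


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means over tuples of floats; returns k centroids."""
    centroids = random.Random(seed).sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centroids[c]))
            groups[nearest].append(p)
        for c, members in enumerate(groups):
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = tuple(sum(axis) / len(members) for axis in zip(*members))
    return centroids


class KnnUnit:
    """One ensemble member: k-NN over a few labeled cluster centroids."""

    def __init__(self, examples, k=1):
        self.examples = examples  # list of (point, label) pairs
        self.k = k

    def predict(self, x):
        ranked = sorted(self.examples, key=lambda e: math.dist(x, e[0]))
        nearest = ranked[: self.k]
        label = Counter(lbl for _, lbl in nearest).most_common(1)[0][0]
        return label, math.dist(x, nearest[0][0])


class StreamEnsemble:
    """Hypothetical ensemble that requests labels per cluster, not per point."""

    def __init__(self, oracle, chunk_size=10, n_clusters=2, radius=2.0):
        self.oracle = oracle          # assumed labeling function (e.g. a human expert)
        self.chunk_size = chunk_size  # the "certain amount of data" to gather
        self.n_clusters = n_clusters
        self.radius = radius          # assumed novelty threshold
        self.buffer, self.units = [], []
        self.labels_requested = 0

    def _is_known(self, x):
        return any(math.dist(x, p) <= self.radius
                   for u in self.units for p, _ in u.examples)

    def observe(self, x):
        if self._is_known(x):
            return  # pattern already covered by the ensemble; no label needed
        self.buffer.append(x)
        if len(self.buffer) < self.chunk_size:
            return
        centroids = kmeans(self.buffer, self.n_clusters)
        examples = [(c, self.oracle(c)) for c in centroids]  # one label per cluster
        self.labels_requested += len(examples)
        self.units.append(KnnUnit(examples))
        self.buffer = []

    def predict(self, x):
        if not self.units:
            raise ValueError("no classification units have been built yet")
        votes = Counter()
        for u in self.units:
            label, d = u.predict(x)
            votes[label] += 1.0 / (d + 1e-9)  # closer units carry more weight
        return votes.most_common(1)[0][0]
```

In this sketch a point is sent for labeling only indirectly, through the cluster it falls into, and points that lie near an already labeled cluster are skipped entirely. That is one simple way to realize the idea of extracting only new patterns; the paper's actual novelty criterion and voting scheme may differ.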

