Browse > Article
http://dx.doi.org/10.3745/JIPS.2013.9.1.031

A Feature Selection-based Ensemble Method for Arrhythmia Classification  

Namsrai, Erdenetuya (Database/Bioinformatics Laboratory, Chungbuk National University)
Munkhdalai, Tsendsuren (Database/Bioinformatics Laboratory, Chungbuk National University)
Li, Meijing (Database/Bioinformatics Laboratory, Chungbuk National University)
Shin, Jung-Hoon (Dept. of Software Engineering, Chonbuk National University)
Namsrai, Oyun-Erdene (Dept. of Information Technology, Mongolian National University)
Ryu, Keun Ho (Database/Bioinformatics Laboratory, Chungbuk National University)
Publication Information
Journal of Information Processing Systems / v.9, no.1, 2013 , pp. 31-40 More about this Journal
Abstract
In this paper, a novel method is proposed to build an ensemble of classifiers by using a feature selection schema. The feature selection schema identifies the best feature sets that affect the arrhythmia classification. Firstly, a number of feature subsets are extracted by applying the feature selection schema to the original dataset. Then classification models are built by using the each feature subset. Finally, we combine the classification models by adopting a voting approach to form a classification ensemble. The voting approach in our method involves both classification error rate and feature selection rate to calculate the score of the each classifier in the ensemble. In our method, the feature selection rate depends on the extracting order of the feature subsets. In the experiment, we applied our method to arrhythmia dataset and generated three top disjointed feature sets. We then built three classifiers based on the top-three feature subsets and formed the classifier ensemble by using the voting approach. Our method can improve the classification accuracy in high dimensional dataset. The performance of each classifier and the performance of their ensemble were higher than the performance of the classifier that was based on whole feature space of the dataset. The classification performance was improved and a more stable classification model could be constructed with the proposed approach.
Keywords
Data Mining; Ensemble Method; Fe ature Selection; Arrhythmia Classification;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Lior Rokach, Barak Chizi, "A Methodology for Improving the Performance of Non-ranker Feature Selection Filters", International Journal Pattern Recognition and Artificial Intelligence, 2007, pp.1-20.
2 Eugene Tuv, Alexander Borisov, George Runger, "Feature selection with Ensembles, Artificial variables, and Redundancy elimination", Journal of Machine Learning Research, Vol.10, 2009, pp.1341-1366.
3 Ching Wei Bang, "New Ensemble Machine Learning Method for Classification and Prediction on Gene Expression Data", IEEE, 2006, pp.3478-3481.
4 Mohamed Ezzeldin A.Bashir, Dong Gyu Lee, Keun Ho Ryu, "Nested Ensemble Technique for Excellence Real time cardiac Health Monitoring", BIOCOMP, 2010.
5 S.Oh, M.S Lee, B. Zhang, "Ensemble learning with active example selection for imbalanced biomedical data classification", IEEE/ACM transactions on computational biology and bioinformatics, Vol.8, No.2, 2011.
6 Thara Soman, Patrick O.Bobbie, "Classification of Arrhythmia Using Machine Learning Techniques", WSEAS Transactions on computers, Vol.4, June, 2005, pp.548-552.
7 Mohd Fauzi bin Othman, Thomas Moh Shan Yau, "Comparison of Different Classification Techniques Using WEKA for Breast cancer", IFMBE Proceedings Vol.15, 2007, pp.520-523.
8 Asha Gowda Karegowda, M.A.Jayaram, A.S. Manjunath, "Feature Subset Selection Problem using Wrapper Approach in Supervised learning", International journal of Computer applications, Vol.1, No.7, 2010, pp.13-17.
9 Pengy Yang, Yee Hwa Yang, Bing B.Zhou, "A Review of Ensemble Methods in Bioinformatics", Current Bioinformatics, Vol.5, No.4, 2010, pp.296-308.   DOI
10 Ching Wei Bang, "New Ensemble Machine Learning Method for Classification and Prediction on Gene Expression Data", IEEE, 2006, pp.3478-3481.
11 UCI Machine Learning Repository, http://archive.ics.edu/ml/datasets/Arrhythmia
12 Weka web site, http://www.cs.waikato.ac.nz/ml/WEKA/
13 Dymitr Ruta, Bogdan Gabrys, "Classifier selection for majority voting", Information fusion, Vol.6, 2005, pp.63-81.   DOI   ScienceOn
14 F.Yaghouby, A.Ayatollahi, R.Soleimani, "Classification of Cardiac Abnormalities Using Reduced Features of Heart Rate Variability Signal", World Applied Sciences Journal, Vol.6, 2009, pp.1547-1554.
15 M.R.Homaeinezhad, E.Tavakkoli, M.Habibi, "Combination of Different Classifiers for cardiac Arrhythmia Recognition", World Academy of Science, Engiinering and technology, 2011, pp.1189-1200.
16 Zhi Hua Zhou, "Ensemble Learning", Encyclopedia of Bioinformatics, 2009, pp.270-273.
17 Ho Sun Shon, Kyung-Sook Yang, Keun Ho Ryu, "Feature Selection Method using WF-LASSO for Gene Expression Data analysis", ACM-BCM, 2011, pp.522-525.
18 Jon Atli Benediktsson, Havier Ceamanos Garcia, Bjorn Waske, Jocelyn Chanussor, "Ensemble Methods for Classification of Hyperspectral Data", IEEE, 2008, pp.62-65.
19 Guangrong Li, Xiaohua Hu, Xiajiong Shen, Xin Chen, "A Novel Unsupervised feature Selection Method for Bioinformatics Data Sets through Feature Clustering", IEEE, 2008, pp.41-47.
20 Schapire RE, Freund Y, Bartlett P, Lee WS, "Boosting the margin: A new explanation for the effectiveness of voting methods", Statistics, 1998, pp.1651-1686.
21 Alexey Tsymbal, Seppo Puuronen, David Patterson, "Feature selection for Simple Bayesian Classifiers", ISSU, 2002, pp.592-600.