Browse > Article
http://dx.doi.org/10.9708/jksci.2015.20.2.021

Prototype based Classification by Generating Multidimensional Spheres per Class Area  

Shim, Seyong (Dept. of Computer Science, Dankook University)
Hwang, Doosung (Dept. of Kinesiologic Medical Science & Computer Science, Dankook University)
Abstract
In this paper, we propose a prototype-based classification learning by using the nearest-neighbor rule. The nearest-neighbor is applied to segment the class area of all the training data into spheres within which the data exist from the same class. Prototypes are the center of spheres and their radii are computed by the mid-point of the two distances to the farthest same class point and the nearest another class point. And we transform the prototype selection problem into a set covering problem in order to determine the smallest set of prototypes that include all the training data. The proposed prototype selection method is based on a greedy algorithm that is applicable to the training data per class. The complexity of the proposed method is not complicated and the possibility of its parallel implementation is high. The prototype-based classification learning takes up the set of prototypes and predicts the class of test data by the nearest neighbor rule. In experiments, the generalization performance of our prototype classifier is superior to those of the nearest neighbor, Bayes classifier, and another prototype classifier.
Keywords
Prototype selection; Nearest-neighbor rule; Classification learning; Set covering optimization; Greedy algorithm;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 X. Wu et al., "The top ten algorithms in data mining," CRC Press, 2009.
2 T. Hastie, R. Tibshirani, and J. Friedman, "The Elements of Statistical Learning: Data Mining," Inference, and Prediction, Springer Series in Statistics, 2001.
3 J. Arturo Olvera-Lopez, J. Ariel Carrasco-Ochoa, J. Francisco Martinez Trinidad, and J. Kittler, "A review of instance selection methods," Artif. Intell. Rev Vol. 34, No. 2, pp. 133-143, Aug. 2010.   DOI
4 S. Garcia, J. Derrac, J. Cano, and F. Herrera, "Prototype Selection for Nearest Neighbor Classification : Taxonomy and Empirical Study," IEEE Transactions on Pattern Analysis and Machone Intelligence, Vol. 34, No. 3, pp. 417-435, Mar. 2012.   DOI   ScienceOn
5 D. S. Hwang and D. W. Kim, "Near-boundary data selection for fast support vector machines," Malasian journal of Computer Science, Vol. 25(l), pp. 23-37, Mar. 2012
6 F. Angiulli, "FastNearestNeighbor Condensation for Large Data Sets Classification," IEEE Transactions onKnowledge andData Engineering, Vol. 19, No. 11, pp. 1450-1464, Nov. 2007.   DOI   ScienceOn
7 D. R. Wilson, and T. R. Martinez, "Reduction Techniques for Instance-BasedLearning Algorithms," Machine Learning, Vol. 38, No. 3, pp. 257-286, Mar. 2000.   DOI
8 J. Bien and R. Tibshirani, "Prototype selection for interpretable classification," The Annuals of Applied Statistics Vol. 5, No. 4, pp. 2403-2424, Dec, 2011.   DOI
9 I. Takigawa, M. Kudo, and A. Nakamura, "Convex sets as prototypes for classifying patterns," Engineering Applications of Artificial Intelligence, Vol. 22, No. 1, pp.101-108, Feb. 2009.   DOI   ScienceOn
10 D. Marchette, "Class cover catch digraphs," Wiley Interdisciplinary Reviews : Computational Statistics Vol. 2, No. 2, pp. 171-177, Mar. 2010.   DOI
11 R. Younsi, and A. Bagnall, "An efficient randomised sphere cover classifier," Int. J. of Data Mining, Modelling and Management, Vol. 4, No. 2, pp.156-171, Jan. 2012.   DOI
12 GLPK, The GLPK Linear Programming Kit Package, https://www.gnu.org/software/glpk/
13 Vijay V. Vazirani, "Approximation Algorithms," Springer, 2001.
14 UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/
15 The DELVE Manual, http://www.cs.utoronto.ca/-delve/
16 Stalog project, http://www1.maths.leed.ac.uk/-charles/statlog/indexdos.html
17 K. S. Kim and D. S. Hwang "Support Vector Machine Algorithm for Imbalanced Data Learning," Journal of the Korea Society of Computer and Information, Vol. 15, No. 7, pp. 11-17, July. 2010   과학기술학회마을   DOI   ScienceOn