Browse > Article
http://dx.doi.org/10.5391/IJFIS.2008.8.3.196

Improvement of Support Vector Clustering using Evolutionary Programming and Bootstrap  

Jun, Sung-Hae (Department of Bioinformatics & Statistics, Cheongju University)
Publication Information
International Journal of Fuzzy Logic and Intelligent Systems / v.8, no.3, 2008 , pp. 196-201 More about this Journal
Abstract
Statistical learning theory has three analytical tools which are support vector machine, support vector regression, and support vector clustering for classification, regression, and clustering respectively. In general, their performances are good because they are constructed by convex optimization. But, there are some problems in the methods. One of the problems is the subjective determination of the parameters for kernel function and regularization by the arts of researchers. Also, the results of the learning machines are depended on the selected parameters. In this paper, we propose an efficient method for objective determination of the parameters of support vector clustering which is the clustering method of statistical learning theory. Using evolutionary algorithm and bootstrap method, we select the parameters of kernel function and regularization constant objectively. To verify improved performances of proposed research, we compare our method with established learning algorithms using the data sets form ucr machine learning repository and synthetic data.
Keywords
Support Vector Clustering; Evolutionary Programming; Bootstrap;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. L. N. Fred, A. K. Jain, Robust Data Clustering, Proceeding of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 128-133, 2003
2 B. Ribeiro, On Data Based Learning using Support Vector Clustering, Proceeding of the 9th International Conference on Neural Information Processing, vol. 5, pp. 2516-2521, 2002
3 J. Wang, X. Wu, C. Zhang, Support vector machine based on K-means clustering for real-time business intelligence systems, International Journal of Business Intelligence and Data Mining, vol. 1, no. 1, pp. 54-64, 2005   DOI
4 A. E. Eiben, J. E. Smith, Introduction to Evolutionary Computing, Springer, 2003
5 T. M. Mitchell, Machine Learning, McGraw-Hill, 1997
6 P. Ling, Y. Wang, N. Lu, J. Y. Wang, S. Liang, C. G. Zhou, Two-Phase Support Vector Clustering for Multi-Relational Data Mining, Proceeding of the International Conference on Cyber-worlds, 2005
7 UCI Machine Learning Repository, http://www.ics.uci.edu/-mlearn/MLRepository.html
8 W. L. Martinez, A. R. Martinez, Computational Statistics Handbook with MATRAB, Chapman & Hall, 2002
9 B. S. Everitt, S. Landau, M. Leese, Cluster Analysis, Arnold, 2001
10 A. Ben-Hur, D. Horn, H. T. Siegelmann, V. Vapnik, Support Vector Clustering, Journal of Machine Learning Research, vol. 2, pp. 125-137, 2001   DOI
11 J. C. Chiang, J. S. Wang, A Validity-Guided Support Vector Clustering Algorithm for Identification of Optimal Cluster Configuration, Proceeding of IEEE International Conference on Systems, Man and Cybernetics, pp. 3613-3618, 2004
12 J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2001
13 S. H. Jun, Web Usage Mining Using Evolutionary Support Vector Machine, Lecture Note in Artificial Intelligence, vol. 3809, pp. 1015-1020, 2005
14 K. Krishna, K. Narasimha Murty, Genetic K-means algorithm, Proceeding of IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 29, no. 3, pp. 433-439, 1999   DOI   ScienceOn
15 G. Mclachlan, D. Peel, Finite Mixture Models, John Wiley & Sons, 2000
16 L. Y. Tseng, S. B. Yang, Genetic Algorithms for Clustering, Feature Selection and Classification, Proceeding of International Conference on Neural Networks, vol. 3, pp. 1612-1616, 1997
17 S. H. Jun, K. W. Oh, A Competitive Co-Evolving Support Vector Clustering, Lecture Note in Computer Science, vol. 4232, pp. 864-873, 2006   DOI   ScienceOn
18 A. C. Davison, Bootstrap methods and their application, Cambridge University Press, 1997
19 V. Vapnik, Statistical Learning Theory, John Wiley & Sons, 1998
20 P. Giudici, Applied Data Mining, Wiley, 2003
21 J. Lee, D. Lee, An Improved Cluster Labeling Method for Support Vector Clustering, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 3, pp. 461-464, 2005   DOI   ScienceOn
22 B. Y. Sun, D. S. Huang, Support Vector Clustering for Multiclass Classification Problems, Proceeding of IEEE Evolutionary Computation Congress, vol. 2, pp. 1480-1485, 2003