DOI QR코드

DOI QR Code

Improvement of Support Vector Clustering using Evolutionary Programming and Bootstrap

  • Jun, Sung-Hae (Department of Bioinformatics & Statistics, Cheongju University)
  • 발행 : 2008.09.01

초록

Statistical learning theory has three analytical tools which are support vector machine, support vector regression, and support vector clustering for classification, regression, and clustering respectively. In general, their performances are good because they are constructed by convex optimization. But, there are some problems in the methods. One of the problems is the subjective determination of the parameters for kernel function and regularization by the arts of researchers. Also, the results of the learning machines are depended on the selected parameters. In this paper, we propose an efficient method for objective determination of the parameters of support vector clustering which is the clustering method of statistical learning theory. Using evolutionary algorithm and bootstrap method, we select the parameters of kernel function and regularization constant objectively. To verify improved performances of proposed research, we compare our method with established learning algorithms using the data sets form ucr machine learning repository and synthetic data.

키워드

참고문헌

  1. B. S. Everitt, S. Landau, M. Leese, Cluster Analysis, Arnold, 2001
  2. P. Giudici, Applied Data Mining, Wiley, 2003
  3. A. L. N. Fred, A. K. Jain, Robust Data Clustering, Proceeding of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 128-133, 2003
  4. K. Krishna, K. Narasimha Murty, Genetic K-means algorithm, Proceeding of IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 29, no. 3, pp. 433-439, 1999 https://doi.org/10.1109/3477.764879
  5. J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2001
  6. L. Y. Tseng, S. B. Yang, Genetic Algorithms for Clustering, Feature Selection and Classification, Proceeding of International Conference on Neural Networks, vol. 3, pp. 1612-1616, 1997
  7. B. Ribeiro, On Data Based Learning using Support Vector Clustering, Proceeding of the 9th International Conference on Neural Information Processing, vol. 5, pp. 2516-2521, 2002
  8. B. Y. Sun, D. S. Huang, Support Vector Clustering for Multiclass Classification Problems, Proceeding of IEEE Evolutionary Computation Congress, vol. 2, pp. 1480-1485, 2003
  9. A. Ben-Hur, D. Horn, H. T. Siegelmann, V. Vapnik, Support Vector Clustering, Journal of Machine Learning Research, vol. 2, pp. 125-137, 2001 https://doi.org/10.1162/15324430260185565
  10. J. Lee, D. Lee, An Improved Cluster Labeling Method for Support Vector Clustering, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 3, pp. 461-464, 2005 https://doi.org/10.1109/TPAMI.2005.47
  11. P. Ling, Y. Wang, N. Lu, J. Y. Wang, S. Liang, C. G. Zhou, Two-Phase Support Vector Clustering for Multi-Relational Data Mining, Proceeding of the International Conference on Cyber-worlds, 2005
  12. J. Wang, X. Wu, C. Zhang, Support vector machine based on K-means clustering for real-time business intelligence systems, International Journal of Business Intelligence and Data Mining, vol. 1, no. 1, pp. 54-64, 2005 https://doi.org/10.1504/IJBIDM.2005.007318
  13. S. H. Jun, K. W. Oh, A Competitive Co-Evolving Support Vector Clustering, Lecture Note in Computer Science, vol. 4232, pp. 864-873, 2006 https://doi.org/10.1007/11893028_96
  14. V. Vapnik, Statistical Learning Theory, John Wiley & Sons, 1998
  15. A. E. Eiben, J. E. Smith, Introduction to Evolutionary Computing, Springer, 2003
  16. S. H. Jun, Web Usage Mining Using Evolutionary Support Vector Machine, Lecture Note in Artificial Intelligence, vol. 3809, pp. 1015-1020, 2005
  17. A. C. Davison, Bootstrap methods and their application, Cambridge University Press, 1997
  18. J. C. Chiang, J. S. Wang, A Validity-Guided Support Vector Clustering Algorithm for Identification of Optimal Cluster Configuration, Proceeding of IEEE International Conference on Systems, Man and Cybernetics, pp. 3613-3618, 2004
  19. UCI Machine Learning Repository, http://www.ics.uci.edu/-mlearn/MLRepository.html
  20. W. L. Martinez, A. R. Martinez, Computational Statistics Handbook with MATRAB, Chapman & Hall, 2002
  21. G. Mclachlan, D. Peel, Finite Mixture Models, John Wiley & Sons, 2000
  22. T. M. Mitchell, Machine Learning, McGraw-Hill, 1997