Browse > Article
http://dx.doi.org/10.5391/IJFIS.2008.8.2.116

Improvement of Self Organizing Maps using Gap Statistic and Probability Distribution  

Jun, Sung-Hae (Department of Bioinformatics & Statistics, Cheongju University)
Publication Information
International Journal of Fuzzy Logic and Intelligent Systems / v.8, no.2, 2008 , pp. 116-120 More about this Journal
Abstract
Clustering is a method for unsupervised learning. General clustering tools have been depended on statistical methods and machine learning algorithms. One of the popular clustering algorithms based on machine learning is the self organizing map(SOM). SOM is a neural networks model for clustering. SOM and extended SOM have been used in diverse classification and clustering fields such as data mining. But, SOM has had a problem determining optimal number of clusters. In this paper, we propose an improvement of SOM using gap statistic and probability distribution. The gap statistic was introduced to estimate the number of clusters in a dataset. We use gap statistic for settling the problem of SOM. Also, in our research, weights of feature nodes are updated by probability distribution. After complete updating according to prior and posterior distributions, the weights of SOM have probability distributions for optima clustering. To verify improved performance of our work, we make experiments compared with other learning algorithms using simulation data sets.
Keywords
Self Organizing Maps; Number of Clusters; Gap Statistic; Probability Distribution;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 D. A. Stacey, R. Farshad, "A probabilistic self-organizing classification neural network architecture", Proceeding of International Joint Conference on Neural Networks, vol. 6, pp. 4059-4063, 1999
2 A. Utsugi, "Topology selection for self-organizing maps", Network: Computation in Neural Systems, vol. 7, no. 4, pp. 727-740, 1996   DOI   ScienceOn
3 S. H. Jun, "An Optimal Clustering using Hybrid Self Organizing Map", International Journal of Fuzzy Logic and Intelligent Systems, vol. 6, no. 1, pp. 10-14, 2006   과학기술학회마을   DOI   ScienceOn
4 D. Dumitrescu, B. Lazzerini, L. C. Jain, Fuzzy Sets and Their Application to Clustering and Training, CRC Press, 2000
5 M. J. Park, S. H. Jun, K. W. Oh, "Determination of Optimal Cluster Size Using Bootstrap and Genetic Algorithm", International Journal of Fuzzy Logic and Intelligent Systems, vol. 13, no. 1, pp. 12-17, 2003   과학기술학회마을   DOI   ScienceOn
6 R. Tibshirani, G. Walther, T. Hastie, "Estimating the number of clusters in a dataset via the Gap statistics", Journal of the Royal Statistical Society, B, 63, pp. 411-423, 2001   DOI   ScienceOn
7 G. Mclachlan, D. Peel, Finite Mixture Models, John Wiley & Sons, 2000
8 A. S. Pandya, R. B. Macy, Pattern Recognition with Neural Networks in C++, IEEE Press, 1995
9 T. Kohonen, Self Organizing Maps, Second Edition, Springer, 1997
10 A. Gelman, J. B. Carlin, H. S. Stern, D. B. Rudin, Bayesian Data Analysis, Chapman & Hill, 1995
11 M. A. Tanner, Tools for Statistical inference, Springer, 1996
12 A. Utsugi, "Hyperparameter selection for self-organizing maps", Neural Computation, vol. 9, no. 3, pp. 623-635, 1997   DOI   ScienceOn
13 W. L. Martinez, A. R. Zartinez, Computational Statistics Handbook with MATRAB, Chapman & Hall, 2002
14 A. Ngan, S. Thiria, F. Badran, M. Yaccoub, C. Moulin, M. Crepon, Clustering and classification based on expert knowledge propagation using probabilistic self-organizing map(PRSOM): application to the classification of satellite ocean color TOA observations", Proceeding of IEEE International Symposium on Computational Intelligence for Measurement Systems and Applications, pp. 146-148, 2003
15 S. H. Jun, "New Heuristic of Self Organizing Map using Updating Distribution", Proceeding of the 1st International Conference on Cognitive Neurodynamics - 2007 (ICCN'07) and the 3rd Shanghai International Conference on Physiological Biophysics - Cognitive Neurodynamics (SICPB'07), 2007
16 R. M. Neal, Bayesian Learning for Neural Networks, Springer, 1996
17 S. H. Jun, H. Jorn, J. Hwang, "Bayesian Learning for Self Organizing Maps", The Korean Journal of Applied Statistics, vol. 15, no. 2, pp. 251-267, 2002   DOI   ScienceOn
18 B. S. Everitt, S. Landau, M. Leese, Cluster Analysis, Arnold, 2001
19 S. Haykin, Neural Networks, Prentice Hall, 1999
20 C. M. Bishop, M. Svensen, C. K. I. Williams, "GTM: A Principled Alternative to the Self Organizing Map", Proceeding of ICANN 1996, vol. 1112, pp. 165-170, 1996
21 H., Yin, N. M., Allinson, "Bayesian learning for self-organising maps", Electronics Letters, vol. 33, issue 4, pp. 304-305, 1997   DOI   ScienceOn
22 T. M. Mitchell, Machine Learning, McGraw-Hill, 1997
23 J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2001
24 S. J. Press, Bayesian Statistics: Principles, Models, and Applications, John Wiley & Sons, 1989