Application of Genetic and Local Optimization Algorithms for Object Clustering Problem with Similarity Coefficients

유사성 계수를 이용한 군집화 문제에서 유전자와 국부 최적화 알고리듬의 적용

  • Yim, Dong-Soon (Department of Industrial and Systems Engineeringm Hannam University) ;
  • Oh, Hyun-Seung (Department of Industrial and Systems Engineeringm Hannam University)
  • 임동순 (한남대학교 산업시스템공학) ;
  • 오현승 (한남대학교 산업시스템공학)
  • Published : 2003.03.31

Abstract

Object clustering, which makes classification for a set of objects into a number of groups such that objects included in a group have similar characteristic and objects in different groups have dissimilar characteristic each other, has been exploited in diverse area such as information retrieval, data mining, group technology, etc. In this study, an object-clustering problem with similarity coefficients between objects is considered. At first, an evaluation function for the optimization problem is defined. Then, a genetic algorithm and local optimization technique based on heuristic method are proposed and used in order to obtain near optimal solutions. Solutions from the genetic algorithm are improved by local optimization techniques based on object relocation and cluster merging. Throughout extensive experiments, the validity and effectiveness of the proposed algorithms are tested.

Keywords

References

  1. Aldenderfer, M. S. and Blashtleld, R. K. (1984), Cluster Analysis, SAGE Publication, London
  2. Bhuyan, J. N., Raghavan, V. v., and Elayavalli, V. K. (1991), Genetic Algorithm for Clustering with an Ordered Representation, Proceedings of the Fourth International Conference on Genetic Algorithms, Morgan Kaufmann Publishers, Los Altos, CA, 408-415
  3. Bradley, P., Fayyad, U., and Reina, C. (1998), Scaling Clustering Algorithms to Large Databases, Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, AAAI Press, 9-15
  4. Davis, L. (1985), Applying Adaptive Algorithms to Epistatic Domains, Proceedings of the International Joint Conference on Artificial Intelligence, 162-164
  5. Dubes, R. C. and Jain, A. K. (1988), Algorithmsfor Clustering Data, Prentice Hall
  6. Goldberg, D. E. and Lingle, R. (1985), Alleles, Loci, and the Traveling Salesman Problem, Proceedings of the First International Conference on Genetic Algorithms, Lawrence Erlbaum Associates, Hillsdale, NJ, 154-159
  7. Grossman, D. A. and Frieder, O. (1998), Information Retrieval: Algorithms and Heuristics, Kluwer Academic Publishers, Boston, MA.
  8. Jones, D. R. and Bertramo, M. A. (1991), Solving Partitioning Problems with Genetic Algorithms. Proceedings of the Fourth International Conference on Genetic Algorithms, Morgan Kaufinann Publishers, Los Altos, CA, 442-449
  9. Kaufman, L. and Rousseeuw, P. J. (1990), Finding Groups in Data: An Introduction to Cluster Analysis, John Wiley & Sons, Inc
  10. Lucasius, C. B., Dane, A. D., and Kateman G. (1993), On K-medoid Clustering of Large Data Sets with the Aid of a Genetic Algorithm: Background, Feasibility and Comparison, Analytica Chimica Acta, 282, 647-669
  11. MacQueen, J. (1967), Some Methods for Classification and Analysis of Multivariate Observations, Proceedings on 5th Berkeley Symposium, 281-297
  12. Michaelwicz, Z. (1992), Genetic Algorithms + Data Structures = Evolution Programs, Springer-Verlag
  13. Moraczewski, I. R., Borkowski, W., and Kierzek, A. (1995), Clustering Geobotanical Data with the Use of a Genetic Algorithm, COENOSES, 10(1), 17-28
  14. Oliver, I. M., Smith, D. J., and Holland, J. R. C. (1987), A Study of Permutation Crossover Operators on the Traveling Salesman Problem, Proceedings of the Second International Conference on Genetic Algorithms, Lawrence Erlbaum Associates, Hillsdale, NJ, 224-230
  15. Seifoddini, H. and Wolfe, P. M. (1986), Application of the Similarity Coefficient Method in Group Technology, IIE Transactions, 18 (3),271-277 https://doi.org/10.1080/07408178608974704