Browse > Article
http://dx.doi.org/10.5351/CKSS.2005.12.2.531

Revising K-Means Clustering under Semi-Supervision  

Huh Myung-Hoe (Department of Statistics, Korea University)
Yi SeongKeun (Dept. of Business Administration, Sungshin Women's University)
Lee Yonggoo (Department of Statistics, Chung-Ang University)
Publication Information
Communications for Statistical Applications and Methods / v.12, no.2, 2005 , pp. 531-538 More about this Journal
Abstract
In k-means clustering, we standardize variables before clustering and iterate two steps: units allocation by Euclidean sense and centroids updating. In applications to DB marketing where clusters are to be used as customer segments with similar consumption behaviors, we frequently acquire additional variables on the customers or the units through marketing campaigns a posteriori. Hence we need to modify the clusters originally formed after each campaign. The aim of this study is to propose a revision method of k-means clusters, incorporating added information by weighting clustering variables. We illustrate the proposed method in an empirical case.
Keywords
k-means clustering; customer segmentation; weighting variables; entropy criterion; marketing campaign;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Wedel, M. and Kamakura, W. (2000). Market Segmentation: Concept and Methodological Foundations, 2nd Ed. Kluwer Academic Publishers. (Chapter 5)
2 DeSarbo, W.S., Carrol, J.D., and Clark, L.A., and Green, P.E. (1984). 'Synthesized clustering: A method for amalgamating alternative clustering bases with differential weighting of variables,' Psychometrika, 49. 57-78   DOI
3 Giudici, P. (2003). Applied Data Mining. Wiley. (Section 4.2)
4 Huh, M.H, and Lee, Y. (2004). 'Reproducibility assessment of k-means clustering and applications,' Korean Journal of Applied Statistics, 17. 135-144. (Written in Korean)   DOI   ScienceOn
5 Makarenkov, V. and Legendre, P. (2001). 'Optimal variable weighting for ultrametric and additive trees and k-rneans partitioning: methods and software,' Journal of Classification, 18. 245-271
6 Milligan, G. W. (1995). 'Issues in applied classification,' CSNA Newsletter, 36-38
7 Ripley, B.D. (1996). Pattern Recognition and Neural Networks. Cambridge University Press. (Section 4.2)