Browse > Article
http://dx.doi.org/10.5351/CKSS.2010.17.4.527

Rule-Based Classification Analysis Using Entropy Distribution  

Lee, Jung-Jin (Department of Statistics, Soongsil University)
Park, Hae-Ki (Department of Statistics, Soongsil University)
Publication Information
Communications for Statistical Applications and Methods / v.17, no.4, 2010 , pp. 527-540 More about this Journal
Abstract
Rule-based classification analysis is widely used for massive datamining because it is easy to understand and its algorithm is uncomplicated. In this classification analysis, majority vote of rules or weighted combination of rules using their supports are frequently used in order to combine rules. We propose a method to combine rules by using the multinomial distribution in this paper. Iterative proportional fitting algorithm is used to estimate the multinomial distribution which maximizes entropy constrained on rules' support. Simulation experiments show that this method can compete with other well known classification models in the case of two similar populations.
Keywords
Rule-based classification analysis; maximum entropy distribution; iterative proportional fitting algorithm;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 이정진 (2005). Discriminant analysis of binary data with multinomial distribution by using the iterative cross entropy minimization, <한국통계학회논문집>, 12, 125-137.   과학기술학회마을   DOI   ScienceOn
2 이정진, 김수관 (2002). Classification analysis in information retrieval by using Gauss patterns, <한국통계학회논문집>, 9, 1-11.   과학기술학회마을   DOI   ScienceOn
3 이정진, 황준 (2003). Discriminant analysis of binary data by using the maximum entropy distribution, <한국통계학회논문집>, 10, 909-917.   과학기술학회마을   DOI   ScienceOn
4 Asparoukhov, O. K. and Krzanowski, W. J. (2001). A comparison of discriminant procedures for binary variables, Computational Statistics and Data Analysis, 38, 139-160.   DOI   ScienceOn
5 Cramer, E. (2000). Probability measures with given marginals and conditionals: I-projections and conditional iterative proportional fitting, Statistics & Decisions, 18, 311-329.
6 Ruschendorf, L. (1995) Convergence of the iterative proportional fitting procedure, The Annals of Statistics, 23, 1160-1174.   DOI
7 Kantor, P. B. and Lee, J. J. (1998). Testing the maximum entropy principle for information retrieval, Journal of American Society for Information Science, 49, 557-566.   DOI   ScienceOn
8 Duda, R. O., Hart, P. E. and Stork, D. G. (2001). Pattern Classification, Wiley, New York.
9 Han, J. and Kamber, M. (2000). Data Mining Concepts and Technique, Elsevier.
10 Ireland, C. T. and Kullback, S. (1968). Contingency tables with given marginals, Biometrika, 55, 179-188.   DOI   ScienceOn
11 Lachenbruch (1981). Discriminant Analysis, Prentice Hall.
12 Liu, B., Hsu, W. and Ma, Y. (1998). Integrating classification and association rule mining, Proceeding 1998 International Conference Knowledge Discovery and Data Mining, 80-86, New York, August 1998.