[KSCI] Korea Science Citation Index Service

Standardization for basic association measures in association rule mining

Park, Hee-Chang (Department of Statistics, Changwon National University)

Publication Information

Journal of the Korean Data and Information Science Society / v.21, no.5, 2010 , pp. 891-899 More about this Journal

Abstract

Association rule is the technique to represent the relationship between two or more items by numerical representing for the relevance of each item in vast amounts of databases, and is most being used in data mining. The basic thresholds for association rule are support, confidence, and lift. these are used to generate the association rules. We need standardization of lift because the range of lift value is different from that of support and confidence. And also we need standardization of support and confidence to compare objectively association level of antecedent variables for one descendant variable. In this paper we propose a method for standardization of association thresholds considering marginal probability for each item to grasp objectively and exactly association level, check the conditions for association criteria and then compare association thresholds with standardized association thresholds using some concrete examples.

Keywords

Association rule; confidence; lift; standardized threshold; support;

Citations & Related Records

Times Cited By KSCI : 3 (Citation Analysis)

Reference
Cited By KSCI

1	Cho, K. H. and Park, H. C. (2007). Association rule mining by environmental data fusion. Journal of the Korean Data & Information Science Society, 18, 279-287. 과학기술학회마을
2	Cai, C. H., Fu, A. W. C., Cheng, C. H. and Kwong, W. W. (1998). Mining association rules with weighted items. Proceedings of International Database Engineering and Applications Symposium, 68-77.
3	Bayardo, R. J. (1998). Efficiently mining long patterns from databases. Processing of ACM SIGMOD Conference on Management of Data, 85-93.
4	Piatetsky, S. G. (1991). Discovery, analysis and presentation of strong rules. Knowledge Discovery in Databases, AAAI/MIT Press, 229-248.
5	Agrawal, R., Imielinski R. and Swami, A. (1993). Mining association rules between sets of items in large databases. Proceedings of the ACM SIGMOD Conference on Management of Data, 207-216.
6	Agrawal, R. and Srikant, R. (1994). Fast algorithms for mining association rules. Proceedings of the 20th VLDB Conference, 487-499.
7	Srikant, R. and Agrawal, R. (1995). Mining generalized association rules. Proceedings of the 21st VLDB Conference, 407-419.
8	Toivonen, H. (1996). Sampling large database for association rules. Proceedings of the 22nd VLDB Conference, 134-145.
9	Pasquier, N., Bastide, Y., Taouil, R. and Lakhal, L. (1999). Discovering frequent closed itemsets for association rules. Proceedings of the 7th International Conference on Database Theory, 398-416.
10	Pei, J., Han, J. and Mao, R. (2000). CLOSET: An efficient algorithm for mining frequent closed itemsets. Proceedings of ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 21-30.
11	Park, H. C. and Cho, K. H. (2005). Waste database analysis joined with local information using association rules. Journal of the Korean Data Analysis Society, 7, 763-772.
12	Cho, K. H. and Park, H. C. (2008). A study of association rule application using self-organizing map for fused data. Journal of the Korean Data & Information Science Society, 19, 95-104.
13	Park J. S., Chen M. S. and Philip S. Y. (1995). An effective hash-based algorithms for mining association rules. Proceedings of ACM SIGMOD Conference on Management of Data, 175-186.
14	McNicholas, P. D., Murphy, T. B. and O'Regan, O. (2008). Standardising the lift of an association rule. Computational Statistics and Data Analysis, 52, 4712-4721. DOI ScienceOn
15	Park, H. C. (2008). The proposition of conditionally pure confidence in association rule mining. Journal of the Korean Data & Information Science Society, 19, 1141-1151. 과학기술학회마을
16	Han, J., Pei, J. and Yin, Y. (2000). Mining frequent patterns without candidate generation. Proceedings of ACM SIGMOD Conference on Management of Data, 1-12.
17	Liu, B., Hsu, W. and Ma, Y. (1999). Mining association rules with multiple minimum supports. Proceedings of the 5th International Conference on Knowledge Discovery and Data Mining, 337-241.
18	Choi, J. H. and Park, H. C. (2008). Comparative study of quantitative data binning methods in association rule. Journal of the Korean Data & Information Science Society, 19, 903-910. 과학기술학회마을
19	Han, J. and Fu, Y. (1999). Mining multiple-level association rules in large databases. IEEE Transactions on Knowledge and Data Engineering, 11, 68-77.

Reference
Cited By KSCI

1	Small diagnostic scale for internet addiction / [Oh, Kwang-Sik;] / Journal of the Korean Data and Information Science Society
2	Association rule thresholds of similarity measures considering negative co-occurrence frequencies / [Park, Hee-Chang;] / Journal of the Korean Data and Information Science Society
3	Association rule thresholds considering the number of possible rules of interest items / [Park, Hee-Chang;] / Journal of the Korean Data and Information Science Society
4	Negatively attributable and pure confidence for generation of negative association rules / [Park, Hee-Chang;] / Journal of the Korean Data and Information Science Society
5	Exploration of PIM based similarity measures as association rule thresholds / [Park, Hee Chang;] / Journal of the Korean Data and Information Science Society
6	Association Analysis of Construction Accident Attributes Causing Fatalities / [Shin, Dong-Pil;Son, Chang-Baek;Lee, Dong-Eun;] / Journal of the Architectural Institute of Korea Structure & Construction
7	The proposition of compared and attributably pure confidence in association rule mining / [Park, Hee Chang;] / Journal of the Korean Data and Information Science Society
8	Utilization of similarity measures by PIM with AMP as association rule thresholds / [Park, Hee Chang;] / Journal of the Korean Data and Information Science Society
9	Non-linear regression model considering all association thresholds for decision of association rule numbers / [Park, Hee Chang;] / Journal of the Korean Data and Information Science Society

KSCI

Standardization for basic association measures in association rule mining 연관 규칙 마이닝에서의 평가기준 표준화 방안

Standardization for basic association measures in association rule mining