Association Rule Mining Algorithm and Analysis of Missing Values

  • Lee, Jae-Wan (School of Electronic and Information Engineering, Kunsan National University) ;
  • Bobby D. Gerardo (School of Electronic and Information Engineering, Kunsan National University) ;
  • Kim, Gui-Tae (School of Electronic and Information Engineering, Kunsan National University) ;
  • Jeong, Jin-Seob (School of Electronic and Information Engineering, Kunsan National University)
  • Published : 2003.09.01

Abstract

This paper explored the use of an algorithm for the data mining and method in handling missing data which had generated enhanced association patterns observed using the data illustrated here. The evaluations showed that more association patterns are generated in the second analysis which suggests more meaningful rules than in the first situation. It showed that the model offer more precise and important association rules that is more valuable when applied for business decision making. With the discovery of accurate association rules or business patterns, strategies could be efficiently planned out and implemented to improve marketing schemes. This investigation gives rise to a number of interesting issues that could be explored further like the effect of outliers and missing data for detecting fraud and devious database entries.

Keywords

References

  1. Agrawal and Srikant. Fast Algorithms for Mining Association Rules. Proceeding of International Conference on Very Large Databases VLDB, 1994, 487-499
  2. Coenen, F. The Apriori Algorithm. http://www.csc.liv.ac.uk/~frans/KDD/aprioriTdemo.html#algorithm (2001)
  3. Edelstein, Herb. Data Mining: Can you dig it? http://www.teradatamagazine.com/articles/2003/vol3_no2/enterpriseviews/default.htm
  4. Han J. and Kamber M. Data mining concepts and techniques. USA: Morgan Kaufmann (2001)
  5. Handling missing or incomplete data. http://www.utexas.edu/cc/faqs/stat/general/gen25.html
  6. Hellerstein, J.L., Ma, S. and Pemg, C. S. Discovering actionable patterns in event data. IBM Systems Journal, Vol. 41, No.3, 2002
  7. Knowledge Discovery in Databases. http://www.cs.ualberta.ca/~joerg/courses/cmput690/slides/Overview-s4.pdf
  8. Multi-Dimensional Constrained Gradient Mining. ftp://fas.sfu.ca/pub/cs/theses/2001/JoyceManWingLamMSc.pdf
  9. Nayak, Jyothsna R. and Cook, Diane J. Approximate Association Rule Mining. Proceedings of the Florida Artificial Intelligence Research Symposium, 2001
  10. Nestorov, Svetlozar and Jukic, Nenad. Ad-Hoc Association-Rule Mining within the Data Warehouse. Proceedings of 36th Annual Hawaii International Conference on System Sciences, page 232a, January 2003
  11. Pairwise Deletion of Missing Data vs. Mean Substitution. http://www.statsoftinc.com/textbook/glosp.html
  12. Text Mining. http://www.cs.waikato.ac.nz/~nzdl/textmining