DOI QR코드

DOI QR Code

Probabilistic Graphical Model for Transaction Data Analysis

트랜잭션 데이터 분석을 위한 확률 그래프 모형

  • Ahn, Gil Seung (Department of Industrial and Management Engineering, Hanyang University) ;
  • Hur, Sun (Department of Industrial and Management Engineering, Hanyang University)
  • 안길승 (한양대학교 산업경영공학과) ;
  • 허선 (한양대학교 산업경영공학과)
  • Received : 2015.12.07
  • Accepted : 2016.05.27
  • Published : 2016.08.15

Abstract

Recently, transaction data is accumulated everywhere very rapidly. Association analysis methods are usually applied to analyze transaction data, but the methods have several problems. For example, these methods can only consider one-way relations among items and cannot reflect domain knowledge into analysis process. In order to overcome defect of association analysis methods, we suggest a transaction data analysis method based on probabilistic graphical model (PGM) in this study. The method we suggest has several advantages as compared with association analysis methods. For example, this method has a high flexibility, and can give a solution to various probability problems regarding the transaction data with relationships among items.

Keywords

References

  1. Ahn, G.-S. and Hur, S. (2015), Prediction of New Customer's Degree of Loyalty of Internet Shopping Mall Using Continuous Conditional Random Field, Journal of Korean Institute of Industrial Engineers, 41(1), 10-16. https://doi.org/10.7232/JKIIE.2015.41.1.010
  2. Bron, C. and Kerbosch, J. (1973), Algorithm 457 : Finding All Cliques of an undirected Graph, Communications of the ACM, 16(9), 575-577. https://doi.org/10.1145/362342.362367
  3. Chen, Y. and Welling, M. (2014), Bayesian structure learning for Markov random fields with a spike and slab prior, arXiv preprint, arXiv : 1408.2047.
  4. eMarketer (2014), Worldwide E-commerces Sales to increase Nearly 20% in 2014.
  5. Fuguang, B. (2015), A Novel Method of Interestingness Measures for Association Rules Mining Based on Profit, Discrete Dynamics in Nature and Society, 1-10.
  6. Han, D.-S. (2009), Identification of Conserved Protein Domain Combination based on Association Rule. Journal of KIISE : Computing Practices and Letters, 15(5), 375-379.
  7. Informationweek (2012), Catalina Marketing Aims For the Cutting Edge of 'Big Data.'
  8. Jordan, M. I. (1999), Learning in Graphical Models, MIT Press, Massachusetts, USA.
  9. Kim, N.-K. (2008), Effect of Market Basket Size on the Accuracy of Association Rule Measures, Asia Pacific Journal of Information Systems, 18(2), 95-114.
  10. Kolaczyk, E. (2009), Statistical Analysis of Network Data, Springer, Boston, USA.
  11. Park, D.-S. (2005), A Visualization on Data Mining for Association based on Web, Journal of Korean Institute of Information Technology, 3(4), 1-9.
  12. Rajtmajer, S. M. (2012), Introduction to Markov Random Fields.
  13. Son, J.-E., Kim, S.-B., Kim, H.-J., and Cho, S.-Z. (2015), Review and Analysis of Recommender Systems, Journal of Korean Institute of Industrial Engineers, 41(2), 185-208. https://doi.org/10.7232/JKIIE.2015.41.2.185
  14. Turney, P. D. (2002), Thums up or thums down?: semantic orientation applied to unsupervised classification of reviews, In Proceedings of the 40th annual meeting on association for computational linguistics, 417-424.
  15. Yang, S.-M. (2003), Discovery of Association Rules Based on items of Categorical Attribute and Quantitative Attribute, Proceedings of Korean Institute of Information Technology, 456-461.