http://dx.doi.org/10.7472/jksii.2019.20.5.49

HBase based Business Process Event Log Schema Design of Hadoop Framework  

Ham, Seonghun (Div. of Computer Science and Engineering, Kyonggi University)
Ahn, Hyun (Div. of Computer Science and Engineering, Kyonggi University)
Kim, Kwanghoon Pio (Div. of Computer Science and Engineering, Kyonggi University)
Publication Information
Journal of Internet Computing and Services / v.20, no.5, 2019, pp. 49-55
Abstract
Organizations design and operate business process models to achieve their goals efficiently and systematically. As information technology advances, computer systems take part in a growing number of tasks, and the processes themselves become large and complicated. As a result, business process flows have become more complex and more finely subdivided, and the process instances, which contain workcases and events, have grown larger and carry more data. These event logs are an essential resource for process mining and are used directly in process model discovery, analysis, and improvement. As event logs grow in both volume and breadth, managing them as raw log files or through a relational database leads to problems such as capacity management and I/O load. In this paper, we identify the limits of managing event logs with the existing raw files or a relational database once the logs reach big-data scale. We then design and apply schemas for archiving and analyzing large event logs using Hadoop, an open-source framework for distributed storage and processing, and HBase, a NoSQL database system.
Keywords
Workflow Process; NoSQL; Process Mining; Event Log; Hadoop; Process Discovery;