Browse > Article

An Efficient Candidate Pattern Tree Structure and Algorithm for Incremental Web Mining  

Kang, Hee-Seong (Dept. of Computer Science, Kwangwoon University)
Park, Byung-Joon (Dept. of Computer Science, Kwangwoon University)
Publication Information
Abstract
Recent advances in the internet infrastructure have resulted in a large number of huge Web sites and portals worldwide. These Web sites are being visited by various types of users in many different ways. Among all the web page access sequences from different users, some of them occur so frequently that may need an attention from those who are interested. We call them frequent access patterns and access sequences that can be frequent the candidate patterns. Since these candidate patterns play an important role in the incremental Web mining, it is important to efficiently generate, add, delete, and search for them. This thesis presents a novel tree structure that can efficiently store the candidate patterns and a related set of algorithms for generating the tree structure, adding new patterns, deleting unnecessary patterns, and searching for the needed ones. The proposed tree structure has a kind of the 3 dimensional link structure and its nodes are layered.
Keywords
Tree; Incremental; Candidate; Pattern; Web Mining;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Xidong Wang, Yiming Ouyang, Xugang Hu, Yan Zhang, 'Discovery of User Frequent Access patterns on Web usage Mining', IEEE, pp. 765-769, 2003
2 Maged El-Sayed, Carolina Ruiz, and Elke A.Rundensteiner, 'FS-Miner : Efficient and Incremental Mining of Frequent Sequence Patterns in Web logs', WIDM, pp.128-135, 2004   DOI
3 Wolfgang Gaul, Lars Schmidt-Thieme, 'Mining Web navigation path fragments', Institut fur Entscheidungstheorie und Unternehm ens for schung. pp.1-6, 2000
4 R. Cooley, B. Mobasher, and J. Srivastava, 'Web Mining : Information and Pattern Discovery on the World Wide Web', Proc. of the 9th IEEE International Conference on Tools with Artificial Intelligence, pp.558-567, Nov 1997   DOI
5 Ramakrishnan Srikant, and Rakesh Agrawal, 'Mining Sequential Patterns : Generalizations and Performance Improvements', IBM Almaden Research Center 650 Harry Road, San Jose, CA 95120, pp.3-17, 1996
6 Apache Korea Group, http://www.apache.kr.net
7 S.Parthasarathy, M. J. Zaki, M. Ogihara, S. Dwarkadas, 'Incremental and Interactive Sequence Mining', CIKM, pp.251-258, 1999   DOI
8 Mathias Gery, Hatem Haddad, 'Evaluation of Web Usage Mining approaches for user's next request prediction', WIDM, pp.75-76, 2003   DOI
9 Jaideep Srivastava, R. Cooley, M. Deshpande, P-T. Tan, 'Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data', SIGKDD Explorations, Volume 1, Issue 2, pp.12-23, Jan 2000   DOI
10 코리아인터넷 마케팅센터, http://www.webpro.co.kr