Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2004.11D.3.717

Similarity Measurement with Interestingness Weight for Improving the Accuracy of Web Transaction Clustering  

Kang, Tae-Ho (충북대학교 대학원 정보통신공학과)
Min, Young-Soo (충북대학교 대학원 정보통신공학)
Yoo, Jae-Soo (충북대학교 전기전자 및 컴퓨터공학부)
Abstract
Recently. many researches on the personalization of a web-site have been actively made. The web personalization predicts the sets of the most interesting URLs for each user through data mining approaches such as clustering techniques. Most existing methods using clustering techniques represented the web transactions as bit vectors that represent whether users visit a certain WRL or not to cluster web transactions. The similarity of the web transactions was decided according to the match degree of bit vectors. However, since the existing methods consider only whether users visit a certain URL or not, users' interestingness on the URL is excluded from clustering web transactions. That is, it is possible that the web transactions with different visit proposes or inclinations are classified into the same group. In this paper. we propose an enhanced transaction modeling with interestingness weight to solve such problems and a new similarity measuring method that exploits the proposed transaction modeling. It is shown through performance evaluation that our similarity measuring method improves the accuracy of the web transaction clustering over the existing method.
Keywords
Web Personalization; Web Transaction Clustering; Similarity Measurement;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. Yan, M. Jacobsen, H. Garcia-Molina, and U. Dayal, 'From user access patterns to dynamic hypertext linking,' WWW5/Computer Networks, Vol.28, No.7-11, 1996   DOI   ScienceOn
2 F. Masseglia, P. Poncelet and M. Teisseire, 'Using Data Mining Techniques on Web Access Logs to Dynamically Improve Hypertext Structure,' In ACM SigWeb Letters, Vol.8, No.3, pp.13-19, October, 1999   DOI
3 R. Cooley and J. Srivastava, 'Automatic Personalization Based On Web Usage Mining,' Communications of the Association of Computing Machinery(CACM), pp.142-151, August, 2000   DOI
4 B. Mobasher, H. dai and T. Luo, 'Improving the Effectinveness of Collaborative Filtering on Anonymous Web Usage Data,' Proceedings of the IJCAI 2001 Workshop on Intelligent Techniques for Web Personalization(ITWP01), August, 2001
5 E-H. Han, G. Karypis, V. Kumar and B. Mobasher, 'Clustering based on association rule hypergraphs,' Data Mining and Knowledge Discovery(DMKD), 1997
6 R. Cooley, B. Mobasher and J. Srivastava, 'Data preparation for mining world wide web browsing pattern,' Knowledge and Information Systems, Vol.1, No.1, pp.5-32, 1999   DOI   ScienceOn
7 B. Mobasher, H. Dai and T. Luo, 'Discovery of Aggregate Usage Profiles for Web Personalization,' Proceedings of the Web Mining for E-Commerce Workshop(WEBKDD), August, 2000
8 Alex G. Buchner, Maurice D. Mulvenna, 'Discovering internet marketing intelligence through online analytical Web usage mining,' ACM SIGMOD Record, Vol.27, No4, pp. 54-61, 1998   DOI
9 Lin, S. A. Alvarez and C. Ruiz, 'Efficient adaptivesupport association rule mining for recommender systems,' Data Mining and knowledge Discovery(DMKD), 2002   DOI   ScienceOn
10 Rakesh Agrawal, Ramakrishnan Srikant, 'Fast Algorithms for Mining Association Rules,' Proc. 20th Int. Conf. Very Large Data Bases, VLDB, pp.487-499, Sep., 1994
11 ley, Pang-Ning Tan and Jaideep Srivastava, 'Discovery of Interesting Usage Patterns from Web Data,' World Wide Web Knowledge and Data mining(WEBKDD), pp.163-182, 1999
12 B. Mobasher, H. Dai, and T.Luo, 'Web Usage and Content Mining for More Effective Personalization,' E-Commerce and Web Technologies(ECWeb), September, 2000
13 Shahabi, C., A. Zarkesh and J. Adibi, and V. Shah, 'Knowledge Discovery from Users Web-PageNavigation,' Research Issues in Data Engineering, 1997
14 Feng Taoand and Murtagh, K., 'Towards knowledge discovery from WWW log data,' Proceedings of the The International Conference on Information Technology : Coding and Computing(ITCC), pp.302-307, 2000   DOI
15 Sanjay Kumar Madria, Sourav S. Bhowmick, Wee Keong Ng and Ee-Peng Lim, 'Research Issues in Web Data Mining,' Data Warehousing and Knowledge Discovery(DaWaK), pp.303-312, 1999
16 Mbasher, B., Cooley, R., Srivastaba, J., 'web mining : Information and Pattern Discovery on the World Wide Web,' In Procedings of the 9th IEEE International Conference on Tools with Artificial Intelligence(ICTAI '97), November, 1997   DOI