Browse > Article

Measuring Web Page Similarity using Tags  

Kang, Sang-Wook (삼성전자 무선사업부)
Lee, Ki-Yong (KAIST 전산학과)
Kim, Hyeon-Gyu (KAIST 전산학과)
Kim, Myoung-Ho (KAIST 전산학과)
Abstract
Social bookmarking is one of the most interesting trends in the current web environment. In a social bookmarking system, users annotate a web page with tags, which describe the contents of the page. Numerous studies have been done using this information, mostly on enhancing the quality of web search. In this paper, we use this information to measure the semantic similarity between two web pages. Since web pages consist of various types of multimedia data, it is quite difficult to compare the semantics of two web pages by comparing the actual data contained in the pages. With the help of social bookmarks, this comparison can be performed very effectively. In this paper, we propose a new similarity measure between web pages, called Web Page Similarity Based on Entire Tags (WSET), based on social bookmarks. The experimental results show that the proposed measure yields more satisfactory results than the previous ones.
Keywords
Web page similarity; Tag; Social Bookmarks; WWW;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Yanabe Y., Jatowt A., Nakamura S., Tanaka K., Can Social Bookmarking Enhance Search in the Web? In JCDL '07: Proceedings of the 2007 Conference on Digital Libraries, pp.107-116, ACM (2007).
2 Heymann P., Koutrika G. Garcia-Molina H., Can Social Bookmarking Improve Web Search? In WSDM '08, ACM (2008).
3 Law K., Harik G., Techniques for finding related hyperlinked documents using link-based analysis. U.S. Patent 6,754,873. June 22, 2004.
4 Dean J., Henzinger M., Finding related pages in the World Wide Web. In Proc. of the Eighth International World Wide Web Conference (1999).
5 Page L., Brin S., Motwani R., Winograd T., The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University Database Group (1998).
6 J. M. Kleinberg: Authoritative Sources in a Hyperlinked Environment. In: 9th Annual ACM-SIAM Symposium on Discrete Algorithms, pp.668-677, (1998)
7 Shen X., Tan B., Zhai C., Implicit User Modeling for Personalized Search. In CIKM'05, ACM (2005).
8 Chirita P., Nejdl W., Paui R., Kohlschutter C., Using ODP Metadata to Personalized Search. In Proc. of SIGIR (2005).
9 Bao S., Xue G., Wu X., Yu Y., Fei B., Su Z., Optimizing Web Search Using Social Annotations. In WWW '07: Proceedings of the 16th International Conference on World Wide Web, pp.501- 510, ACM (2007).
10 Delicious social bookmarking, http://delicious.com/
11 Hofmann T., Puzicha J., Statistical Models for Co-occurrence Data. Technical report, A.I.Memo 1635, MIT (1998).
12 Wu X., Zhang L., Yu Y., Exploring Social Annotations for the Semantic Web. In WWW '06: Proceedings of the 15th International Conference on World Wide Web, pp.417-426, ACM (2006).