Browse > Article

A Term Weight Mensuration based on Popularity for Search Query Expansion  

Lee, Jung-Hun (동국대학교 컴퓨터공학과)
Cheon, Suh-Hyun (동국대학교 컴퓨터공학과)
Abstract
With the use of the Internet pervasive in everyday life, people are now able to retrieve a lot of information through the web. However, exponential growth in the quantity of information on the web has brought limits to online search engines in their search performance by showing piles and piles of unwanted information. With so much unwanted information, web users nowadays need more time and efforts than in the past to search for needed information. This paper suggests a method of using query expansion in order to quickly bring wanted information to web users. Popularity based Term Weight Mensuration better performance than the TF-IDF and Simple Popularity Term Weight Mensuration to experiments without changes of search subject. When a subject changed during search, Popularity based Term Weight Mensuration's performance change is smaller than others.
Keywords
personalized search; query extraction; clustering; TF-IDF;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D. Mount, "ANN: Library for Approximate Nearest Neighbor Searching," http://www.cs.umd.edu/-mount/ANN/, 2006.
2 Agrawal, R., and Srikant, R., "Fast Algorithms for Mining Association Rules," Proceeding of the 20th International Conference on Very Large Databases, pp.487-499, 1994.
3 J. Cho, S. Roy, and R. Adams, Page quality: In search of an unbiased web rankIng. Proceedings of the 2005 ACM SIGMOD international conference on Management of data 2005, Baltimore, Maryland, June, pp.14-16, 2005.
4 Zhicheng Dou, Ruihua Song, Ji-Rong Wen, "A Largescale Evaluation and Analysis of Personalized Search Strategies," Proceedings of the 16th international conference on World Wide WebNew York, NY, USA: ACM, pp.581-590. 2007.
5 Kalervo Jarvelin, Jaana Kekalainen, "Cumulated gain-based evaluation of IR techniques," ACM Transactions on Information Systems, 20(4), pp.422-446 (2002).   DOI   ScienceOn
6 Buckley C., Salton G., and Allan J., "The Effect of Adding Relevance Information in a Relevance Feedback Environment," Proceedings of 17th annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Dublin, pp.292-300, 1994.
7 Salton, G., and Buckley, C., "Improving Retrieval Performance by Relevance Feedback," Journal of the American Society for Information Science, vol.41, pp.288-297, 1990.   DOI
8 Anick, P. G. and Vaithyanathan, S., "Exploiting Clustering and Phrases for Context-Based Information Retrieval," Proceeding of the 20th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp.314-323, 1997.
9 Tribula, W. J., "Text Mining," Annual Review of Information Science and Technology, pp.385-419, 1999.
10 Kristensen, J., "Expanding End-Users," Query Statements for Free-text Searching with a Search-aid Thesaurus," Information Processing and Management, vol.11, pp.22-33, 1968.
11 Harman, D., "Relevance Feedback Revisited," Proceedings of 15th annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Copenhagen, pp.1-10, 1992.
12 Li Ding, Tim Finin, and Anupam Joshi, "Swoogle: A search and metadata engine for the semantic web.," In Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management, pp.58-61, 2004.
13 B. Yang and G. Jeh, "Retroactive answering of search queries," Proceedings of the 15th international conference on World Wide Web, pp.457-466, 2006.
14 Qiu, F., and Cho, J., "Automatic identification of user interest for personalized search," In Proceedings of the 15th International Conference on World Wide Web., pp.727-736, 2006.
15 K. Sugiyama, K. Hatano, and M. Yoshikawa, "Adaptive web search based on user profile constructed without any effort from users," In Proceedings of the 15th International Conference on World Wide Web., pp.675-684, 2004.
16 Reiner Kraft, Chi Chao Chang, Farzin Maghoul, and Ravi Kumar, "Searching with context," In 15th International CIKM Conference Proceedings, pp.477-486, 2006.
17 S. S. Kang, "A Rule-Based Method for Morphological Disambiguation," Proceedings of the NLPRS (Natural Language Processing Pacific Rim Symposium), pp.67-72, 1999.
18 Lawrence Page, Sergey Brin, Rajeev Motwani, Terry Winograd, "The PageRank Citation Ranking: Bringing Order to the Web," Technical report, Stanford University, 1998.