Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2010.17B.5.399

A Method for Precision Improvement Based on Core Query Clusters and Term Proximity  

Jang, Kye-Hun (전북대학교 컴퓨터공학과)
Lee, Kyung-Soon (전북대학교 컴퓨터공학부/영상정보신기술연구센터)
Abstract
In this paper, we propose a method for precision improvement based on core clusters and term proximity. The method is composed by three steps. The initial retrieval documents are clustered based on query term combination, which occurred in the document. Core clusters are selected by using proximity between query terms. Then, the documents in core clusters are reranked based on context information of query. On TREC AP test collection, experimental results in precision at the top documents(P@100) show that the proposed method improved 11.2% over the language model.
Keywords
Query Term Cluster; Core Query; Term Proximity; Context Term; Reranking;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Strohman, T., Metzler, D., Turtle, H., and Croft, W.B. 2005. Indri: A language model-based search engine for complex queries. In proc. International Conference on Intelligence Analysis. http://www.lemurproject.org
2 신승은, 강유환, 오효정, 장명길, 박상규, 이재성, 서영훈, 2003. 문서필터링을 위한 질의어 확장과 가중치 부여기법. 정보처리학회 논문지. 제10-B권. 제7호. pp.743-750.
3 Balinski, J., Danilowicz, C. 2004. Re-ranking method based on inter-document distances. Information Processing and Management, 41(2005)759-775.   DOI   ScienceOn
4 Sakai, T., Manabe, T., Koyama, M. 2005. Flexible pseudo-relevance feedback via selective sampling. ACM Transaction on Asian Language Information Processing (TALIP), 4(2), pp.111-135.   DOI
5 Collins-Thompson, K., Callan, J. 2007. Estimation and Use of Uncertainty in Pseudo-relevance Feedback. In Proc. of 30th ACM SIGIR on Research and Development in Information Retrieval. pp.303-310.
6 Lv, Y., Zhai, C.X. 2009. Positional Language Models for Information Retrieval. In Proc. of 32nd ACM SIGIR on Research and Development in Information Retrieval. pp.299-306.
7 Lavrenko, V., Croft, W.B. 2001. Relevance-based language models. In Proc. of 24th ACM SIGIR on Research and Development in Information Retrieval. pp.120-127.
8 Bendersky, M., Croft, W.B. 2008. Discovering Key Concepts in Verbose Queries. In Proc. of 31st ACM SIGIR on Research and Development in Information Retrieval. pp.491-498.
9 Kumaran, G., Allan, J. 2008. Effective and Efficient User Interaction for Long Queries. In Proc. of 31st ACM SIGIR on Research and Development in Information Retrieval. pp.11-18.
10 Zhao, J., Yun, Y. 2009. A Proximity Language Model for Information Retrieval. In Proc. of 32nd ACM SIGIR on Research and Development in Information Retrieval. pp.291-298.
11 Seo, J.W., Jeon, J.W. 2009. High Precision Retrieval Using Relevance-Flow Graph. In Proc. of 32nd ACM SIGIR on Research and Development in Information Retrieval. pp 694-695.
12 Ponte, J.M., Croft, W.B. 1998. A Language Modeling Approach to Information Retrieval. In Proc. of 21st ACM SIGIR on Research and Development in Information Retrieval. pp.275-281.