Browse > Article
http://dx.doi.org/10.1633/JISTaP.2013.1.3.3

Topic Level Disambiguation for Weak Queries  

Zhang, Hui (School of Library and Information Science Indiana University)
Yang, Kiduk (Department of Library and Information Science Kyungpook National University)
Jacob, Elin (School of Library and Information Science Indiana University)
Publication Information
Journal of Information Science Theory and Practice / v.1, no.3, 2013 , pp. 33-46 More about this Journal
Abstract
Despite limited success, today's information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queries. However, existing IR approaches such as query expansion are not overly effective because they make little effort to analyze and exploit the meanings of the queries. Furthermore, word sense disambiguation approaches, which rely on textual context, are ineffective against weak queries that are typically short. Motivated by the demand for a robust IR system that can consistently provide highly accurate results, the proposed study implemented a novel topic detection that leveraged both the language model and structural knowledge of Wikipedia and systematically evaluated the effect of query disambiguation and topic-based retrieval approaches on TREC collections. The results not only confirm the effectiveness of the proposed topic detection and topic-based retrieval approaches but also demonstrate that query disambiguation does not improve IR as expected.
Keywords
Topic Detection; Query Disambiguation; Language Model; Information Retrieval; Natural Language Processing;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Ruthven, I. A. N., & Lalmas, M. (2003). A survey on the use of relevance feedback for information access systems. The Knowledge Engineering Review, 18(02), 95-145.   DOI   ScienceOn
2 Salton, G., & Buckley, C. (1990). Improving retrieval performance by relevance feedback. Journal of the American Society for Information Science, 41(4), 288-297.   DOI
3 Sanderson, M. (1994). Word sense disambiguation and information retrieval. New York, NY: Springer-Verlag.
4 Sanderson, M. (2000). Retrieving with good sense. Information Retrieval, 2(1), 49-69.   DOI   ScienceOn
5 Selvaretnam, B., & Belkhatir, M. (2012). Natural language technology and query expansion: Issues, state-of-the-art and perspectives. Journal of Intelligent Information Systems, 38(3), 709-740. doi: 10.1007/s10844-011-0174-3   DOI
6 Spink, A., Wolfram, D., Jansen, M. B. J., & Saracevic, T. (2001). Searching the web: The public and their queries. Journal of the American Society for Information Science and Technology, 52(3), 226-234.   DOI
7 Voorhees, E. M. (1993). Using WordNet to disambiguate word senses for text retrieval. Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 171-180.
8 Voorhees, E. M. (1994). Query expansion using lexicalsemantic relations. New York, NY: Springer-Verlag.
9 Zhai, C., & Lafferty, J. (2004). A study of smoothing methods for language models applied to information retrieval. ACM Transactions on Information Systems, 22(2), 179-214.   DOI   ScienceOn
10 Zhai, C. X. (2008). Statistical language models for information retrieval: A critical review. Foundations and Trends in Information Retrieval, 2(3), 137-213.
11 Baillie, M., Azzopardi, L., & Crestani, F. (2006). Adaptive query-based sampling of distributed collections. Lecture Notes in Computer Science, 4209, 316.
12 Bendersky, M., Croft, W. B., & Smith, D. A. (2009). Twostage query segmentation for information retrieval. Paper presented at the Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, Boston, MA.
13 Buckley, C., & Harman, D. (2004). Reliable information access final workshop report. ARDA Northeast Regional Research Center Technical Report.
14 Buckley, C., Salton, G., Allan, J., & Singhal, A. (1995). Automatic query expansion using SMART: TREC 3. Overview of the Third Text REtrieval Conference (TREC-3), 500-225.
15 Gale, W. A., Church, K. W., & Yarowsky, D. (1992). A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26(5), 415-439.   DOI
16 Guo, J., Xu, G., Li, H., & Cheng, X. (2008). A unified and discriminative model for query refinement. Paper presented at the Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval.
17 Harman, D. (1992). Relevance feedback revisited. Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, 1-10.
18 Jing, Y., & Croft, W. B. (1994). An association thesaurus for information retrieval. Proceedings of RIAO, 94(1994), 146-160.
19 Klyuev, V., & Haralambous, Y. (2011). Query expansion: Term selection using the ewc semantic relatedness measure. Paper presented at the Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on.
20 Metzler, D., & Croft, W. B. (2004). Combining the language model and inference network approaches to retrieval. Information processing & management, 40(5), 735-750.   DOI   ScienceOn
21 Mihalcea, R. (2003). Turning WordNet into an information retrieval resource: Systematic polysemy and conversion to hierarchical codes. International Journal of Pattern Recognition and Artificial Intelligence, 17(05), 689-704. doi: doi:10.1142/S0218001403002605   DOI   ScienceOn
22 Mihalcea, R., & Csomai, A. (2007). Wikify!: Linking documents to encyclopedic knowledge. Paper pre-sented at the Proceedings of the sixteenth ACM conference on Information and knowledge management, Lisbon, Portugal.
23 Milne, D., & Witten, I. H. (2008). Learning to link with Wikipedia. Paper presented at the Proceedings of the 17th ACM conference on Information and knowledge management, Napa Valley, CA.
24 Navigli, R. (2009). Word sense disambiguation: A survey. ACM Computing Surveys (CSUR), 41(2), 10.
25 Prakash, R. S. S., Jurafsky, D., & Ng, A. Y. (2007). Learning to merge word senses. Computer Science Department, Stanford University.
26 Qiu, Y., & Frei, H. P. (1993). Concept based query expansion. Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 160-169.