Fusion Approach for Optimizing Web Search Performance

  • Yang, Kiduk (Kyungpook National University. Department of Library and Information Science)
  • Received : 2014.11.28
  • Accepted : 2015.03.09
  • Published : 2015.03.30

Abstract

This paper describes a Web search optimization study that investigates both static and dynamic tuning methods for optimizing system performance. We extended the conventional fusion approach by introducing a "dynamic tuning" process for optimizing the fusion formula that combines the contributions of diverse sources of evidence on the Web. By engaging in an iterative dynamic tuning process, in which we successively fine-tuned the fusion parameters based on cognitive analysis of immediate system feedback, we were able to significantly increase retrieval performance. Our results show that exploiting the richness of the Web search environment by combining multiple sources of evidence is an effective strategy.

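This paper reports on a Web fusion search study that used static and dynamic tuning methods to optimize system performance. Going beyond the conventional fusion approach, the study introduced a process called "dynamic tuning" and investigated how to generate a fusion formula that optimizes the contributions of the diverse information sources on the Web; its results show that exploiting the rich and varied data sources of the Web search environment is an effective strategy. In this study, an iterative dynamic tuning process, which fine-tunes the fusion parameters based on cognitive analysis of immediate system feedback, substantially improved retrieval performance.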
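The abstract does not give the fusion formula itself, so the following is only a minimal sketch of one common form: a weighted sum of min-max normalized scores from several evidence sources. The source names (`text`, `link`), the toy scores, and the weights are all hypothetical, and the helper functions are illustrative rather than the paper's implementation. The sketch shows how changing the fusion parameters changes the final ranking, which is the kind of effect a tuning process would inspect between iterations.

```python
from typing import Dict, List

Scores = Dict[str, float]  # document id -> raw score from one evidence source


def min_max_normalize(scores: Scores) -> Scores:
    """Rescale one source's scores to [0, 1] so that sources are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores identical: treat every document as equally strong.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def weighted_sum_fusion(sources: Dict[str, Scores],
                        weights: Dict[str, float]) -> Scores:
    """Combine normalized scores; sources without a weight contribute 0."""
    fused: Scores = {}
    for name, scores in sources.items():
        w = weights.get(name, 0.0)
        for doc, s in min_max_normalize(scores).items():
            fused[doc] = fused.get(doc, 0.0) + w * s
    return fused


def rank(fused: Scores) -> List[str]:
    """Order documents by descending fused score, breaking ties by id."""
    return sorted(fused, key=lambda d: (-fused[d], d))


# Hypothetical toy data: two evidence sources that disagree about d1 and d3.
sources = {
    "text": {"d1": 2.0, "d2": 1.0, "d3": 0.0},
    "link": {"d1": 0.0, "d2": 5.0, "d3": 10.0},
}

text_heavy = rank(weighted_sum_fusion(sources, {"text": 0.8, "link": 0.2}))
link_heavy = rank(weighted_sum_fusion(sources, {"text": 0.2, "link": 0.8}))
print(text_heavy)  # ['d1', 'd2', 'd3']
print(link_heavy)  # ['d3', 'd2', 'd1']
```

In this toy setting the ranking flips completely when the weight shifts from the text source to the link source. An analyst inspecting such output after each run could adjust the weights step by step, although the cognitive analysis described in the abstract is a human judgment process that this sketch does not model.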
