Performance Evaluation and Analysis of Recent Information Retrieval Models

세 가지 정보 검색 모델의 성능 평가 및 분석

  • Published : 2001.06.01

Abstract

사용자가 질의와 문서 사이의 유사도를 효과적으로 계산하는 정보 검색 모델에 대한 많은 연구가 수행되어 왔다. 이러한 연구들에 의해 개발된 정보 검색 모델들은 우수한 검색 효과를 제공한다고 알려져 있다. 그러나 이들에 대한 분석 및 검색 효과에 대한 비교 평가가 수행되지 않았기 때문에, 정보 검색시스템의 개발시 어떠한 정보 검색 모델을 사용할 것인가에 대한 결정이 매우 어려운 실정이다. 본 연구에서는 최신정보 검색 모델인 피벗 문서 길이 정규화, 추론 네트워크 모델, 2-포아송 모델을 분석하고, 실험을 통하여 검색 효과에 대한 비교 평가를 수행한다.

Keywords

References

  1. Salton, G., 'Historical note: The past thirty years in information retrieval,' Journal of the American Society for Information Science, Vol. 38, No. 5, pp. 375-380, 1987 Salton, Gerard https://doi.org/10.1002/(SICI)1097-4571(198709)38:5<375::AID-ASI5>3.0.CO;2-3
  2. Lee, J.H., Kim, M.H. and Lee, Y.J., 'Ranking documents in thesaurus-based Boolean retireval systems,' Information Processing & Management, Vol. 30, No. 1, pp. 79-91, 1994 https://doi.org/10.1016/0306-4573(94)90025-6
  3. Singhal, A., Buckley, C. and Mitra, M., 'Pivoted document length normalization,' Proceedings of the 19th Annaul International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 21-29, 1996 https://doi.org/10.1145/243199.243206
  4. Turtle, H. and Croft, W.B., 'Evaluation of an inference network-based retrieval model,' ACM Transactions on Information Systems, Vol. 9, No. 3, pp. 187-222, 1991 https://doi.org/10.1145/125187.125188
  5. Robertson, S.E. and Walker, S., 'Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval,' Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 232-241, 1994
  6. Salton, G. and McGill, M.J., Introduction to Modern Information Retrieval, McGraw-Hill, Inc., 1983
  7. Harman, D., 'Overview of the 1st text retrieval conference,' Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 36-48, 1993
  8. J.H. Lee, 'Combining multiple evidence from different properties of weighting schemes,' Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 180-188, 1995 https://doi.org/10.1145/215206.215358
  9. Greiff, W.R., Croft, W.B. and Turtle, H., 'Computationally tractable probabilistic modeling of Boolean operators,' Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 119-128, 1997 https://doi.org/10.1145/258525.258547
  10. Maron, M.E. and Kuhns, J.L., 'On relevance, probabilistic indexing and information retreival,' Association for Computing Machinery, Vol. 7, No. 3, pp. 216-244, 1960 https://doi.org/10.1145/321033.321035
  11. Robertson, S.E. and Sparck Jones, K., 'Relevance weighting of search terms,' Journal of the American Society for Information Science, Vol. 27, pp. 129-146, 1976 https://doi.org/10.1002/asi.4630270302
  12. Harter, S.P., 'A probabilistic approach to automatic keyword indexing,' Journal of the American Society for Information Science, Vol. 26, pp. 197-206, 1975 https://doi.org/10.1002/asi.4630260402
  13. Robertson, S.E., Walke, S., Jones, S., Hancock-Beaulieu, M.M. and Gatford, M., 'Okapi at TREC-3,' Proceedings of the 3rd Text REtrieval Conference (TREC-3), pp. 109-126, 1995