An Implementation of Best Match Algorithm for Korean Text Retrieval in the Client/Server Environment

클라이언트 서버 환경에서 한글텍스트 검색을 위한 베스티매치 알고리즘의 구현

  • Published : 2001.03.01

Abstract

This paper presents the application of best match search algorithm in the client/server system for natural language access to Web-based database. For this purpose, the procedures to process Korean word variants as well as to execute probabilistic weighting scheme have been implemented in the client/server system. The experimental runs have been done using a Korean test set which included documents, queries and relevance judgements. The experimental results demonstrate that best match retrieval with relevance information is better than the retrieval without it.

Keywords

References

  1. Journal of the American Society for Information Science v.50 Stemming Methodologies over Individual Query Words for an Arabic Information Retrieval System Abu Salem, H.;M. Al Omari;M. W. Evens
  2. The Journal of Computer Text Processing v.6 Comparison of n-gram Matching and Stemming for Term Conflation in English, Malay and Turkish Texts Ekmekcioglu, F. C.(et al.)
  3. Fourth Annual Symposium on Document Analysis and Information Retrieval Full-text Search and Document Recognition of Japanese Text Fujisawa, H;K. Marukawa
  4. 22nd Annual Colloquium on Information Retrieval Research A Probabilistic Approach to Chinese Information Retrieval : Theory nd Experiments Huang, X.;S. Robertson
  5. Journal of Korea Information Management Society v.11 A Development of the Test Set for Estimating the Retrieval Performance of an Automatic Indexer Kim, S. H(et al.)
  6. Journal of the American Society for Information Science v.47 Cheshire II : Designing a Next-generation Online Catalog Larson, R. R.(et al.)
  7. Automatic Text Processing for Korean Language Free Text Retrieval Lee, H. S.
  8. In submission to Information Processing & Management Effectiveness of the Korean Stemmer for Word Conflation Lee, H. S.;P. Willett
  9. Communications of the ACM v.39 Natural Language Processing for Information Retrieval Lewis, D. D;Karen Sparck Jones
  10. Journal of Information Science v.6 A Review of the Use of Inverted Files for Best Match Searching in Information Retrieval System Perry, S. A.;P. Willett
  11. Journal of Documentation v.33 The probability ranking principle in information retrieval Robertson, S. E.
  12. Journal of the American Society for Information Science v.27 Relevance Weightiing of Search Terms Robertson, S. E.;Karen Sparck Jones
  13. Literary and Linguistic Computing v.8 A Comparison of Spelling-Correction Methods for the Identification of Word Forms in Historical Text Databases Robertson, A. M.;P. Willett
  14. Information Processing & Management Term Weighting Approaches in Automatic Text Retrieval Salton, G;C. Buckley
  15. ACM SIGIR Forum v.16 The Nearest Neighbour Problem in Information Retrieval : an Algorithm Using Upperbounds Smeaton, A. F.;C. J. van Rijsbergen
  16. Journal of Documentation v.35 Search Term Relevance Weighting Given Little Relevance Information Sparck Jones, K.
  17. Information Processing & Management v.17 The Selection of Good Search Terms van Rijsbergen, C. J.;D. J. Harper;M. F. Porter
  18. Document Retrieval System Willett, P.(ed.)
  19. The OKAPI Online Catalogue Research Projects, In: Readings in Information Retrieval Walker, S;K. Sparck Jones(ed.);P. Willett(ed.)