• Title/Summary/Keyword: MEDLARS

Development of New Retrieval Performance Measures for Query Reformulation Algorithms (질의 재구성 알고리즘의 검색성능을 측정하기 위한 새로운 평가 방법의 개발)

  • Kim, Nam-Ho;French, James-C.;Brown, Donald-E.
    • The Transactions of the Korea Information Processing Society, v.4 no.4, pp.963-972, 1997
  • In information retrieval, query reformulation algorithms construct queries from a set of initial input and feedback documents, and retrieval performance can vary with different sets of input documents. In this study, we developed a criterion for measuring the performance sensitivity of query reformulation algorithms to input sets. In addition, we propose a way of measuring the changes in retrieved area (CIRA) during query reformulation. We compared the CIRAs of query reformulation algorithms (i.e., query tree, DNF method, and Dillon's method) using three test sets: CACM, CISI, and Medlars. In the experiments, the query tree showed the largest decrease in CIRA during reformulations, which means the fastest convergence rate to an output set. In the sensitivity analysis, the query tree scored the highest sensitivity to different input sets, even though its differences from the other algorithms are very small.

Sensitivity Analysis of Decision Tree's Learning Effectiveness in Boolean Query Reformulation (불리언 질의 재구성에서 의사결정나무의 학습 성능 감도 분석)

  • 윤정미;김남호;권영식
    • Journal of the Korean Operations Research and Management Science Society, v.23 no.4, pp.141-149, 1998
  • One of the difficulties in using current Boolean-based information retrieval systems is that it is hard for a user, especially a novice, to formulate an effective Boolean query. One solution to this problem is to let the system formulate a query for the user from his or her relevance feedback documents. In this research, an intelligent query reformulation mechanism based on ID3 is proposed, and the sensitivity of its retrieval effectiveness (i.e., recall, precision, and E-measure) to various input settings is analyzed. The parameter in the input settings is the number of relevant documents. Experiments conducted on the Medlars test set revealed that the effectiveness of the proposed system is in fact sensitive to the number of initial relevant documents. The case with two or more initial relevant documents outperformed the case with one initial relevant document with statistical significance. It is our conclusion that formulation of an effective query in the proposed system requires at least two relevant documents in its initial input set.
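
The three effectiveness measures named in this abstract can be illustrated with a minimal sketch. It assumes the standard van Rijsbergen form of the E-measure, E = 1 - (1 + b^2)PR / (b^2 P + R); the abstract does not state which weight b the authors used, so b = 1 below is an assumption. The function names and the document identifiers are hypothetical and are not taken from the paper.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query, given two collections of doc ids."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)  # relevant documents that were retrieved
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def e_measure(precision, recall, b=1.0):
    """van Rijsbergen's E-measure; lower is better. b = 1 is an assumed weight."""
    if precision == 0.0 or recall == 0.0:
        return 1.0  # worst case; also avoids division by zero
    return 1.0 - ((1 + b * b) * precision * recall) / (b * b * precision + recall)


# Hypothetical example: 4 documents retrieved, 3 relevant in total, 2 in common.
retrieved = ["d1", "d2", "d3", "d4"]
relevant = ["d2", "d4", "d7"]
p, r = precision_recall(retrieved, relevant)
e = e_measure(p, r)
print(f"precision={p:.3f} recall={r:.3f} E={e:.3f}")
```

With b = 1 the E-measure equals one minus the harmonic mean of precision and recall, so a sensitivity study of the kind described would compare these values across different numbers of initial relevant documents.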

Interactive Information Retrieval: An Introduction

  • Borlund, Pia
    • Journal of Information Science Theory and Practice, v.1 no.3, pp.12-32, 2013
  • The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented: The MEDLARS test, the Book House fiction retrieval system, and the OKAPI project. On this basis the call for alternative IIR evaluation approaches motivated by the three revolutions (the cognitive, the relevance, and the interactive revolutions) put forward by Robertson & Hancock-Beaulieu (1992) is presented. As a response to this call the 'IIR evaluation model' by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which the test instrument of a simulated work task situation plays a central part.

Time Complexity Analysis of Boolean Query Formulation Algorithms (불리언 질의 구성 알고리즘의 시간복잡도 분석)

  • Kim, Nam-Ho;Donald E. Brown;James C. French
    • The Transactions of the Korea Information Processing Society, v.4 no.3, pp.709-719, 1997
  • The performance of an algorithm can be measured from several aspects. Suppose there is a query formulation algorithm. Even though this algorithm shows high retrieval performance, i.e., high recall and precision, retrieving items can take a long time. In this study, we analyze the time complexity of automatic query reformulation algorithms, namely the query tree, the DNF method, and Dillon's method, and compare them in theoretical and practical aspects using real-time performance (the absolute time each algorithm takes to formulate a query) on a Sun SparcStation 2. In experiments using three test sets, CACM, CISI, and Medlars, the query tree algorithm was the fastest among the three algorithms tested.
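
The idea of measuring the absolute time an algorithm takes to formulate a query can be sketched as follows. This is only an illustration, not the authors' procedure: the abstract gives no details of their timing setup, and `formulate_or_query` is a hypothetical stand-in that simply ORs together every term in the relevant documents. It is not the query tree, the DNF method, or Dillon's method.

```python
import time


def formulate_or_query(relevant_docs):
    """Hypothetical stand-in algorithm: OR together all distinct terms."""
    terms = sorted({term for doc in relevant_docs for term in doc})
    return " OR ".join(terms)


def time_formulation(algorithm, relevant_docs, repeats=5):
    """Run `algorithm` several times and return (query, best wall-clock seconds)."""
    best = float("inf")
    query = None
    for _ in range(repeats):
        start = time.perf_counter()
        query = algorithm(relevant_docs)
        best = min(best, time.perf_counter() - start)
    return query, best


# Hypothetical input: each relevant document is a list of index terms.
docs = [["heart", "disease"], ["heart", "surgery"]]
query, seconds = time_formulation(formulate_or_query, docs)
print(query, f"({seconds:.6f} s)")
```

Taking the minimum over several repeats is one common way to reduce noise from other processes; averaging is an equally reasonable choice, and which one the paper used is not stated.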
