• 제목/요약/키워드: information Retrieval(IR)

검색결과 85건 처리시간 0.019초

Interactive Information Retrieval: An Introduction

  • Borlund, Pia
    • Journal of Information Science Theory and Practice
    • /
    • 제1권3호
    • /
    • pp.12-32
    • /
    • 2013
  • The paper introduces the research area of interactive information retrieval (IIR) from a historical point of view. Further, the focus here is on evaluation, because much research in IR deals with IR evaluation methodology due to the core research interest in IR performance, system interaction and satisfaction with retrieved information. In order to position IIR evaluation, the Cranfield model and the series of tests that led to the Cranfield model are outlined. Three iconic user-oriented studies and projects that all have contributed to how IIR is perceived and understood today are presented: The MEDLARS test, the Book House fiction retrieval system, and the OKAPI project. On this basis the call for alternative IIR evaluation approaches motivated by the three revolutions (the cognitive, the relevance, and the interactive revolutions) put forward by Robertson & Hancock-Beaulieu (1992) is presented. As a response to this call the 'IIR evaluation model' by Borlund (e.g., 2003a) is introduced. The objective of the IIR evaluation model is to facilitate IIR evaluation as close as possible to actual information searching and IR processes, though still in a relatively controlled evaluation environment, in which the test instrument of a simulated work task situation plays a central part.

인덱스 그래프 : 동적 문서 데이터베이스를 위한 IR 인덱스 구조 (Index Graph : An IR Index Structure for Dynamic Document Database)

  • 박병권
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제10권1호
    • /
    • pp.257-278
    • /
    • 2001
  • An IR(information retrieval) index for dynamic document databases where insertion, deletion, and update of documents happen frequently should be frequently updated. As the conventional structure of IR index is, however, focused on the information retrieval purpose, its structure is inefficient to handle dynamic update of it. In this paper, we propose a new structure for IR Index, we call it Index Graph, which is organized by connecting multiple indexes into a graph structure. By analysis and experiment, we prove the Index Graph is superior to the conventional structure of IR index in the performance of insertion, deletion, and update of documents as well as the performance of information retrieval.

  • PDF

Topic Level Disambiguation for Weak Queries

  • Zhang, Hui;Yang, Kiduk;Jacob, Elin
    • Journal of Information Science Theory and Practice
    • /
    • 제1권3호
    • /
    • pp.33-46
    • /
    • 2013
  • Despite limited success, today's information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queries. However, existing IR approaches such as query expansion are not overly effective because they make little effort to analyze and exploit the meanings of the queries. Furthermore, word sense disambiguation approaches, which rely on textual context, are ineffective against weak queries that are typically short. Motivated by the demand for a robust IR system that can consistently provide highly accurate results, the proposed study implemented a novel topic detection that leveraged both the language model and structural knowledge of Wikipedia and systematically evaluated the effect of query disambiguation and topic-based retrieval approaches on TREC collections. The results not only confirm the effectiveness of the proposed topic detection and topic-based retrieval approaches but also demonstrate that query disambiguation does not improve IR as expected.

WWW에서 데이터베이스와 검색엔진의 연동을 통한 SGML 검색시스템의 구현 (Implementation of SGML Retrieval System through Interoperability with Database and Search Engine based on WWW)

  • 김낙현;정수용;노명호
    • 한국전자거래학회:학술대회논문집
    • /
    • 한국전자거래학회 1999년도 학술대회지 vol.2
    • /
    • pp.575-586
    • /
    • 1999
  • The advent of the Internet and the enormous increase in volume of electronically stored information (SGML, Image, Sound, etc.) has led to substantial work on IR(Information Retrieval). To service on the WWW, construction and retrieval technology of SGML, which is the fundamental standard data format for CALS/EC, is needed specially. Due to such a change, it becomes essential to change the existing paradigm of conventional information retrieval systems and to adopt new Internet service system with search engine, SGML browser and advanced Internet technology on WWW. KIPRIS(Korea Industrial Property Rights Information Service), which is the specialized and integrated Internet service systems in the field of industrial property rights information service, is trying to be a guide for our country to establish its technological competitiveness with providing the online service of high quality. The objective of the paper identifies features and technologies of KIPRIS IR(Information Retrieval) system based on WWW as follows. First, it describes the development background and process of KIPRIS. Second, it presents a fundamental technology that consists of IR(Information Retrieval) concept, BRS(Bibliographical Retrieval System) search engine, SGML implementation technologies and the Internet/WWW technologies. Third, it provides information about system configuration, architecture, and the features and characteristics of KIPRIS. Finally, the implemented KIPRIS system is introduced.

  • PDF

개념적 데이터 모델링과 정보검색 시스템 디자인 (Conceptual Data Modeling and Information Retrieval System Design)

  • 오삼균
    • 한국문헌정보학회지
    • /
    • 제33권4호
    • /
    • pp.133-156
    • /
    • 1999
  • 이 논문의 목적은 개념적인 데이터 모델링이 기존의 정보 검색(IR) 시스템을 어떤 식으로 보다 향상시킬 수 있는지를 보여주는 것이다. 개념적인 데이터베이스 디자인은 1)개체들간의 관계에 기반하여 새로운 지식을 발견해 내는 데이터 마이닝 능력과 2)기존의 개별적으로 분리된 데이터베이스를 하나의 정보검색 시스템 안으로의 결합을 위해 사용된다 (예: ISI 인용, 시소러스, 서지 데이터베이스를 하나의 정보검색 시스템 안에 결집시킴). 더 나아가서, 개념적인 모델링은 수정을 용이하게 하므로, 새로운 이용자의 요구가 가미될 때마다, 개념적인 데이터 모델링에 기반한 정보검색 시스템을 수정하는 것은 기존의 정보검색 시스템 상에서보다 훨씬 수월해질 수 있다. 보다 향상된 개체-관계(Entity-Relationship) 모델이 이 논문에서 다룬 정보검색 데이터의 개념적 스키마를 개발하는데 사용되었다.

  • PDF

인터액티브 정보검색 모형 (Interactive Information Retrieval (IR) Models: Tradition and Development)

  • 김양우
    • 정보관리학회지
    • /
    • 제24권2호
    • /
    • pp.45-69
    • /
    • 2007
  • 본 논문은 다음과 같은 두 부분으로 구성된다. 논문의 전반부는 네 개의 정보검색 모형을 다루고 있는데 이는 전통적 정보검색 모형과 보다 최근에 나온 세 연구자의 이용자 중심 인터액티브 모형을 포함한다. 인터액티브 정보검색 모형은 Belkin, Ingwersen, 그리고 Saracevic에 의하여 제시된 것인데, 전통적 정보검색 모형을 포함한 각 모형의 장점과 한계점이 기술된다. 논문의 후반부에서 저자는 이상과 같은 모형들에 관한 분석을 토대로 그 자신의 인터액티브 모형, 즉 빙산모형(Iceberg Model)을 제시하고 있다. 빙산모형의 타당성으로 다음과 같은 세 가지 사항을 강조하고 있는데, 즉, 보다 구체화된 시스템 특성의 포함, 보다 명확한 인터액티브 정보검색 요소간의 상호작용, 그리고 정보매개자의 증가된 역할 등이 그것이다. 요약하면, 빙산모형은 변화하는 정보추구환경에서 진화할 수 있는 틀을 제시하고 있다.

REALM을 이용한 한국어 오픈도메인 질의 응답 (REALM for Open-domain Question Answering of Korean)

  • 강동찬;나승훈;최윤수;이혜우;장두성
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리)
    • /
    • 한국정보과학회언어공학연구회 2020년도 제32회 한글 및 한국어 정보처리 학술대회
    • /
    • pp.192-196
    • /
    • 2020
  • 최근 딥러닝 기술의 발전에 힘입어 오픈 도메인 QA 시스템의 발전은 가속화되고 있다. 특히 IR 시스템(Information Retrieval)과 추출 기반의 기계 독해 모델을 결합한 접근 방식(IRQA)의 경우, 문서와 질문 각각을 연속 벡터로 인코딩하는 IR 시스템(Dense Retrieval)의 연구가 진행되면서 검색 성능이 전통적인 키워드 기반 IR 시스템에 비해 큰 폭으로 상승하였고, 이를 기반으로 오픈 도메인 질의응답의 성능 또한 개선 되었다. 본 논문에서는 경량화 된 BERT 모델을 기반으로 하여 Dense Retrieval 모델 ORQA와 REALM을 사전 학습하고, 한국어 오픈 도메인 QA에서 QA 성능과 검색 성능을 도출한다. 실험 결과, 키워드 기반 IR 시스템 BM25를 기반으로 했던 이전 IRQA 실험결과와 비교하여 더 적은 문서로 더 나은 QA 성능을 보였으며, 검색 결과의 경우, BM25의 성능을 뛰어넘는 결과를 보였다.

  • PDF

A Conceptual Framework for an Information Behavior Model Based on the Collaboration Perspective between User and System for Information Retrieval

  • Yangyuen, Wachira;Phetkaew, Thimaporn;Nuntapichai, Siwanath
    • Journal of Information Science Theory and Practice
    • /
    • 제8권3호
    • /
    • pp.30-46
    • /
    • 2020
  • This research aimed (1) to study and analyze the ability of current information retrieval (IR) systems based on views of information behavior (IB), and (2) to propose a conceptual framework for an IB model based on the collaboration between the system and user, with the intent of developing an IR system that can apply intelligent techniques to enhance system efficiency. The methods in this study consisted of (1) document analysis which included studying the characteristics and efficiencies of the current IR systems and studying the IB models in the digital environment, and (2) implementation of the Delphi technique through an indepth interview method with experts. The research results were presented in three main parts. First, the IB model was categorized into eight stages, different from traditional IB, in the digital environment, which can correspond to all behaviors and be applied to with an IR system. Second, insufficient functions and log file storage hinder the system from effectively understanding and accommodating user behavior in the digital environment. Last, the proposed conceptual framework illustrated that there are stages that can add intelligent techniques to the IR system based on the collaboration perspective between the user and system to boost the users' cognitive ability and make the IR system more user-friendly. Importantly, the conceptual framework for the IB model based on the collaboration perspective between the user and system for IR assisted the ability of information systems to learn, recognize, and comprehend human IB according to individual characteristics, leading to enhancement of interaction between the system and users.

Online Searching Behavior of Social Science Researchers' in IR Interfaces of E-journal Database Systems: A Study on JMI, JNU, and DU

  • Kumar, Shailendra;Rai, Namrata
    • Journal of Information Science Theory and Practice
    • /
    • 제1권4호
    • /
    • pp.48-66
    • /
    • 2013
  • The aim of this study is to examine the user's online searching behavior in IR interfaces of e-journal database systems. The study is purely based on survey methods and tries to analyse the online searching behavior of respondents of social science disciplines who were doing research in three target central universities of Delhi (i.e. DU, JMI, and JNU). For measuring the responses of the respondents in IR interfaces of e-journal database systems, a total of 396 questionnaires were distributed among the students and out of all, 305 responses were used for the study. The findings of the study reveal that most of the students were not using all the facilities offered in IR interfaces of e-journal database systems for their retrieval process and also encourages menu based searches rather than command based searching.

A Study on the DB-IR Integration: Per-Document Basis Online Index Maintenance

  • Jin, Du-Seok;Jung, Hoe-Kyung
    • Journal of information and communication convergence engineering
    • /
    • 제7권3호
    • /
    • pp.275-280
    • /
    • 2009
  • While database(DB) and information retrieval(IR) have been developed independently, there have been emerging requirements that both data management and efficient text retrieval should be supported simultaneously in an information system such as health care, customer support, XML data management, and digital libraries. The great divide between DB and IR has caused different manners in index maintenance for newly arriving documents. While DB has extended its SQL layer to cope with text fields due to lack of intact mechanism to build IR-like index, IR usually treats a block of new documents as a logical unit of index maintenance since it has no concept of integrity constraint. However, In the DB-IR integrations, a transaction on adding or updating a document should include maintenance of the posting lists accompanied by the document. Although DB-IR integration has been budded in the research filed, the issue will remain difficult and rewarding areas for a while. One of the primary reasons is lack of efficient online transactional index maintenance. In this paper, performance of a few strategies for per-document basis transactional index maintenance - direct index update, pulsing auxiliary index and posting segmentation index - will be evaluated. The result shows that the pulsing auxiliary strategy and posting segmentation indexing scheme, can be a challenging candidates for text field indexing in DB-IR integration.