• Title/Summary/Keyword: relevant information retrieval


A Model of Natural Language Information Retrieval Using Main Keywords and Sub-keywords (주 키워드와 부 키워드를 이용한 자연언어 정보 검색 모델)

  • Kang, Hyun-Kyu;Park, Se-Young
    • The Transactions of the Korea Information Processing Society / v.4 no.12 / pp.3052-3062 / 1997
  • Information retrieval (IR) aims to retrieve relevant information that satisfies a user's information needs. However, a major role of IR systems is not just to generate sets of relevant documents but to help determine which documents are most likely to be relevant to the given requirements. Various attempts have been made in the recent past to use syntactic analysis methods to generate the complex constructions that are essential for content identification in automatic text analysis systems. Unfortunately, methods based on syntactic understanding alone are known to be insufficiently powerful to produce complete analyses of arbitrary text samples. In this paper, we present a document ranking method based on two-level ranking. The first level is used to retrieve the documents, and the second level to reorder the retrieved documents. The main keywords used in the first level are defined as nouns and/or compound nouns that possess good document discrimination power. The sub-keywords used in the second level are defined as adjectives, adverbs, and/or verbs that are not main keywords, together with function words. An empirical study was conducted on a Korean encyclopedia with 23,113 entries and 161 Korean natural language queries collected from end users. 85.0% of the natural language queries contained sub-keywords. The two-level document ranking method provides a significant improvement in retrieval effectiveness over traditional ranking methods.
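
The two-level idea in this abstract can be illustrated with a minimal sketch. The abstract does not say how the second level reorders documents, so the scoring below is an assumption: documents are retrieved only if they contain a main keyword, and sub-keyword matches then break ties among documents with equal main-keyword scores. The function name `two_level_rank` and the whitespace tokenization are hypothetical.

```python
from collections import Counter


def two_level_rank(query_main, query_sub, docs):
    """Rank documents by main-keyword matches, then by sub-keyword matches.

    query_main: list of main keywords (e.g. nouns)
    query_sub:  list of sub-keywords (e.g. adjectives, adverbs, verbs)
    docs:       dict mapping document id to its text
    """
    ranked = []
    for doc_id, text in docs.items():
        tokens = Counter(text.lower().split())
        main_score = sum(tokens[k] for k in query_main)
        if main_score == 0:
            # Level 1: only documents matching a main keyword are retrieved.
            continue
        sub_score = sum(tokens[k] for k in query_sub)
        ranked.append((doc_id, main_score, sub_score))
    # Level 2 (assumed): reorder retrieved documents, using the sub-keyword
    # score to break ties; the document id keeps the order deterministic.
    ranked.sort(key=lambda r: (-r[1], -r[2], r[0]))
    return [doc_id for doc_id, _, _ in ranked]


docs = {
    "d1": "volcano erupts violently near the island",
    "d2": "volcano island tourism guide",
    "d3": "river flows slowly",
}
print(two_level_rank(["volcano"], ["violently"], docs))
```

Here `d3` is never retrieved because it lacks the main keyword, and `d1` is placed ahead of `d2` only because of the sub-keyword match.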


An Analysis of the Effect of an Ontology-Based Information Searching Model as a Supplementary Learning Tool (학습 보조 도구로서 온톨로지 검색 모델의 효과 분석)

  • Choi, Sook-Young
    • The Journal of Korean Association of Computer Education / v.14 no.1 / pp.159-168 / 2011
  • This study analyzed whether an ontology-based information-searching model affected students' ability to search effectively for meaningful information for their projects. The experimental results showed that the amount of relevant information found with the ontology-based information retrieval (OIR) method was significantly greater than with the existing information retrieval (EIR) method. In addition, the relevance rate of the bookmarked documents found with the OIR method was significantly greater than that of the EIR method. Interviews showed that the OIR model helped students find information effectively and thus complete the project more easily. Furthermore, the OIR model helped them understand the subordinate concepts of an important learning concept and the relationships among them. The results of this study indicate that the OIR model could be used as a supplementary learning tool for project-based learning.


The Kernel Trick for Content-Based Media Retrieval in Online Social Networks

  • Cha, Guang-Ho
    • Journal of Information Processing Systems / v.17 no.5 / pp.1020-1033 / 2021
  • Nowadays, online and mobile social network services (SNS) are very popular and widely used in society and daily life to instantly share, disseminate, and search for information. In particular, SNS such as YouTube, Flickr, Facebook, and Amazon allow users to upload billions of images or videos and also provide a large amount of multimedia information to users. Information retrieval in multimedia-rich SNS is a very useful but challenging task. Content-based media retrieval (CBMR) is the process of obtaining the relevant image or video objects for a given query from a collection of information sources. However, CBMR suffers from the curse of dimensionality because of the inherently high-dimensional features of media data. This paper investigates the effectiveness of the kernel trick in CBMR, specifically kernel principal component analysis (KPCA) for dimensionality reduction. KPCA is a nonlinear extension of linear principal component analysis (LPCA) that discovers nonlinear embeddings using the kernel trick. The fundamental idea of KPCA is to map the input data into a high-dimensional feature space through a nonlinear kernel function and then compute the principal components in that mapped space. This paper investigates the potential of KPCA in CBMR for feature extraction or dimensionality reduction. Using the Gaussian kernel in our experiments, we compute the principal components of an image dataset in the transformed space and then use them as new feature dimensions for the image dataset. Moreover, KPCA can be applied to many other domains besides CBMR where LPCA has been used to extract features and where the nonlinear extension would be effective. Our results from extensive experiments demonstrate that the potential of KPCA is very encouraging compared with LPCA in CBMR.
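
As a rough illustration of the KPCA procedure this abstract describes, the sketch below computes a Gaussian kernel matrix, centers it in feature space, and projects the data onto the leading components. It is a textbook formulation written with NumPy, not the paper's implementation; the function name `gaussian_kpca` and the parameter `gamma` (the kernel width) are assumptions.

```python
import numpy as np


def gaussian_kpca(X, n_components, gamma):
    """Project the rows of X onto the top kernel principal components.

    Uses the Gaussian kernel k(x, y) = exp(-gamma * ||x - y||^2).
    """
    # Pairwise squared Euclidean distances between all rows of X.
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    K = np.exp(-gamma * np.maximum(d2, 0.0))

    # Center the kernel matrix, which centers the data in feature space.
    n = K.shape[0]
    one = np.full((n, n), 1.0 / n)
    Kc = K - one @ K - K @ one + one @ K @ one

    # eigh returns eigenvalues in ascending order, so pick the largest ones.
    vals, vecs = np.linalg.eigh(Kc)
    idx = np.argsort(vals)[::-1][:n_components]
    vals, vecs = vals[idx], vecs[:, idx]

    # Projection of the training points onto component k is
    # sqrt(eigenvalue_k) times the unit eigenvector k.
    return vecs * np.sqrt(np.maximum(vals, 0.0))


rng = np.random.default_rng(0)
features = rng.normal(size=(20, 5))   # stand-in for image feature vectors
reduced = gaussian_kpca(features, n_components=3, gamma=0.5)
print(reduced.shape)
```

The columns of the result serve as the new feature dimensions; projecting unseen queries would additionally require the kernel values between the query and the training points, which this sketch omits.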

The study on the retrieval effectiveness of meta-search engine on the internet (인터넷상의 메타탐색엔진의 검색효율성 비교연구)

  • 김성희
    • Journal of Korean Library and Information Science Society / v.27 / pp.457-483 / 1997
  • This study compared the effectiveness of SavvySearch and MetaCrawler in terms of the total number of relevant documents retrieved, precision, recall, and the number of dead links. It also examined whether the meta-search engines and general web search engines retrieved different web documents. SavvySearch achieved higher precision and recall than MetaCrawler, while MetaCrawler had a lower dead-link ratio than SavvySearch. In addition, the meta-search engines were more effective than the general web search engines. The results show that the hybrid methodology of integrating a variety of web search engines can help solve retrieval effectiveness problems on the Internet.
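
The measures named in this abstract can be computed per query as in the sketch below. These are the standard set-based definitions of precision and recall; how the study estimated the set of relevant documents on the web is not stated, so the `relevant` set here is simply assumed to be given. The function name `evaluate` and the sample URLs are hypothetical.

```python
def evaluate(retrieved, relevant, dead):
    """Return (precision, recall, dead_link_ratio) for one query.

    retrieved: list of result URLs returned by the engine
    relevant:  set of URLs judged relevant to the query
    dead:      set of URLs that no longer resolve
    """
    retrieved = list(retrieved)
    if not retrieved:
        return 0.0, 0.0, 0.0
    hits = sum(1 for url in retrieved if url in relevant)
    precision = hits / len(retrieved)
    recall = hits / len(relevant) if relevant else 0.0
    dead_ratio = sum(1 for url in retrieved if url in dead) / len(retrieved)
    return precision, recall, dead_ratio


precision, recall, dead_ratio = evaluate(
    retrieved=["a", "b", "c", "d"],
    relevant={"a", "c", "e"},
    dead={"d"},
)
print(precision, recall, dead_ratio)
```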


Retrieval Effectiveness of the Two Indexing Systems in the Water Resources : A Qualitative Analysis (수자원분야 색인시스템의 검색효율 비교와 질적 분석)

  • Lee Myeong-Hee
    • Journal of the Korean Society for Library and Information Science / v.30 no.1 / pp.49-67 / 1996
  • The previous study showed a large variation in performance across queries and suggested that characteristics of the queries contribute to retrieval performance. Three attributes, specificity, complexity, and recency, were used to analyze the differing results across queries. The results showed that subject searching retrieves more relevant documents for a query with low specificity than for a query with high specificity, and that queries drawn from doctoral students' dissertations were highly specific.


A Wikipedia-based Query Expansion Method for In-depth Blog Distillation (주제를 깊이 있게 다루는 블로그 피드 검색을 위한 위키피디아 기반 질의 확장 방법)

  • Song, Woo-Sang;Lee, Ye-Ha;Lee, Jong-Hyeok;Yang, Gi-Joo
    • Journal of KIISE:Computing Practices and Letters / v.16 no.11 / pp.1121-1125 / 2010
  • This paper proposes a Wikipedia-based feedback method for in-depth blog distillation, whose goal is to find blogs that present in-depth thoughts or analysis on a given query. The proposed method uses Wikipedia articles that are relevant to the query. The TREC Blogs08 collection, a large-scale blog corpus, and an English Wikipedia dump were used for the experiments. The proposed method significantly improved retrieval performance, including MAP, over the conventional post-based feedback method.
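
The abstract does not describe how terms are taken from the Wikipedia articles, so the following is only a generic feedback-style query expansion sketch, not the paper's method: it counts terms in a set of feedback texts (standing in for Wikipedia articles relevant to the query) and appends the most frequent ones to the query. The function name `expand_query`, the stopword list, and the sample texts are all hypothetical.

```python
from collections import Counter

# Tiny illustrative stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "and", "in", "is", "to"}


def expand_query(query_terms, feedback_docs, n_terms=3):
    """Append the n_terms most frequent feedback terms to the query."""
    counts = Counter()
    for text in feedback_docs:
        for token in text.lower().split():
            if token not in STOPWORDS and token not in query_terms:
                counts[token] += 1
    expansion = [term for term, _ in counts.most_common(n_terms)]
    return list(query_terms) + expansion


feedback = [
    "jazz improvisation and swing",
    "swing improvisation in jazz harmony",
    "improvisation is central to jazz",
]
print(expand_query(["jazz"], feedback, n_terms=2))
```

Weighting terms by raw frequency is the simplest possible choice; feedback methods usually weight candidate terms with a retrieval model instead.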

An XML-based Multimedia News Management System (XML 기반 멀티미디어 뉴스 관리 시스템)

  • Kim Hyon Hee;Park Seung Soo
    • The KIPS Transactions:PartB / v.11B no.7 s.96 / pp.785-792 / 2004
  • With recent progress in multimedia computing technologies, it has become necessary to retrieve diverse types of multimedia data based on multimedia content and the relationships among them. However, unlike alphanumeric data, it is difficult to provide relevant multimedia information because multimedia contents and their relationships are implicit in the multimedia data. As a result, in multimedia news service systems, a representative multimedia application, most news services provide relevant news only for text articles, and retrieval of multimedia news such as video or image news is provided independently. In this paper, we present an XML-based multimedia news management system that provides integration, retrieval, and delivery of relevant multimedia news. Our data model, composed of media objects, relationship objects, and view objects, represents diverse types of multimedia news content and semantically related multimedia news. In addition, the proposed view mechanism makes it possible to customize multimedia news and therefore to deliver it efficiently.

Relevant Image Retrieval of Korean Documents based on Sentence and Word Importance (문장 및 단어 중요도를 통한 한국어 문서 연관 이미지 검색)

  • Kim, Nam-Gyu;Kang, Shin-Jae
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.3 / pp.43-48 / 2019
  • When readers encounter unknown words while reading text-only documents, they lose focus and may be unable to understand the content. Because children have little experience, they find it difficult to understand a text correctly if a description is unfamiliar or ambiguous in context. In this paper, to aid understanding of the text and increase readers' interest, we analyze the text of documents, select the content considered important, and implement a system that automatically retrieves the most relevant images from the web and links the text and images together. The system divides an article into paragraphs, analyzes the text, selects the important sentences of each paragraph and the important words that best represent the meaning of those sentences, searches the web for images related to the words, and then links the images to the corresponding paragraphs. The experiments examined how to select important sentences and how to select important words within them. In an evaluation of the relation between three selected images and the corresponding important sentences, the system achieved 60% accuracy.

A Hybrid Information Retrieval Model Using Metadata and Text (메타데이타와 텍스트 정보의 통합검색 모델)

  • Yoo, Jeong-Mok;Myaeng, Sung-Hyon;Kim, Sung-Soo;Lee, Mann-Ho
    • Journal of KIISE:Databases / v.34 no.3 / pp.232-243 / 2007
  • A metadata IR model has high precision and low recall because its queries are strict, that is, they can express the user's information need exactly, while a full-text IR model has low precision and high recall because its queries are simple keyword queries that express the information need only roughly. If users can translate their information needs into structured queries well, retrieval results improve. However, it is hardly possible to formulate a relevant query without understanding the characteristics of the metadata. Unfortunately, most users are not interested in metadata and therefore cannot construct well-formed structured queries. Moreover, metadata contains less information than the text. In this paper, we propose a hybrid IR model using metadata and text that can provide users with many relevant documents by retrieving from metadata fields and text fields complementarily.
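
One simple way to picture retrieving from metadata and text fields complementarily is a combined score, sketched below. The abstract does not give the paper's scoring model, so the linear combination, the `meta_weight` parameter, the function name `hybrid_search`, and the document layout are all assumptions made for illustration.

```python
def hybrid_search(docs, text_terms, meta_filters, meta_weight=2.0):
    """Rank documents by keyword matches in text plus exact metadata matches.

    docs:         list of dicts with keys "id", "meta" (dict), and "text"
    text_terms:   keywords matched against the full text
    meta_filters: dict of metadata field -> required value
    """
    results = []
    for doc in docs:
        words = doc["text"].lower().split()
        text_score = sum(words.count(term) for term in text_terms)
        meta_score = sum(
            1 for field, value in meta_filters.items()
            if doc["meta"].get(field) == value
        )
        score = text_score + meta_weight * meta_score
        if score > 0:
            # A document is kept if either the text or the metadata matches.
            results.append((score, doc["id"]))
    results.sort(key=lambda r: (-r[0], r[1]))
    return [doc_id for _, doc_id in results]


docs = [
    {"id": "d1", "meta": {"author": "kim"}, "text": "retrieval models for text"},
    {"id": "d2", "meta": {"author": "lee"}, "text": "metadata retrieval retrieval"},
    {"id": "d3", "meta": {"author": "kim"}, "text": "unrelated topic"},
]
print(hybrid_search(docs, ["retrieval"], {"author": "kim"}))
```

In this example `d2` is found only through its text and `d3` only through its metadata, while `d1` ranks first because it matches both.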

Mobile Software Agents for Information Retrieval in WWW-Databases

  • Baek, Seong-Min;Chung, Jae-Yong
    • 제어로봇시스템학회:학술대회논문집 / 2002.10a / pp.75.6-75 / 2002
  • Current database technology makes it possible to store giant amounts of data, and worldwide networks provide the technical basis for everybody to access it. However, it is usually very time-consuming or even impossible to find the most relevant information. This article describes the use of mobile software agents to query different databases on the Internet, to rate and compress the results, and to present them to the user in a consistent form. It contains a general definition of software agents, a detailed description of the approach, and a discussion of its main advantages and weaknesses.
