• Title/Summary/Keyword: Web Document Retrieval

Search Result 129, Processing Time 0.022 seconds

Design of XML Document Query Language(XQL) Supported Link Retrieval (링크 검색을 지원하는 XML 문서 질의 언어의 설계)

  • 김용훈;이강찬;이규철
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1998.10b
    • /
    • pp.350-352
    • /
    • 1998
  • 최근 들어서 사무자동화 시스템(Office Information System), 디지털 도서관(Digital Library), WWW(WorldWideWeb)등의 응용에서는 대량의 문서들의 정보를 효율적으로 저장하고 처리, 검색할 수 있는 기능을 요구하고 있다. 이에 대해 최근에 인터넷 기반의 무서 표준인 XML(eXtensible Markup Language)이 제시되었고, 이러한 XML 문서를 저장하고 처리, 검색하기 위한 다양한 연구들이 진행되고 있다. 그러나, 이러한 대부분의 연구들은 XML 문서의 구조적 정보만을 저장하고 검색하도록 설계되어 지고 있으며, XML 문서가 지닌 또 다른 정보인 링크 정보를 저장하고 검색하는 기능을 제공되지 않고 있다. 본 논문에서는 현재 파서나 브라우저 수준에서 제공해 주는 링크의 브라우징을 확장하여 데이터베이스로 수많은 XML문서의 링크 정부들을 저장하고 저장된 링크 정보들에 대해 사용자들이 검색할 수 있는 시스템을 개발하고자 한다. 이를 위해 링크 정보를 지워할 수 있는 XML 문서에 대한 데이터 모델을 제시하고 이러한 데이터 모델로 지원할 수 있는 질의어들을 설계하였다.

A Study of Knowledge Based Agent System for Web New-Document Retrieval (지식기반 방식을 이용한 웹 뉴스문서 검색 에이전트 시스템 연구)

  • 이성열;백혜정;박영택
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.102-104
    • /
    • 2000
  • 현재 인터넷상의 정보와 문서의 양은 상상을 초월하는 증가추이를 나타내고 있다. 이와 더불어 표현하려는 목적에 따라 체계적으로 정리되고 정형화된 문서들 또한 증가하고 있다. 이러한 문서들 중에는 각 인터넷 신문사나 웹진과 같은 문서들이 포함되는데, 이러한 문서들은 각각의 내용구성과 표현 형식에 있어서 비슷한 구성을 지니고 있다. 본 논문에서는 이러한 체계적이고 정형화된 웹 뉴스 문서검색을 위하여 '지식기반 방식을 이용한 웹 뉴스문서 검색 에이전트 시스템'을 제안한다. 사용자는 시스템에서 제공하는 지식을 기반으로 검색하고자 하는 대상을 에이전트 시스템에게 요청하게 되고 지식기반을 이용한 에이전트 시스템은 보다 정확한 정보를 사용자에게 제공하게 된다.

  • PDF

Efficient Indexing Technique for Retrieval of an XML Document and Design of Query Language (TQL) (XML 문서의 검색을 위한 효율적인 색인 기법과 질의 언어(TQL)의 설계)

  • 이계준;신동욱;권택근
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10a
    • /
    • pp.57-59
    • /
    • 1999
  • 현재 WWW(World Wide Web), 사무 자동화 시스템(Office Information System), 전자 도서관(Digital Library) 등의 빠른 발전으로 인하여 정보가 기하급수적으로 증가하였다. 이러한 방대한 양의 정보를 처리하기 위하여 많은 인터넷 기반의 문서 표준들이 출현하였고, 대표적으로 XML(eXtensible Markup Language)이 차세대 인터넷 전자 문서의 표준으로 많은 곳에 응용되고 있다. 이에 따라 XML 문서의 정보들을 효율적이고 정확하게 저장하고 이용, 검색 할 수 있는 기능을 요구되어졌다. 현재 대부분의 연구들은 XML 문서에 대한 구조적인 정보만을 저장하고 검색하는 기능만을 지원 할 뿐 검색된 결과에 대한 재사용이나 재구성에 대한 기능의 제공은 미흡한 실정이다. 본 논문에서는 현재 검색기들이 제공하는 XML 문서에 대한 구조적인 검색 기능을 확장하여 XML 문서를 보다 효율적으로 검색하기 위하여 새로운 색인 기법을 제안하고, 데이터베이스 내에 저장된 XML문서에 대해 구조적인 검색과 이것을 바탕으로 문서를 재구성하고 재사용하는 기능을 수행할 수 있도록 새로운 질의어(TQL)을 설계하였다.

  • PDF

An EFASIT model considering the emotion criteria in Knowledge Monitoring System (지식모니터링시스템에서 감성기준을 고려한 EFASIT 모델)

  • Ryu, Kyung-Hyun;Pi, Su-Young
    • Journal of Internet Computing and Services
    • /
    • v.12 no.4
    • /
    • pp.107-117
    • /
    • 2011
  • The appearance of Web has brought an substantial revolution to all fields of society such knowledge management and business transaction as well as traditional information retrieval. In this paper, we propose an EFASIT(Extended Fuzzy AHP and SImilarity Technology) model considering the emotion analysis. And we combine the Extended Fuzzy AHP Method(EFAM) with SImilarity Technology(SIT) based on the domain corpus information in order to efficiently retrieve the document on the Web. The proposed the EFASIT model can generate the more definite rule according to integration of fuzzy knowledge of various decision-maker, and can give a help to decision-making, and confirms through the experiment.

User Profile based Personalized Web Agent (사용자 프로파일 기반 개인 웹 에이전트)

  • So, Young-Jun;Park, Young-Tack
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.3
    • /
    • pp.248-256
    • /
    • 2000
  • This paper presents a personalized web agent that constructs user profile which consists of user preferences on the web and recommends his/her relevant information to the user. The personalized web agent consists of monitor agent, user profile construction agent, and user profile refinement agent. The monitor agent makes a user describe his/her preferences directly and it creates the database of preference document, finally performs several keyword extraction to increase the accuracy of the DB. The user profile construction agent transforms the extracted keywords into user profile that could be confirmed and edited by the user. and the refinement agent refines user profile by recursively learning and processing user feedback. In this paper, we describe the several keyword weighting and inductive learning techniques in detail. Finally, we describe the adaptive web retrieval and push agent that perform adaptive services to the user.

  • PDF

An XML Tag Indexing Method Using on Lexical Similarity (XML 태그를 분류에 따른 가중치 결정)

  • Jeong, Hye-Jin;Kim, Yong-Sung
    • The KIPS Transactions:PartB
    • /
    • v.16B no.1
    • /
    • pp.71-78
    • /
    • 2009
  • For more effective index extraction and index weight determination, studies of extracting indices are carried out by using document content as well as structure. However, most of studies are concentrating in calculating the importance of context rather than that of XML tag. These conventional studies determine its importance from the aspect of common sense rather than verifying that through an objective experiment. This paper, for the automatic indexing by using the tag information of XML document that has taken its place as the standard for web document management, classifies major tags of constructing a paper according to its importance and calculates the term weight extracted from the tag of low weight. By using the weight obtained, this paper proposes a method of calculating the final weight while updating the term weight extracted from the tag of high weight. In order to determine more objective weight, this paper tests the tag that user considers as important and reflects it in calculating the weight by classifying its importance according to the result. Then by comparing with the search performance while using the index weight calculated by applying a method of determining existing tag importance, it verifies effectiveness of the index weight calculated by applying the method proposed in this paper.

Relevance Feedback Agent for Improving Precision in Korean Web Information Retrieval System (한국어 웹 정보검색 시스템의 정확도 향상을 위한 연관 피드백 에이전트)

  • Baek, Jun-Ho;Choe, Jun-Hyeok;Lee, Jeong-Hyeon
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.7
    • /
    • pp.1832-1840
    • /
    • 1999
  • Since the existed Korean Web IR systems generally use boolean system, it is difficult to retrieve the information to be wanted at one time. Also, because of the feature that web documents have the frequent abbreviation and many links, the keyword extraction using the inverted document frequency extracts the improper keywords for adding ambiguous meaning problem. Therefore, users must repeat the modification of the queries until they get the proper information. In this paper, we design and implement the relevance feedback agent system for resolving the above problems. The relevance feedback agent system extracts the proper information in response to user's preferred keywords and stores these keywords in preference DB table. When users retrieve this information later, the relevance feedback agent system will search it adding relevant keywords to user's queries. As a result of this method, the system can reduce the number of modification of user's queries and improve the efficiency of the IR system.

  • PDF

Query Expansion and Term Weighting Method for Document Filtering (문서필터링을 위한 질의어 확장과 가중치 부여 기법)

  • Shin, Seung-Eun;Kang, Yu-Hwan;Oh, Hyo-Jung;Jang, Myung-Gil;Park, Sang-Kyu;Lee, Jae-Sung;Seo, Young-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.10B no.7
    • /
    • pp.743-750
    • /
    • 2003
  • In this paper, we propose a query expansion and weighting method for document filtering to increase precision of the result of Web search engines. Query expansion for document filtering uses ConceptNet, encyclopedia and documents of 10% high similarity. Term weighting method is used for calculation of query-documents similarity. In the first step, we expand an initial query into the first expanded query using ConceptNet and encyclopedia. And then we weight the first expanded query and calculate the first expanded query-documents similarity. Next, we create the second expanded query using documents of top 10% high similarity and calculate the second expanded query- documents similarity. We combine two similarities from the first and the second step. And then we re-rank the documents according to the combined similarities and filter off non-relevant documents with the lower similarity than the threshold. Our experiments showed that our document filtering method results in a notable improvement in the retrieval effectiveness when measured using both precision-recall and F-Measure.

Design and Implementation of Web-based Problem Management System for CT Radiological Technologist Education (CT 전문방사선사 교육을 위한 웹기반 문항관리 시스템의 설계 및 구현)

  • Shin Yong-Won;Koo Bong-Oh;Shim Choon-Bo
    • The Journal of the Korea Contents Association
    • /
    • v.5 no.1
    • /
    • pp.27-35
    • /
    • 2005
  • Recently, despite of the rapid progress of information technology in the medical and health fields, the development and management of problem sets about medical and education contents related with radiological technologist has been still achieved by manual and offline method using document editor. In this study, the unique web-based problem management system is designed and implemented. That system can efficiently manage and present various kind of problem set about integrated education and personal license without time and space limitations in order to improve the efficiency of supplementary training and to obtain the professional license for CT radiological technologist. The proposed system is composed of administration module and user module. The former supports several functions such as problem creation, problem categorization, user management, and adjustment of leveled assessment. On the other hand, the latter functions examination applying , problem retrieval, personal score retrieval, and interpretation viewing, and so on. In addition, our system is expected as a useful and practical system which provides problem interpretation and analysis of score results after applying for the examination. It can elevate ability of learning and information interchange among them preparing for CT professional radiological technologist licensing examination

  • PDF

Semantic Search System using Ontology-based Inference (온톨로지기반 추론을 이용한 시맨틱 검색 시스템)

  • Ha Sang-Bum;Park Yong-Tack
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.3
    • /
    • pp.202-214
    • /
    • 2005
  • The semantic web is the web paradigm that represents not general link of documents but semantics and relation of document. In addition it enables software agents to understand semantics of documents. We propose a semantic search based on inference with ontologies, which has the following characteristics. First, our search engine enables retrieval using explicit ontologies to reason though a search keyword is different from that of documents. Second, although the concept of two ontologies does not match exactly, can be found out similar results from a rule based translator and ontological reasoning. Third, our approach enables search engine to increase accuracy and precision by using explicit ontologies to reason about meanings of documents rather than guessing meanings of documents just by keyword. Fourth, domain ontology enables users to use more detailed queries based on ontology-based automated query generator that has search area and accuracy similar to NLP. Fifth, it enables agents to do automated search not only documents with keyword but also user-preferable information and knowledge from ontologies. It can perform search more accurately than current retrieval systems which use query to databases or keyword matching. We demonstrate our system, which use ontologies and inference based on explicit ontologies, can perform better than keyword matching approach .