• Title/Summary/Keyword: 시소 시스템

Search Result 120, Processing Time 0.023 seconds

A Knowledge Based Thesaurus for Intelligent Information Retrieval (지능형 정보검색을 위한 지식 기반 시소러스)

  • 정정호;김민구
    • Proceedings of the Korean Information Science Society Conference / 1998.10c / pp.12-14 / 1998
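  • Existing information retrieval systems that use a thesaurus as their knowledge structure fail to give users satisfactory search results. This is because the thesaurus structure these systems use differs from the human knowledge structure, and the way they search with the thesaurus differs from the way people search. To develop an intelligent information retrieval system that models how a human expert in a field finds the information needed by a layperson with no expertise in that field, this paper designs a thesaurus structure that imitates a human expert's knowledge structure and devises a search method that imitates the expert's search method. The designed thesaurus structure includes the various kinds of relations represented in a human expert's knowledge structure, and the devised search method evaluates relevance by considering both the inferred type of relation between the user's query term and the expanded index term and the distance level between them.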


Facet Query Expansion with an Object-Based Thesaurus in Reusable Component Retrieval Systems (재사용 부품 검색 시스템에서 객체기반 시소러스를 이용한 패싯 질의의 확장)

  • Choi, Jae-Hun;Kim, Ki-Heon;Yang, Jae-Dong;Lee, Dong-Gil
    • Journal of KIISE: Software and Applications / v.27 no.2 / pp.168-179 / 2000
  • In reusable component retrieval systems with facet-based schemes, facet queries are generally used to represent the characteristics of the components a user wants. This paper proposes an expanded facet query equipped with an object-based thesaurus to formulate users' intents precisely. To evaluate the query, a component retrieval system is also designed and implemented. To retrieve components exactly, a user's query should include relevant facet values that fully specify their characteristics. However, because conventional queries simply list the facet values entered directly by users, they fail to represent users' intents precisely. Our query, called the expanded facet query, employs fuzzy boolean operators and an object-based thesaurus; the former logically express the fuzzy connectives between facet queries and required components, whereas the latter helps users select appropriate facet values for the query. A thesaurus query is also provided to recommend relevant facet values, with their fuzzy degrees, from the thesaurus. Furthermore, our retrieval system can automatically formulate queries with the recommended facet values, if necessary.


Design and Implementation of Thesaurus System for Geological Terms (지질용어 시소러스 시스템의 설계 및 구축)

  • Hwang, Jaehong;Chi, KwangHoon;Han, JongGyu;Yeon, Young Kwang;Ryu, Keun Ho
    • Journal of the Korean Association of Geographic Information Studies / v.10 no.2 / pp.23-35 / 2007
  • With the development of semantic web technologies in the information retrieval area, the need for thesauri has recently been increasing, along with that for internet lexicons. A thesaurus combines a classification with a lexicon. It is a topic map of a knowledge structure that expresses relations among the concepts (terms) involved in human knowledge activities such as learning and research, using formally organized and controlled index terms to clarify the context of superordinate and subordinate concepts. However, although thesauri are regarded as essential tools for controlling and standardizing terms and for searching and processing information efficiently, there is no Korean thesaurus for geology. Building a thesaurus requires standardized and well-defined guidelines. Standardized guidelines enable efficient information management and help users find correct information easily and conveniently. The present study aimed to build a thesaurus system for the terms used in geology. First, we surveyed related work on standardizing geological terms in Korea and other countries. Second, we defined geological topics in 15 areas and prepared a draft classification system for each topic. Third, based on the geological thesaurus classification system, we created the specification of the geological thesaurus. Lastly, we designed and implemented an internet-based geological thesaurus system using the specification.


A Fuzzy Retrieval System to Facilitate Associated Learning in Problem Banks (문제 은행에서 연상학습을 지원하는 퍼지 검색 시스템)

  • Choi, Jae-hun;Kim, ji-Suk;Cho, Gi-Hwan
    • Journal of KIISE: Software and Applications / v.29 no.4 / pp.278-288 / 2002
  • This paper presents the design and implementation of a fuzzy retrieval system that supports associated learning in problem banks. It retrieves problems that are conceptually related to the semantics described by the user's queries. In particular, the problem retrieval system employs a fuzzy thesaurus that represents relationships between domain-dependent vocabulary terms as fuzzy degrees. The system is built around the characteristics of associated learning, which calls for high recall and acceptable precision. That is, since the thesaurus resolves the vocabulary mismatch between query terms and document index terms, the retrieval system can effectively support the user's associated learning. Finally, we evaluate whether the fuzzy retrieval system is appropriate for associated learning in terms of its precision and recall.
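
The abstract above does not give the system's data structures or scoring rule. As a minimal sketch only, assuming the fuzzy thesaurus is a mapping from a term to related terms with degrees in [0, 1], query expansion and problem scoring might look like the following. The names `expand_query` and `score`, the `threshold` parameter, and the max-degree scoring rule are all hypothetical illustrations, not the authors' method.

```python
from typing import Dict, List, Set

# Hypothetical fuzzy thesaurus: term -> {related term: fuzzy degree in [0, 1]}.
FuzzyThesaurus = Dict[str, Dict[str, float]]


def expand_query(terms: List[str], thesaurus: FuzzyThesaurus,
                 threshold: float = 0.5) -> Dict[str, float]:
    """Return the query terms (degree 1.0) plus related terms at or above threshold."""
    expanded: Dict[str, float] = {}
    for term in terms:
        for related, degree in thesaurus.get(term, {}).items():
            if degree >= threshold:
                # Keep the strongest degree if a term is reachable more than once.
                expanded[related] = max(expanded.get(related, 0.0), degree)
    for term in terms:
        # Original query terms always match with full degree.
        expanded[term] = 1.0
    return expanded


def score(index_terms: Set[str], expanded: Dict[str, float]) -> float:
    """Score a problem by the strongest expanded term among its index terms."""
    return max((deg for term, deg in expanded.items() if term in index_terms),
               default=0.0)
```

Lowering the assumed `threshold` admits weaker associations, which trades precision for recall, the balance the abstract says associated learning depends on.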

Automated Keyword Extraction using Category Correlation of Data (데이터의 카테고리 연관성을 이용한 색인어 자동 추출)

  • Woo, Young-Ho;Hur, Tae-Sung;Her, Woong;Park, Young-Bae;Min, Hong-Ki
    • Proceedings of the Korea Institute of Convergence Signal Processing / 2005.11a / pp.242-245 / 2005
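  • This paper extracts index-term candidates using a thesaurus that stores, by category, the data that can appear in a specific domain. It then proposes an automatic index-term extraction system that applies an association importance, which takes into account the correlations among the categories of the data, to improve the accuracy of the detected index terms. The proposed system performed 47% better than a method based on occurrence frequency and 18% better than a method using a thesaurus.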


Design and Implementation of The Windows Thesaurus WTPM using Filename of Semantics Clustering (파일명의 의미 클러스터링에 의한 윈도우 시소러스 WTPM 설계와 구현)

  • Kim, Man-pil;Tcha, Hong-jun
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.2 no.1 / pp.73-79 / 2009
  • This paper analyzes the semantics of files recorded in the user's file system, using C++, an object-oriented language that supports modular programming. Based on that analysis, it designs a scheme that clusters files by the semantics of their filenames with a thesaurus, for user convenience. WTPM groups user-written files into clusters using the thesaurus's semantic structure and reserved words. The WTPM process is designed around a mashup structure for displaying file icons and is implemented with an automated classification algorithm.


Concept-based Search Engine System Using MeSH (MeSH를 이용한 개념 기반 검색 엔진 시스템)

  • 고삼일;박사준;황수철;김기태
    • Proceedings of the Korean Information Science Society Conference / 2003.04c / pp.383-385 / 2003
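  • This paper uses MeSH to improve the retrieval accuracy of a concept-based search engine system. MeSH, short for Medical Subject Headings, is a coded set of subject headings designed to support searching of MEDLINE articles; it is used here as the thesaurus for a concept graph to ensure the accuracy of concept extraction, the most important part of the concept graph. Synonyms related to each concept were extracted using the Term field of the Descriptor Data of the 2003 MeSH. Comparing the retrieval results of a concept graph built from the extracted synonyms with those of a concept graph built from within-document word frequencies confirmed that the graph built with MeSH as its thesaurus gave much more accurate results.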


Reusable Component Retrieval System using Thesaurus (시소러스를 이용한 재사용 컴포넌트 검색 시스템)

  • 김귀정
    • Proceedings of the Korea Contents Association Conference / 2003.05a / pp.368-371 / 2003
  • This paper constructs a component retrieval system to support component reuse. A thesaurus is built from the inheritance relations between classes and used for component retrieval, so that components can be retrieved by query. Results are shown in priority order, which speeds up retrieval for a query. Retrieved components come with their source code, component information, class diagrams, and so on, which makes efficient component reuse possible.


Document ranking methods using term dependencies from a thesaurus (시소러스의 연관성 정보를 이용한 문서의 순위 결정 방법)

  • 이준호
    • Journal of the Korean Society for Information Management / v.10 no.2 / pp.3-22 / 1993
  • In recent years various document ranking methods such as Relevance, R-Distance and K-Distance have been developed which can be used in thesaurus-based boolean retrieval systems. They give high quality document rankings in many cases by using term dependence information from a thesaurus. However, they suffer from several problems resulting from inefficient and ineffective evaluation of the boolean operators AND, OR and NOT. In this paper we propose new thesaurus-based document ranking methods called KB-FSM and KB-EBM by exploiting the enhanced fuzzy set model and the extended boolean model. The proposed methods overcome the problems of the previous methods and use term dependencies from a thesaurus effectively. We also show through performance comparison that KB-FSM and KB-EBM provide higher retrieval effectiveness than Relevance, R-Distance and K-Distance.

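
The abstract above names the extended boolean model but gives no formulas. As a rough illustration only, the standard p-norm form of that model, with unweighted query terms, can be sketched as below. This is not KB-EBM itself: the function names are hypothetical, and the thesaurus-based term dependencies the paper adds are not modeled here.

```python
from typing import Sequence


def _check(weights: Sequence[float]) -> None:
    """Reject empty operand lists, for which the p-norm average is undefined."""
    if not weights:
        raise ValueError("at least one term weight is required")


def pnorm_or(weights: Sequence[float], p: float = 2.0) -> float:
    """p-norm OR: normalized distance from the all-zero point (worst case for OR)."""
    _check(weights)
    return (sum(w ** p for w in weights) / len(weights)) ** (1.0 / p)


def pnorm_and(weights: Sequence[float], p: float = 2.0) -> float:
    """p-norm AND: one minus normalized distance from the all-one point (the ideal)."""
    _check(weights)
    return 1.0 - (sum((1.0 - w) ** p for w in weights) / len(weights)) ** (1.0 / p)


def pnorm_not(weight: float) -> float:
    """NOT is the complement of the document's term weight."""
    return 1.0 - weight
```

Here each weight is a document's term weight in [0, 1]. With `p = 1` both operators reduce to the plain average of the weights, and as `p` grows OR approaches the maximum and AND approaches the minimum, which is the strict fuzzy-set behavior.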

A Study of Designing the Han-Guel Thesaurus Browser for Automatic Information Retrieval (자동정보검색을 위한 한글 시소러스 브라우저 구축에 관한 연구)

  • Seo, Whee
    • Journal of Korean Library and Information Science Society / v.31 no.2 / pp.279-302 / 2000
  • This study develops a new automatic system for a Korean thesaurus browser that automatically controls all the steps of query searching: representation, generation, extension, construction of the search strategy, and feedback searching. The system is programmed in Delphi 4.0 (Pascal) and consists of a database system, automatic indexing, a clustering technique, thesaurus construction and display, and an automatic information retrieval technique. The results obtained with this system are as follows: 1) With the new automatic thesaurus browser, built on the new algorithm, we can perform information retrieval, automatic indexing, clustering, thesaurus construction and display, and retrieval feedback. Thus even a beginner can easily access the specialized terms of a specific subject field. 2) The thesaurus browser in this paper has the merits of being easy to build and convenient to use, and of giving good information retrieval results in terms of the rate of speed, degree, and regeneration. Thus, it turns out to be very pragmatic.
