• Title/Summary/Keyword: Searching Thesaurus


A Study on Information Retrieval Techniques of VOCED Database (직업교육 데이터베이스 VOCED의 검색기법 연구)

  • Kim, Soon-Won
    • Journal of Information Management
    • /
    • v.27 no.1
    • /
    • pp.40-65
    • /
    • 1996
  • This study reviews the information retrieval techniques of the VOCED database. The VOCED database contains internationally relevant information on vocational and adult education, training and related subjects. The software used is CDS/ISIS, and the records are indexed using the APSDEP Thesaurus. Various search techniques can be used when searching the VOCED database: multiple-word, phrase, Boolean logic, term truncation, defined-field, and proximity searching, or any combination of them, make it possible to find relevant records in seconds.

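The search techniques named in the abstract above (term truncation, Boolean logic, proximity) can be illustrated with a minimal sketch. This is not CDS/ISIS syntax and not the VOCED implementation; the sample records, function names, and the `max_gap` parameter are all hypothetical and exist only for illustration.

```python
import re

# Hypothetical sample records; not taken from VOCED.
RECORDS = {
    1: "vocational education and training for adults",
    2: "adult literacy programmes",
    3: "training of vocational teachers",
}

def tokens(text):
    """Split a record into lowercase word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def match_truncated(stem, text):
    """Term truncation: true if any token begins with the given stem."""
    return any(tok.startswith(stem) for tok in tokens(text))

def match_proximity(a, b, text, max_gap):
    """Proximity: true if a and b occur within max_gap token positions."""
    toks = tokens(text)
    pos_a = [i for i, t in enumerate(toks) if t == a]
    pos_b = [i for i, t in enumerate(toks) if t == b]
    return any(abs(i - j) <= max_gap for i in pos_a for j in pos_b)

def search(predicate):
    """Return the sorted ids of all records satisfying the predicate."""
    return sorted(rid for rid, text in RECORDS.items() if predicate(text))

# Boolean AND of a truncated term and an exact term.
hits_and = search(lambda t: match_truncated("adult", t) and "training" in tokens(t))

# Proximity: "vocational" within three tokens of "training".
hits_near = search(lambda t: match_proximity("vocational", "training", t, 3))
```

Combining predicates with `and`, `or`, and `not` gives the mixed queries the abstract mentions; a production system would evaluate them against an inverted index rather than scanning every record.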

A Study on the classification scheme for the design of Directory Search Engine on the web (web 데이터베이스의 디렉토리 설계를 위한 분류체계 연구)

  • 이명희
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.10 no.1
    • /
    • pp.243-268
    • /
    • 1999
  • The purpose of this study is to develop a classification scheme for a subject-based directory search engine for educational research information on the web. Five classification systems (Yahoo Korea, Argus Clearinghouse, DDC, the ERIC thesaurus and the KEDI thesaurus) were evaluated in terms of coverage of subject fields, system logic, accuracy of terminology and efficiency of searching. In designing the classification scheme, the study considered the content of subject areas, the features of information resources and efficiency for users. Finally, a classification scheme for educational research information was established with 16 main divisions and 47 sub-divisions.


A Study on the Thesaurus-based Ontology System for the Semantic Web (시소러스를 기반으로 한 온톨로지 시스템 구현에 관한 연구)

  • Jeong, Do-Heon;Kim, Tae-Su
    • Journal of the Korean Society for information Management
    • /
    • v.20 no.3
    • /
    • pp.155-175
    • /
    • 2003
  • The purpose of the study was to construct an ontology system for the semantic web environment by using an ontology schema derived from the faceted Art and Architecture Thesaurus (AAT). The ontology schema is based on the Web Ontology Language (OWL), which is widely considered the standard ontology language for the W3C-centered semantic web environment. The concepts were limited to terms within AAT's Furniture facet, and the system was tested using the Chair concept, a lower-level facet that has diverse conceptual relationships and a broad vocabulary base. The ontology system can search for concepts while controlling the search results by always providing a 'Preferred term' for synonymous terms. In addition, the system provides the user with, first, the relationships between terms centered on the query and, second, related terms along with their classification properties. The system is also presented as an application example of an ontology system that builds an information system which takes in an Instance value and reproduces it as an RDF file. During this process, multiple ontologies were used, together with the metadata elements of the stored Instance value.

A Study on Developing Facets for Subject Headings in Korea (한국 주제명 표목의 패싯 유형 개발에 관한 연구)

  • Choi, Yoon Kyung;Chung, Yeon-Kyoung
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.4
    • /
    • pp.179-201
    • /
    • 2015
  • The subject heading is an elaborate access tool for subject browsing and searching in the information retrieval environment. The purpose of this study is to suggest facets applicable to subject headings in Korea. First, the concepts of subject and the definitions of facets were investigated in the literature review. Second, six cases, including OCLC's FAST, PRECIS, "Thesaurus construction and use", CC 7th edition, BC 2nd edition, and UDC 3rd edition, were analyzed as case studies focusing on the configuration of facets. Based on the results, twenty-two facets were proposed. Eleven are top facets: Topical, Event, Geography, Chronology, Personal and Corporate Name, Title, Form, Genre, Language, and Person. The remaining eleven are sub-facets of the Topical facet: Topical-Thing/Entity, Topical-Action/Status, Part, Kind, Property, Whole, Material, Patient, Product, By-Product and Agent.

On the Characteristics and Information Retrieval Performance of Full-Text Databases (전문데이터베이스의 특성과 정보검색성능)

  • Cho Myung-Hi
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.17
    • /
    • pp.339-366
    • /
    • 1989
  • The appearance of online full text is the most encouraging phenomenon in the development of databases. Today's full-text databases are a by-product of the electronic publication of printed materials. There are now also some movements, although not strong ones, toward the electronic production of documents in Korea. The present study examines the characteristics of, and effective retrieval methods for, the full-text databases now commercially available through various vendors. The outline of this paper is as follows. First, the background and present situation of existing full-text database services, nationally and worldwide, are examined. Second, the free-text searching system of full-text databases is compared with the controlled vocabulary system. The factors influencing free-text retrieval performance, the searching thesaurus, and the hybrid or compromise system, which uses a limited controlled vocabulary in conjunction with natural language for the enrichment needed in practical operation of the system, are examined. Third, user demands are identified through an analysis of preceding studies on various types of full-text databases. Fourth, the application of CD-ROM full-text databases in libraries and information centers is examined as a prospective resource for them. Finally, some problems and prospects of full-text databases are presented.


Design and Implementation of Efficient Storage System for Storing and Searching Thesaurus Data (시소러스 데이터의 저장과 검색을 위한 효율적인 저장 시스템의 설계 및 구현)

  • 김점숙;안동언;정성종
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2000.06a
    • /
    • pp.205-209
    • /
    • 2000
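  • This paper implements a thesaurus storage structure that allows a thesaurus to be built and used efficiently. An information retrieval system uses a thesaurus, a kind of term dictionary consisting of terms and the set of relationships among them, to convert the user's query terms into accurate, controlled term forms for indexing and retrieving documents, which improves the efficiency of indexing and retrieval. Thesaurus lookup is slow when the thesaurus is stored in a database; using a list structure based on a hash function can be expected to increase overall thesaurus search speed. In addition, porting a thesaurus held in database form currently requires a database system at the destination, so a structure that can be loaded into memory can contribute to wider distribution of the thesaurus. The paper compares and analyzes the database-stored thesaurus structure and the hash-function-based list structure, and proposes a more efficient role and structural form for the thesaurus.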

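The hash-based list structure described in the abstract above can be sketched roughly as follows. This is only an illustrative assumption, not the authors' implementation: the class name `Thesaurus`, its methods, the relation labels (`USE`, `BT`, `NT`), and the sample terms are all hypothetical. Python's built-in `dict` serves as the hash table, and each term hashes to lists of related terms.

```python
from collections import defaultdict

class Thesaurus:
    """In-memory thesaurus: a hash table mapping terms to lists of related terms."""

    def __init__(self):
        # term -> relation label -> list of related terms
        self._relations = defaultdict(lambda: defaultdict(list))

    def add(self, term, relation, related):
        """Record that `term` stands in `relation` to `related`."""
        self._relations[term][relation].append(related)

    def lookup(self, term, relation):
        """Return the related terms, or an empty list if none are stored."""
        entry = self._relations.get(term)
        if entry is None:
            return []
        return list(entry.get(relation, []))

    def preferred(self, term):
        """Map a query term to its controlled form via a USE relation, if any."""
        use = self.lookup(term, "USE")
        return use[0] if use else term

# Hypothetical sample data.
thesaurus = Thesaurus()
thesaurus.add("car", "USE", "automobile")
thesaurus.add("automobile", "BT", "vehicle")
thesaurus.add("automobile", "NT", "sedan")
```

Each lookup is a hash probe followed by a list read, so no database system is needed at query time, which matches the portability argument in the abstract. Persisting the structure to disk is left out of this sketch.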

Design and Implementation of Efficient Storage System for Storing and Searching Thesaurus Data (시소러스 데이터의 저장과 검색을 위한 효율적인 저장 시스템의 설계 및 구현)

  • Kim, Jum-Suk;An, Dong-Un;Jong, Sung-Chung
    • Annual Conference on Human and Language Technology
    • /
    • 2000.10d
    • /
    • pp.205-209
    • /
    • 2000
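  • This paper implements a thesaurus storage structure that allows a thesaurus to be built and used efficiently. An information retrieval system uses a thesaurus, a kind of term dictionary consisting of terms and the set of relationships among them, to convert the user's query terms into accurate, controlled term forms for indexing and retrieving documents, which improves the efficiency of indexing and retrieval. Thesaurus lookup is slow when the thesaurus is stored in a database; using a list structure based on a hash function can be expected to increase overall thesaurus search speed. In addition, porting a thesaurus held in database form currently requires a database system at the destination, so a structure that can be loaded into memory can contribute to wider distribution of the thesaurus. The paper compares and analyzes the database-stored thesaurus structure and the hash-function-based list structure, and proposes a more efficient role and structural form for the thesaurus.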


A Study on the Model of Internet Public Library in Korea (IPL-Korea) (인터넷 공공도서관 구축 모형 연구)

  • 고영만;오삼균
    • Journal of the Korean Society for information Management
    • /
    • v.16 no.4
    • /
    • pp.109-123
    • /
    • 1999
  • We face a paradox in the age of information: finding quality information on the Internet becomes more challenging because of information overload. This paper describes the prototype for the “IPL-Korea” (Internet Public Library in Korea) project, an attempt to provide the public with quality information in the form of a metadata system. The system involves cataloging resources, i.e. websites, that are filtered by library and information science majors as well as information professionals. The system focuses on children, youth, women, and seniors as users; various classification schemes and resource descriptions relevant to each user group are incorporated into the system to allow efficient browsing of the resources. A thesaurus for “IPL-Korea”, based on the ERIC thesaurus, is being constructed for easy manipulation of the breadth of searching. The “IPL-Korea” metadata system employs the entity-relationship model in the design of its conceptual schema. Metadata is stored in an Oracle database system, and Web interfaces to this database are provided through ASP, ColdFusion, and Java technology.


A Study on the Information Searching Behavior of MEDLINE Retrieval in Medical Librarian (의학전문사서의 정보이용행위에 관한 연구)

  • Lee Jin-Young;Jeong Sang-Kyung
    • Journal of Korean Library and Information Science Society
    • /
    • v.30 no.2
    • /
    • pp.123-153
    • /
    • 1999
  • Building on earlier studies of search behavior with CD-ROM databases, this article seeks ways for searchers who use online MEDLINE in medical libraries to use the data more efficiently. We surveyed librarians in 60 medical libraries by questionnaire and reviewed the literature and current practice of data use to examine MEDLINE search behavior in medical libraries. The results are as follows: 1) The medical data system rate for single users was 53% and the one for multiple users 43%. As for weekly search time, under two hours accounted for 75%, between 3 and 8 hours for 18.3%, and over 9 hours for 6.7%. 2) The factors that improve search results are (1) sufficient discussion and interviewing between librarians and users, and (2) the use of correct indexing terms, the thesaurus, and keywords. In principle users should search directly; however, librarians searched on their behalf where search time was under two hours a week (75%). 3) As for search fees, 91% were free and 9% were charged. Search effectiveness was also enhanced by means of inter-library loan services and information networks. 4) The medical librarians responded that they need applied training in professional knowledge, medical terminology (thesaurus) and electronic media, as well as computer training, interview techniques and continuing education, in order to provide satisfactory service. 5) As for satisfaction with MEDLINE use, respondents reported 44.6% for economy, 38.2% for convenience in the time required, and 58.9% for users' search satisfaction. 6) The use of the MEDLINE system enhanced the medical libraries' image and had an effect on users' satisfaction with data use and searching, on data activities, and on research achievement. 7) In the past MeSH was used, but over time CD-ROM MEDLINE searching came to be preferred to online searching.


ISAAC : An Integrated System with User Interface for Sentence Analysis (ISAAC :문장분석용 통합시스템 및 사용자 인터페이스)

  • Kim, Gon;Kim, Min-Chan;Bae, Jae-Hak;Lee, Jong-Hyuk
    • The KIPS Transactions:PartB
    • /
    • v.11B no.1
    • /
    • pp.107-116
    • /
    • 2004
  • This paper introduces ISAAC (An Interface for Sentence Analysis & Abstraction with Cogitation), which provides an integrated user interface for sentence analysis. ISAAC integrates the various linguistic tools and resources needed for sentence analysis. Most such tools and resources have been developed and accumulated independently, so when analyzing sentences with them it is difficult for the analyst to manage and control the information produced at each step. We have therefore integrated the usable tools and resources and built ISAAC to provide a consistent, user-oriented interface to each function. We were able to divide the sentence analysis process into 14 steps. In ISAAC, these steps are handled by four individual modules: (1) syntactic analysis of the sentence, (2) retrieval of a root word, (3) searching category information in Roget's Thesaurus, and (4) searching category information in OfN (Ontology for Narratives). With ISAAC, the 14-step process is thus reduced to 4 steps, which means the analyst's performance can improve by a factor of 3.5 or more. Furthermore, because ISAAC takes over the tedious transcription needed at each step, we expect that it can help the analyst maintain the accuracy of sentence analysis.