• Title/Summary/Keyword: Ontology Extraction

Search Result 76, Processing Time 0.029 seconds

Method for improving search efficiency using relation of anatomical structure from Donguibogam(東醫寶鑑) ("동의보감"에 기재된 인체 용어 관계를 이용한 검색효율성 향상 방법)

  • Song, In-Woo;Lee, Byung-Wook
    • Journal of Korean Medical classics
    • /
    • v.25 no.4
    • /
    • pp.105-113
    • /
    • 2012
  • Objectives : Acquiring information from symptoms is one of the important method to gain clinically available information in korean medicine. Therefore, up to now, study of symptom terms was frequently implemented in promotion of various information project. In data extraction methods using symptom information from DB, information search using synonym and method using ontology is studied and utilized. However, considering concept of symptom has essential information of appeared body area and phenomenon we think that extending synonym and ontology relationship in symptom terms can be useful for search and set to this study. Methods : We collect terms relevant to human body area and structure described in Donguibogam. Synonymous relationship between collected terms is organized. Relationship between collected terms is build to human-body-knowledge table which has form of Concept+Relation+Concept. Type of relationship is limited on a range of expressing content about parts of human body. Result & Conclusion : Search condition is generated automatically using relationship of the upper area in knowledge table contents. Information of next and previous acupuncture point, upper and lower acupuncture point, left and right acupuncture point can be searched using information of acupuncture point location, order, relative position in area, direction in knowledge table contents.

An Expresson of Domain Searching Term Weight using Fuzzy (퍼지를 이용한 도메인 검색용어 중요성의 표시)

  • Jin, Hyun-Soo;Hong, You-Sik
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.9 no.4
    • /
    • pp.139-144
    • /
    • 2009
  • The leveling of technical internet domain term with its aim to accumulate knowledge that machine can comprehend, which has been used widely in recent years. If stratify domain term weight, we believe that machine can manage and analyze in formation on its own using the ontology. In this paper, we propose an algorithm that allows us to extract properties of ontology weight from structured information already existing in web documents. In particular by stratification of the domain knowledge that is composed of property information, we were able to make the algorithm better and improve the quality of extraction results. In our experiments with 50 thousands targeted documents, we were able to extract property information with 94% confidence.

  • PDF

Incremental Enrichment of Ontologies through Feature-based Pattern Variations (자질별 관계 패턴의 다변화를 통한 온톨로지 확장)

  • Lee, Sheen-Mok;Chang, Du-Seong;Shin, Ji-Ae
    • The KIPS Transactions:PartB
    • /
    • v.15B no.4
    • /
    • pp.365-374
    • /
    • 2008
  • In this paper, we propose a model to enrich an ontology by incrementally extending the relations through variations of patterns. In order to generalize initial patterns, combinations of features are considered as candidate patterns. The candidate patterns are used to extract relations from Wikipedia, which are sorted out according to reliability based on corpus frequency. Selected patterns then are used to extract relations, while extracted relations are again used to extend the patterns of the relation. Through making variations of patterns in incremental enrichment process, the range of pattern selection is broaden and refined, which can increase coverage and accuracy of relations extracted. In the experiments with single-feature based pattern models, we observe that the features of lexical, headword, and hypernym provide reliable information, while POS and syntactic features provide general information that is useful for enrichment of relations. Based on observations on the feature types that are appropriate for each syntactic unit type, we propose a pattern model based on the composition of features as our ongoing work.

Framework for Information Integration and Customization Using Ontology and Case-based Reasoning (온톨로지 및 사례기반추론을 이용한 맞춤형 통합 정보 생성 프레임워크의 제안)

  • Lee, Hyun-Jung;Sohn, M-Ye
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.4
    • /
    • pp.141-158
    • /
    • 2009
  • The requirements of knowledge customization have increased as information resources have become more various and the numbers of the resources are increased. Even if the method for collecting the information has improved like Really Simple Syndication (RSS), information users are still struggling for extracting and customizing the required information through the Web. To reduce the burden, we offer the dynamic knowledge customization framework by using ontology-based CBR. The framework consisting of three phases is comprised of the conversion phase of web information as a machine-accessible case, the extraction phase to find a case appropriate for information users' requirements, and the case customization phase to create knowledge depending on information user's requirements. Newly, the dynamic and intensity-based similarity is adopted to support timely dynamic change of users' requirements. The framework has adopted to create traveler's knowledge to the level users wanted.

  • PDF

An Automatic Business Service Identification for Effective Relevant Information Retrieval of Defense Digital Archive (국방 디지털 아카이브의 효율적 연관정보 검색을 위한 자동화된 비즈니스 서비스 식별)

  • Byun, Young-Tae;Hwang, Sang-Kyu;Jung, Chan-Ki
    • Journal of the Korean Society for information Management
    • /
    • v.27 no.4
    • /
    • pp.33-47
    • /
    • 2010
  • The growth of IT technology and the popularity of network based information sharing increase the number of digital contents in military area. Thus, there arise issues of finding suitable public information with the growing number of long-term preservation of digital public information. According to the source of raw data and the time of compilation may be variable and there can be existed in many correlations about digital contents. The business service ontology makes knowledge explicit and allows for knowledge sharing among information provider and information consumer for public digital archive engaged in improving the searching ability of digital public information. The business service ontology is at the interface as a bridge between information provider and information consumer. However, according to the difficulty of semantic knowledge extraction for the business process analysis, it is hard to realize the automation of constructing business service ontology for mapping from unformed activities to a unit of business service. To solve the problem, we propose a new business service auto-acquisition method for the first step of constructing a business service ontology based on Enterprise Architecture.

Movie Recommended System base on Analysis for the User Review utilizing Ontology Visualization (온톨로지 시각화를 활용한 사용자 리뷰 분석 기반 영화 추천 시스템)

  • Mun, Seong Min;Kim, Gi Nam;Choi, Gyeong cheol;Lee, Kyung Won
    • Design Convergence Study
    • /
    • v.15 no.2
    • /
    • pp.347-368
    • /
    • 2016
  • Recently, researches for the word of mouth(WOM) imply that consumers use WOM informations of products in their purchase process. This study suggests methods using opinion mining and visualization to understand consumers' opinion of each goods and each markets. For this study we conduct research that includes developing domain ontology based on reviews confined to "movie" category because people who want to have watching movie refer other's movie reviews recently, and it is analyzed by opinion mining and visualization. It has differences comparing other researches as conducting attribution classification of evaluation factors and comprising verbal dictionary about evaluation factors when we conduct ontology process for analyzing. We want to prove through the result if research method will be valid. Results derived from this study can be largely divided into three. First, This research explains methods of developing domain ontology using keyword extraction and topic modeling. Second, We visualize reviews of each movie to understand overall audiences' opinion about specific movies. Third, We find clusters that consist of products which evaluated similar assessments in accordance with the evaluation results for the product. Case study of this research largely shows three clusters containing 130 movies that are used according to audiences'opinion.

Natural language processing techniques for bioinformatics

  • Tsujii, Jun-ichi
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2003.10a
    • /
    • pp.3-3
    • /
    • 2003
  • With biomedical literature expanding so rapidly, there is an urgent need to discover and organize knowledge extracted from texts. Although factual databases contain crucial information the overwhelming amount of new knowledge remains in textual form (e.g. MEDLINE). In addition, new terms are constantly coined as the relationships linking new genes, drugs, proteins etc. As the size of biomedical literature is expanding, more systems are applying a variety of methods to automate the process of knowledge acquisition and management. In my talk, I focus on the project, GENIA, of our group at the University of Tokyo, the objective of which is to construct an information extraction system of protein - protein interaction from abstracts of MEDLINE. The talk includes (1) Techniques we use fDr named entity recognition (1-a) SOHMM (Self-organized HMM) (1-b) Maximum Entropy Model (1-c) Lexicon-based Recognizer (2) Treatment of term variants and acronym finders (3) Event extraction using a full parser (4) Linguistic resources for text mining (GENIA corpus) (4-a) Semantic Tags (4-b) Structural Annotations (4-c) Co-reference tags (4-d) GENIA ontology I will also talk about possible extension of our work that links the findings of molecular biology with clinical findings, and claim that textual based or conceptual based biology would be a viable alternative to system biology that tends to emphasize the role of simulation models in bioinformatics.

  • PDF

Dynamic ontology construction algorithm from Wikipedia and its application toward real-time nation image analysis (국가이미지 분석을 위한 위키피디아 실시간 동적 온톨로지 구축 알고리즘 및 적용)

  • Lee, Youngwhan
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.979-991
    • /
    • 2016
  • Measuring nation images was a challenging task when employing offline surveys was the only option. It was not only prohibitively expensive, but too much time-consuming and therefore unfitted to this rapidly changing world. Although demands for monitoring real-time nation images were ever-increasing, an affordable and reliable solution to measure nation images has not been available up to this date. The researcher in this study developed a semi-automatic ontology construction algorithm, named "double-crossing double keyword collection (or DCDKC)" to measure nation images from Wikipedia in real-time. The ontology, WikiOnto, can be used to reflect dynamic image changes. In this study, an instance of WikiOnto was constructed by applying the algorithm to the big-three exporting countries in East Asia, Korea, Japan, and China. Then, the numbers of page views for words in the instance of WikiOnto were counted. A collection of the counting for each country was compared to each other to inspect the possibility to use for dynamic nation images. As for the conclusion, the result shows how the images of the three countries have changed for the period the study was performed. It confirms that DCDKC can very well be used for a real-time nation-image monitoring system.

Linking Korean Predicates to Knowledge Base Properties (한국어 서술어와 지식베이스 프로퍼티 연결)

  • Won, Yousung;Woo, Jongseong;Kim, Jiseong;Hahm, YoungGyun;Choi, Key-Sun
    • Journal of KIISE
    • /
    • v.42 no.12
    • /
    • pp.1568-1574
    • /
    • 2015
  • Relation extraction plays a role in for the process of transforming a sentence into a form of knowledge base. In this paper, we focus on predicates in a sentence and aim to identify the relevant knowledge base properties required to elucidate the relationship between entities, which enables a computer to understand the meaning of a sentence more clearly. Distant Supervision is a well-known approach for relation extraction, and it performs lexicalization tasks for knowledge base properties by generating a large amount of labeled data automatically. In other words, the predicate in a sentence will be linked or mapped to the possible properties which are defined by some ontologies in the knowledge base. This lexical and ontological linking of information provides us with a way of generating structured information and a basis for enrichment of the knowledge base.

Design and Implementation of an Ontology-based Knowledge Management System

  • Hideki-Mima;Yoon, Tae-Sung;Katsumori-Matsushima
    • Proceedings of the CALSEC Conference
    • /
    • 2004.02a
    • /
    • pp.107-111
    • /
    • 2004
  • The purpose of the study is to develop an integrated knowledge management system for the domains of genome and nano-technology, in which terminology-based literature mining, knowledge acquisition, knowledge structuring, and knowledge retrieval are combined. The system supports integrating different types of databases (papers and patents, technologies and innovations) and retrieving different types of knowledge simultaneously. The main objective of the system is to facilitate knowledge acquisition from documents and new knowledge discovery through a terminology-based similarity calculation and a visualization of automatically structured knowledge. Implementation issue of the system is also mentioned.

  • PDF