• Title/Summary/Keyword: Ontology Extraction

Search Result 76, Processing Time 0.033 seconds

WTO, an ontology for wheat traits and phenotypes in scientific publications

  • Nedellec, Claire;Ibanescu, Liliana;Bossy, Robert;Sourdille, Pierre
    • Genomics & Informatics
    • /
    • v.18 no.2
    • /
    • pp.14.1-14.11
    • /
    • 2020
  • Phenotyping is a major issue for wheat agriculture to meet the challenges of adaptation of wheat varieties to climate change and chemical input reduction in crop. The need to improve the reuse of observations and experimental data has led to the creation of reference ontologies to standardize descriptions of phenotypes and to facilitate their comparison. The scientific literature is largely under-exploited, although extremely rich in phenotype descriptions associated with cultivars and genetic information. In this paper we propose the Wheat Trait Ontology (WTO) that is suitable for the extraction and management of scientific information from scientific papers, and its combination with data from genomic and experimental databases. We describe the principles of WTO construction and show examples of WTO use for the extraction and management of phenotype descriptions obtained from scientific documents.

Semantic Ontology Speech Information Extraction using Non-parametric Correlation Coefficient (비모수적 상관계수를 이용한 시맨틱 온톨로지 음성 정보 추출)

  • Lee, Byungwook
    • Journal of Digital Convergence
    • /
    • v.11 no.9
    • /
    • pp.147-151
    • /
    • 2013
  • On retrieving high frequency keywords in information retrieval system, mismatchings to user's request are problems because of the various meanings of keywords in the existing ontology configuration. In this paper, it is to construct personnel selection ontology and rules in personnel management which are composed of various concepts and knowledges based on semantic web technology and suggest selection procedures to support these rules and knowledge retrieval system to verify suitability of selection results. This system utilizes a method of extraction of speech features by using non-parametric correlation coefficient. This proposed method has been validated by showing that the result average SNR of the experiment evaluation of the proposed techniques was shown to be decreased by .752dB.

Knowledge Extraction Methodology and Framework from Wikipedia Articles for Construction of Knowledge-Base (지식베이스 구축을 위한 한국어 위키피디아의 학습 기반 지식추출 방법론 및 플랫폼 연구)

  • Kim, JaeHun;Lee, Myungjin
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.43-61
    • /
    • 2019
  • Development of technologies in artificial intelligence has been rapidly increasing with the Fourth Industrial Revolution, and researches related to AI have been actively conducted in a variety of fields such as autonomous vehicles, natural language processing, and robotics. These researches have been focused on solving cognitive problems such as learning and problem solving related to human intelligence from the 1950s. The field of artificial intelligence has achieved more technological advance than ever, due to recent interest in technology and research on various algorithms. The knowledge-based system is a sub-domain of artificial intelligence, and it aims to enable artificial intelligence agents to make decisions by using machine-readable and processible knowledge constructed from complex and informal human knowledge and rules in various fields. A knowledge base is used to optimize information collection, organization, and retrieval, and recently it is used with statistical artificial intelligence such as machine learning. Recently, the purpose of the knowledge base is to express, publish, and share knowledge on the web by describing and connecting web resources such as pages and data. These knowledge bases are used for intelligent processing in various fields of artificial intelligence such as question answering system of the smart speaker. However, building a useful knowledge base is a time-consuming task and still requires a lot of effort of the experts. In recent years, many kinds of research and technologies of knowledge based artificial intelligence use DBpedia that is one of the biggest knowledge base aiming to extract structured content from the various information of Wikipedia. DBpedia contains various information extracted from Wikipedia such as a title, categories, and links, but the most useful knowledge is from infobox of Wikipedia that presents a summary of some unifying aspect created by users. These knowledge are created by the mapping rule between infobox structures and DBpedia ontology schema defined in DBpedia Extraction Framework. In this way, DBpedia can expect high reliability in terms of accuracy of knowledge by using the method of generating knowledge from semi-structured infobox data created by users. However, since only about 50% of all wiki pages contain infobox in Korean Wikipedia, DBpedia has limitations in term of knowledge scalability. This paper proposes a method to extract knowledge from text documents according to the ontology schema using machine learning. In order to demonstrate the appropriateness of this method, we explain a knowledge extraction model according to the DBpedia ontology schema by learning Wikipedia infoboxes. Our knowledge extraction model consists of three steps, document classification as ontology classes, proper sentence classification to extract triples, and value selection and transformation into RDF triple structure. The structure of Wikipedia infobox are defined as infobox templates that provide standardized information across related articles, and DBpedia ontology schema can be mapped these infobox templates. Based on these mapping relations, we classify the input document according to infobox categories which means ontology classes. After determining the classification of the input document, we classify the appropriate sentence according to attributes belonging to the classification. Finally, we extract knowledge from sentences that are classified as appropriate, and we convert knowledge into a form of triples. In order to train models, we generated training data set from Wikipedia dump using a method to add BIO tags to sentences, so we trained about 200 classes and about 2,500 relations for extracting knowledge. Furthermore, we evaluated comparative experiments of CRF and Bi-LSTM-CRF for the knowledge extraction process. Through this proposed process, it is possible to utilize structured knowledge by extracting knowledge according to the ontology schema from text documents. In addition, this methodology can significantly reduce the effort of the experts to construct instances according to the ontology schema.

Design and Construction of a NLP Based Knowledge Extraction Methodology in the Medical Domain Applied to Clinical Information

  • Moreno, Denis Cedeno;Vargas-Lombardo, Miguel
    • Healthcare Informatics Research
    • /
    • v.24 no.4
    • /
    • pp.376-380
    • /
    • 2018
  • Objectives: This research presents the design and development of a software architecture using natural language processing tools and the use of an ontology of knowledge as a knowledge base. Methods: The software extracts, manages and represents the knowledge of a text in natural language. A corpus of more than 200 medical domain documents from the general medicine and palliative care areas was validated, demonstrating relevant knowledge elements for physicians. Results: Indicators for precision, recall and F-measure were applied. An ontology was created called the knowledge elements of the medical domain to manipulate patient information, which can be read or accessed from any other software platform. Conclusions: The developed software architecture extracts the medical knowledge of the clinical histories of patients from two different corpora. The architecture was validated using the metrics of information extraction systems.

Syntactic and semantic information extraction from NPP procedures utilizing natural language processing integrated with rules

  • Choi, Yongsun;Nguyen, Minh Duc;Kerr, Thomas N. Jr.
    • Nuclear Engineering and Technology
    • /
    • v.53 no.3
    • /
    • pp.866-878
    • /
    • 2021
  • Procedures play a key role in ensuring safe operation at nuclear power plants (NPPs). Development and maintenance of a large number of procedures reflecting the best knowledge available in all relevant areas is a complex job. This paper introduces a newly developed methodology and the implemented software, called iExtractor, for the extraction of syntactic and semantic information from NPP procedures utilizing natural language processing (NLP)-based technologies. The steps of the iExtractor integrated with sets of rules and an ontology for NPPs are described in detail with examples. Case study results of the iExtractor applied to selected procedures of a U.S. commercial NPP are also introduced. It is shown that the iExtractor can provide overall comprehension of the analyzed procedures and indicate parts of procedures that need improvement. The rich information extracted from procedures could be further utilized as a basis for their enhanced management.

A Method on Automatically Creating an Ontology by Extracting Various Relationships between Terms (용어 간의 다양한 관계 추출을 통해 온톨로지를 자동으로 생성하는 방법)

  • Young-tae Kim
    • Journal of Practical Engineering Education
    • /
    • v.15 no.2
    • /
    • pp.321-330
    • /
    • 2023
  • In this paper, we propose a method of automatically creating an ontology by extracting various relationships between terms necessary for constructing an ontology of a specific domain. The extracted relationship is constructed as an ontology by encoding it into an axiomatic set in the structure of the ontology. To solve efficiently, we represent the search space of the set as an integer programming problem, and we reduce the matrix by using a simple reduction that eliminates rules that are not very helpful for optimization. In conclusion, this paper proposes a way to generalize patterns using given data, reduce search space while maintaining useful patterns, and automatically generate efficient ontology using extracted relationships by applying algorithms composed of structured ontology.

A Methodology for Ontology-based Knowledge Acquisition and Structuring in an Industry-Academic-Government Project ″Go Japan!″

  • Hideki-Mima;Yoon, Tae-Sung
    • Proceedings of the CALSEC Conference
    • /
    • 2003.09a
    • /
    • pp.197-203
    • /
    • 2003
  • The purpose of the study is to develop an integrated knowledge structuring system for the domain of engineering, in which ontology-based literature mining, knowledge acquisition, knowledge integration, and knowledge retrieval are combined using XML-based tag information and ontology management. The system supports combining different types of databases (papers and patents, technologies and innovations) and retrieving different types of knowledge simultaneously. The main objective of the system is to facilitate knowledge acquisition and knowledge retrieval from documents through an ontology-based dynamic similarity calculation and a visualization of automatically structured knowledge. Through experimentations we conducted using 100,000 words economic documents reported in the "Go! Japan" project for analyzing Japanese industrial situation, and 100,000 words molecular biology Papers, we show the system is Practical enough for accelerating knowledge acquisition and knowledge discovery from the information sea.

  • PDF

A Method for Extracting Relationships Between Terms Using Pattern-Based Technique (패턴 기반 기법을 사용한 용어 간 관계 추출 방법)

  • Kim, Young Tae;Kim, Chi Su
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.8
    • /
    • pp.281-286
    • /
    • 2018
  • With recent increase in complexity and variety of information and massively available information, interest in and necessity of ontology has been on the rise as a method of extracting a meaningful search result from massive data. Although there have been proposed many methods of extracting the ontology from a given text of a natural language, the extraction based on most of the current methods is not consistent with the structure of the ontology. In this paper, we propose a method of automatically creating ontology by distinguishing a term needed for establishing the ontology from a text given in a specific domain and extracting various relationships between the terms based on the pattern-based method. To extract the relationship between the terms, there is proposed a method of reducing the size of a searching space by taking a matching set of patterns into account and connecting a join-set concept and a pattern array. The result is that this method reduces the size of the search space by 50-95% without removing any useful patterns from the search space.

Extraction method of spatial relation by analyzing location tag in folksonomy (폭소노미에서 위치태그 분석을 통한 공간관계 추출 기법)

  • Choi, Yun-Hee;Yong, Hwan-Seung
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.8
    • /
    • pp.1043-1054
    • /
    • 2009
  • As the semantic web receives higher concern with an intensified necessity in these days, the research on the ontology as its core technology has been carried out in various fields. The ontology has been adopted as an alternative to work out lots of problematic issues resulted from the insufficient vocabulary selection rules in folksonomy, widely accepted under Web 2.0. Therefore the importance of research to complementarily consolidate the two disciplines, the folksonomy and the ontology, has been increased. Based on this idea this research proposes a system, which pulls out, using open services, the location information tags from folksonomy-based metadata, ultimately extracts, following location information analyses, spatial relationships among tags, and in turn automatically constructs self-correcting location information domain ontology. The system devised in this study will associate data derived from easily accessible folksonomy with meaningful and technological information from ontology.

  • PDF

A Statistical Approach for Extracting and Miming Relation between Concepts (개념간 관계의 추출과 명명을 위한 통계적 접근방법)

  • Kim Hee-soo;Choi Ikkyu;Kim Minkoo
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.479-486
    • /
    • 2005
  • The ontology was proposed to construct the logical basis of semantic web. Ontology represents domain knowledge in the formal form and it enables that machine understand domain knowledge and provide appropriate intelligent service for user request. However, the construction and the maintenance of ontology requires large amount of cost and human efforts. This paper proposes an automatic ontology construction method for defining relation between concepts in the documents. The Proposed method works as following steps. First we find concept pairs which compose association rule based on the concepts in domain specific documents. Next, we find pattern that describes the relation between concepts by clustering the context between two concepts composing association rule. Last, find generalized pattern name by clustering the clustered patterns. To verify the proposed method, we extract relation between concepts and evaluate the result using documents set provide by TREC(Text Retrieval Conference). The result shows that proposed method cant provide useful information that describes relation between concepts.