• Title/Summary/Keyword: knowledge base population

Search Result 24, Processing Time 0.028 seconds

A Machine learning Approach for Knowledge Base Construction Incorporating GIS Data for land Cover Classification of Landsat ETM+ Image (지식 기반 시스템에서 GIS 자료를 활용하기 위한 기계 학습 기법에 관한 연구 - Landsat ETM+ 영상의 토지 피복 분류를 사례로)

  • Kim, Hwa-Hwan;Ku, Cha-Yang
    • Journal of the Korean Geographical Society
    • /
    • v.43 no.5
    • /
    • pp.761-774
    • /
    • 2008
  • Integration of GIS data and human expert knowledge into digital image processing has long been acknowledged as a necessity to improve remote sensing image analysis. We propose inductive machine learning algorithm for GIS data integration and rule-based classification method for land cover classification. Proposed method is tested with a land cover classification of a Landsat ETM+ multispectral image and GIS data layers including elevation, aspect, slope, distance to water bodies, distance to road network, and population density. Decision trees and production rules for land cover classification are generated by C5.0 inductive machine learning algorithm with 350 stratified random point samples. Production rules are used for land cover classification integrated with unsupervised ISODATA classification. Result shows that GIS data layers such as elevation, distance to water bodies and population density can be effectively integrated for rule-based image classification. Intuitive production rules generated by inductive machine learning are easy to understand. Proposed method demonstrates how various GIS data layers can be integrated with remotely sensed imagery in a framework of knowledge base construction to improve land cover classification.

Iterative learning system design for relation extraction and knowledge base population (관계 추출 및 지식베이스 확장을 위한 반복 학습 시스템 설계)

  • Jeong, Yong-Bin;Nam, Sang-Ha;Kim, Ji-Seong;Lee, Min-Ho;Choi, Key-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2019.10a
    • /
    • pp.185-189
    • /
    • 2019
  • 관계추출기의 학습을 위해서는 많은 학습 데이터가 필요한데, 사람이 모으게 되면 많은 비용이 필요하여 원격 지도 학습을 이용한 데이터 수집이 많은 연구에서 사용되고 있다. 원격 지도 학습은 지식베이스를 기반으로 학습 데이터를 자동으로 만들어 내는 방식이기에 비용이 거의 들지 않지만, 지식베이스의 질과 양에 영향을 받는다. 본 연구는 원격 지도 학습을 기본으로 관계추출기의 성능을 향상 시키고, 지식베이스를 확장하는 방안으로 반복학습을 제안한다. 실험을 적은 비용으로 빠르게 진행하기 위해 반복학습을 자동화 하는 시스템을 설계하여 실험을 하였고, 이 시스템으로 관계추출기의 성능이 향상 될 수 있는 가능성을 보였으며, 반복학습을 통한 지식베이스의 확장 방안을 제시한다.

  • PDF

The Development of Diesel Engine Room Fault Diagnosis System Using a Correlation Analysis Method (상관분석법에 의한 선박기관실 고장진단 시스템 개발)

  • Kim, Young-Il;Oh, Hyun-Kyung;Yu, Yung-Ho
    • Journal of Advanced Marine Engineering and Technology
    • /
    • v.30 no.2
    • /
    • pp.253-259
    • /
    • 2006
  • There is few study which automatically diagnoses the fault from ship's monitored data. The bigger control and monitoring system is. the more important fault diagnosis and maintenance is to reduce damage caused by system fault. This paper proposes fault diagnosis system using a correlation analysis algorithm which is able to diagnose and forecast the fault from monitored data and is composed of fault detection knowledge base and fault diagnosis knowledge base. For all kinds of ship's engine room monitored data are classified with combustion subsystem, heat exchange subsystem and electric motor and pump subsystem, To verify capability of fault detection, diagnosis and prediction, FMS(Fault Management System) is developed by C++. Simulation by FMS is carried out with population data set made by the log book data of 2 months duration from a large full container ship of H shipping company.

The Development of Diesel Engine Room Fault Diagnosis SystemUsing a Correlation Analysis Method (상관분석법에 의한 선박기관실 고장진단 시스템 개발)

  • Kim, Young-Il;Oh, Hyun-Gyeong;Cheon, Hang-Chun;Yu, Yung-Ho
    • Proceedings of the Korean Society of Marine Engineers Conference
    • /
    • 2005.06a
    • /
    • pp.251-256
    • /
    • 2005
  • There is few study which automatically diagnose the fault from ship's monitored signal. The bigger control and monitoring system is, the more important fault diagnosis and maintenance is to reduce damage brought forth by system fault. This paper proposes fault diagnosis system using a correlation analysis algorithm which is able to diagnose and forecast the fault and is composed to fault detection knowledge base and fault diagnosis knowledge base. For this all kinds of ship's engine room monitored data are classified with combustion subsystem, heat exchange subsystem and electric motor and pump subsystem by analyzing ship's operation data. To verifying capability of fault detection, diagnosis and prediction, Fault Management System(FMS) is developed by C++. Simulation experiment by FMS is carried out with population data set made by log book data of 2 months duration from a large full container ship of H shipping company.

  • PDF

Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries

  • Otoom, Mohammad Mahmood
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.1
    • /
    • pp.220-225
    • /
    • 2021
  • Human population growth rate is an important parameter for real-world planning. Common approaches rely upon fixed parameters like human population, mortality rate, fertility rate, which is collected historically to determine the region's population growth rate. Literature does not provide a solution for areas with no historical knowledge. In such areas, machine learning can solve the problem, but a multitude of machine learning algorithm makes it difficult to determine the best approach. Further, the missing feature is a common real-world problem. Thus, it is essential to compare and select the machine learning techniques which provide the best and most robust in the presence of missing features. This study compares 17 machine learning techniques (base learners and ensemble learners) performance in predicting the human population growth rate of the country. Among the 17 machine learning techniques, random forest outperformed all the other techniques both in predictive performance and robustness towards missing features. Thus, the study successfully demonstrates and compares machine learning techniques to predict the human population growth rate in settings where historical data and feature information is not available. Further, the study provides the best machine learning algorithm for performing population growth rate prediction.

Detection and Analysis of DNA Hybridization Characteristics by using Thermodynamic Method (열역학법을 이용한 DNA hybridization 특성 검출 및 해석)

  • Kim, Do-Gyun;Gwon, Yeong-Su
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.51 no.6
    • /
    • pp.265-270
    • /
    • 2002
  • The determination of DNA hybridization reaction can apply the molecular biology research, clinic diagnostics, bioengineering, environment monitoring, food science and application area. So, the improvement of DNA hybridization detection method is very important for the determination of this hybridization reaction. Several molecular biological techniques require accurate predictions of matched versus mismatched hybridization thermodynamics, such as PCR, sequencing by hybridization, gene diagnostics and antisense oligonucleotide probes. In addition, recent developments of oligonucleotide chip arrays as means for biochemical assays and DNA sequencing requires accurate knowledge of hybridization thermodynamics and population ratios at matched and mismatched target sites. In this study, we report the characteristics of the probe and matched, mismatched target oligonucleotide hybridization reaction using thermodynamic method. Thermodynamic of 5 oligonucleotides with central and terminal mismatch sequences were obtained by measured UV-absorbance as a function of temperature. The data show that the nearest-neighbor base-pair model is adequate for predicting thermodynamics of oligonucleotides with average deviations for $\Delta$H$^{0}$ , $\Delta$S$^{0}$ , $\Delta$G$_{37}$ $^{0}$ and T$_{m}$, respectively.>$^{0}$ and T$_{m}$, respectively.

Matrix Factorization Models for Knowledge Base Population (지식베이스 확장을 위한 행렬 분해 모델)

  • Kim, Jiho;Nam, Sangha;Choi, Key-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2017.10a
    • /
    • pp.3-7
    • /
    • 2017
  • 지식베이스의 목표는 세상의 모든 지식을 데이터베이스화 하는 것이지만 지식 획득 능력의 부족으로 항상 지식 부족 문제에 시달린다. 지식 획득은 주로 웹 상에 있는 자연언어문장을 지식화 하는 외부적인 지식 획득을 통해 이루어지지만, 지식베이스 내부에서 지식을 확장해 나가는 방법에 대해서는 연구가 소홀히 이루어지고 있다. 따라서 본 논문에서는 내부적인 지식 획득을 위한 지식베이스 행렬 분해 모델을 소개한다. 본 논문에서 소개하는 방법은 지식베이스를 행렬로 변환한 뒤 행렬 분해 모델을 통해 새로운 지식에 대한 신뢰도를 점수화하는 방법이다. 본 논문에서 소개한 방법의 우수성과 실효성을 입증하기 위해 한국어 지식베이스인 한국어 디비피디아(2016-10)를 대상으로 본 모델의 정확도 측정 실험 결과를 소개한다.

  • PDF

Matrix Factorization Models for Knowledge Base Population (지식베이스 확장을 위한 행렬 분해 모델)

  • Kim, Jiho;Nam, Sangha;Choi, Key-Sun
    • 한국어정보학회:학술대회논문집
    • /
    • 2017.10a
    • /
    • pp.3-7
    • /
    • 2017
  • 지식베이스의 목표는 세상의 모든 지식을 데이터베이스화 하는 것이지만 지식 획득 능력의 부족으로 항상 지식 부족 문제에 시달린다. 지식 획득은 주로 웹 상에 있는 자연언어문장을 지식화 하는 외부적인 지식 획득을 통해 이루어지지만, 지식베이스 내부에서 지식을 확장해 나가는 방법에 대해서는 연구가 소홀히 이루어지고 있다. 따라서 본 논문에서는 내부적인 지식 획득을 위한 지식베이스 행렬 분해 모델을 소개한다. 본 논문에서 소개하는 방법은 지식베이스를 행렬로 변환한 뒤 행렬 분해 모델을 통해 새로운 지식에 대한 신뢰도를 점수화하는 방법이다. 본 논문에서 소개한 방법의 우수성과 실효성을 입증하기 위해 한국어 지식베이스인 한국어 디비피디아(2016-10)를 대상으로 본 모델의 정확도 측정 실험 결과를 소개한다.

  • PDF

Knowledge Base Population Method using Wikipedia (위키피디아를 이용한 지식베이스 개념 확장 방법)

  • Hwang, Myung-Gwon;Choi, Dong-Jin;Kim, Pan-Koo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2010.06c
    • /
    • pp.1-4
    • /
    • 2010
  • 다양한 분야에 소속된 사람들이 사용하고 있는 개념들을 기존의 워드넷과 같은 지식베이스가 모두 포함하지 못한다는 한계점이 지적되었다. 본 연구에서는 이를 해결하기 위해 위키피디아 문서집합의 분석을 통하여 해결하고자 한다. 위키피디아는 현재 320만개 이상의 유/무형의 개체에 대한 상세한 설명을 포함하고 있으며, 현재도 해당 분야의 전문가들에 의해 지속적으로 제목(주제) 생성 및 내용 작성이 수행되고 있다. 이에, 위키피디아 문서는 지식베이스의 개념 확장을 위해 아주 유용한 자원이 될 수 있으며, 본 논문에서는 이러한 위키피디아 문서 제목의 개념화를 통해 기존의 지식베이스와 연결하는 의미적인 방법을 기술한다. 이를 이용한 간단한 실험을 통하여 본 연구가 우월한 가능성이 있음을 파악하였다.

  • PDF

KAB: Knowledge Augmented BERT2BERT Automated Questions-Answering system for Jurisprudential Legal Opinions

  • Alotaibi, Saud S.;Munshi, Amr A.;Farag, Abdullah Tarek;Rakha, Omar Essam;Al Sallab, Ahmad A.;Alotaibi, Majid
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.6
    • /
    • pp.346-356
    • /
    • 2022
  • The jurisprudential legal rules govern the way Muslims react and interact to daily life. This creates a huge stream of questions, that require highly qualified and well-educated individuals, called Muftis. With Muslims representing almost 25% of the planet population, and the scarcity of qualified Muftis, this creates a demand supply problem calling for Automation solutions. This motivates the application of Artificial Intelligence (AI) to solve this problem, which requires a well-designed Question-Answering (QA) system to solve it. In this work, we propose a QA system, based on retrieval augmented generative transformer model for jurisprudential legal question. The main idea in the proposed architecture is the leverage of both state-of-the art transformer models, and the existing knowledge base of legal sources and question-answers. With the sensitivity of the domain in mind, due to its importance in Muslims daily lives, our design balances between exploitation of knowledge bases, and exploration provided by the generative transformer models. We collect a custom data set of 850,000 entries, that includes the question, answer, and category of the question. Our evaluation methodology is based on both quantitative and qualitative methods. We use metrics like BERTScore and METEOR to evaluate the precision and recall of the system. We also provide many qualitative results that show the quality of the generated answers, and how relevant they are to the asked questions.