• 제목/요약/키워드: Knowledge Discovery

검색결과 392건 처리시간 0.023초

프로세스 마이닝을 위한 거리 기반의 API(Anomaly Process Instance) 탐지법 (Detection of API(Anomaly Process Instance) Based on Distance for Process Mining)

  • 전대욱;배혜림
    • 대한산업공학회지
    • /
    • 제41권6호
    • /
    • pp.540-550
    • /
    • 2015
  • There have been many attempts to find knowledge from data using conventional statistics, data mining, artificial intelligence, machine learning and pattern recognition. In those research areas, knowledge is approached in two ways. Firstly, researchers discover knowledge represented in general features for universal recognition, and secondly, they discover exceptional and distinctive features. In process mining, an instance is sequential information bounded by case ID, known as process instance. Here, an exceptional process instance can cause a problem in the analysis and discovery algorithm. Hence, in this paper we develop a method to detect the knowledge of exceptional and distinctive features when performing process mining. We propose a method for anomaly detection named Distance-based Anomaly Process Instance Detection (DAPID) which utilizes distance between process instances. DAPID contributes to a discovery of distinctive characteristic of process instance. For verifying the suggested methodology, we discovered characteristics of exceptional situations from log data. Additionally, we experiment on real data from a domestic port terminal to demonstrate our proposed methodology.

의미 기반의 지식모델 통합과 탐색에 관한 연구 (A study on integrating and discovery of semantic based knowledge model)

  • 전승수
    • 인터넷정보학회논문지
    • /
    • 제15권6호
    • /
    • pp.99-106
    • /
    • 2014
  • 최근 자연어 및 정형언어 처리, 인공지능 알고리즘 등을 활용한 효율적인 의미 기반 지식모델의 생성과 분석 방법이 제시되고 있다. 이러한 의미 기반 지식모델은 효율적 의사결정트리(Decision Making Tree)와 특정 상황에 대한 체계적인 문제해결(Problem Solving) 경로 분석에 활용된다. 특히 다양한 복잡계 및 사회 연계망 분석에 있어 정적 지표 생성과 회귀 분석, 행위적 모델을 통한 추이분석, 거시예측을 지원하는 모의실험 모형의 기반이 된다. 하지만 대부분의 지식 모델은 특정 지표나 정제된 데이터를 수동적으로 모델링하여 분석에 활용한다. 본 논문에서는 텍스트 마이닝 기술을 통해 방대한 비정형 정보로부터 지식 모델을 구성하는 토픽인자와 관계 노드를 생성하고 이를 통합하는 방법과 정형적 알고리즘을 제시한다. 이를 위해 먼저, 텍스트 마이닝을 통해 도출되는 키워드 맵을 동치적 지식맵으로 변환하고 이를 의미적 지식모델로 통합하는 방법을 설명한다. 또한 키워드 맵으로부터 유의미한 토픽 맵을 투영하는 방법과 의미적 동치 모델을 유도하는 알고리즘을 제안한다.

테크놀로지 환경에서의 수학적 발견 탐구학습 : Joy의 닮은 사격형 (Another discovery in the technology-based classroom : Joy's Similar Quadrilaterals)

  • 정인철
    • 한국학교수학회논문집
    • /
    • 제8권3호
    • /
    • pp.411-422
    • /
    • 2005
  • 1980년 LOGO 이후로 테크놀로지의 활용에 대하여 논의가 지속되어 왔다. 교수학습 상황에서의 테크놀로지의 역할은 무엇인지, 테크놀로지가 학습자들의 효과적인 이해를 위해서 어떤 역할을 학습자들에게 제공할 수 있는지, 그리고 특히 전통적인 교수학습 상황과는 달리 테크놀로지를 활용하여 과거에는 할 수 없었던 수학학습이라든지 또는 우리가 현재 가지고 있는 지식의 확장을 가능하게 한다는 측면에서의 논의가 수학교육계에서는 늘 있어 왔다. 본 논문은 테크놀로지를 활용하여 우리의 지식을 확장하여 탐구를 배경으로 하여 새로운 수학적 지식의 발견의 한 실례를 소개하고 탐구를 중심으로 한 수학학습에 대하여 논한다.

  • PDF

최적 규칙 발견 시스템의 구현: 개념 계층과 정보 이득 및 라프셋에 의한 통합 접근 (An Implementation of Optimal Rules Discovery System: An Integrated Approach Based on Concept Hierarchies, Information Gain, and Rough Sets)

  • 김진상
    • 한국지능시스템학회논문지
    • /
    • 제10권3호
    • /
    • pp.232-241
    • /
    • 2000
  • 본 연구는 대량의 데이터에서 효율적으로 최적 규칙을 발견하기 위해 개념 계층과 정보 이득 및 라프셋 이론에 딕반한 통합 방법을 제시하고,이를 최적 규칙 발견 시스템으로 구현한다. 본 방법은 데이터베이스에 있는 데이터에서 일반화된 지식을 추출하기 위한 속성중심의 개념 상승 기법과 불필요한 속성 및 속성값을 제거하기 위한 지식 감축 기법을 적용하며, 최적 규칙의 도출을 위해 속성의 중요도를 사용한다. 본 시스템은 먼저, 속성값 개념의 일반화에 의해 종복 튜플을 제거함으로써 데이터 베이스의 크기를 줄이고, 결정속성에 뎡향을 주지않는 조건속성을 제거하여 간략화된 최적 규칙을 유도한다.그리고 실제 데이터에 적용하여 결정 규칙을 유도하고 그 규칙을 새로운 데이터에 테스트햐 봄으로써 새로운 데이터에도 잘 적용됨을 보인다.

  • PDF

Neighbor Discovery in a Wireless Sensor Network: Multipacket Reception Capability and Physical-Layer Signal Processing

  • Jeon, Jeongho;Ephremides, Anthony
    • Journal of Communications and Networks
    • /
    • 제14권5호
    • /
    • pp.566-577
    • /
    • 2012
  • In randomly deployed networks, such as sensor networks, an important problem for each node is to discover its neighbor nodes so that the connectivity amongst nodes can be established. In this paper, we consider this problem by incorporating the physical layer parameters in contrast to the most of the previous work which assumed a collision channel. Specifically, the pilot signals that nodes transmit are successfully decoded if the strength of the received signal relative to the interference is sufficiently high. Thus, each node must extract signal parameter information from the superposition of an unknown number of received signals. This problem falls naturally in the purview of random set theory (RST) which generalizes standard probability theory by assigning sets, rather than values, to random outcomes. The contributions in the paper are twofold: First, we introduce the realistic effect of physical layer considerations in the evaluation of the performance of logical discovery algorithms; such an introduction is necessary for the accurate assessment of how an algorithm performs. Secondly, given the double uncertainty of the environment (that is, the lack of knowledge of the number of neighbors along with the lack of knowledge of the individual signal parameters), we adopt the viewpoint of RST and demonstrate its advantage relative to classical matched filter detection method.

A Methodology for Ontology-based Knowledge Acquisition and Structuring in an Industry-Academic-Government Project ″Go Japan!″

  • Hideki-Mima;Yoon, Tae-Sung
    • 한국전자거래학회:학술대회논문집
    • /
    • 한국전자거래학회 2003년도 종합학술대회 논문집
    • /
    • pp.197-203
    • /
    • 2003
  • The purpose of the study is to develop an integrated knowledge structuring system for the domain of engineering, in which ontology-based literature mining, knowledge acquisition, knowledge integration, and knowledge retrieval are combined using XML-based tag information and ontology management. The system supports combining different types of databases (papers and patents, technologies and innovations) and retrieving different types of knowledge simultaneously. The main objective of the system is to facilitate knowledge acquisition and knowledge retrieval from documents through an ontology-based dynamic similarity calculation and a visualization of automatically structured knowledge. Through experimentations we conducted using 100,000 words economic documents reported in the "Go! Japan" project for analyzing Japanese industrial situation, and 100,000 words molecular biology Papers, we show the system is Practical enough for accelerating knowledge acquisition and knowledge discovery from the information sea.

  • PDF

Design and Implementation of an Ontology-based Knowledge Management System

  • Hideki-Mima;Yoon, Tae-Sung;Katsumori-Matsushima
    • 한국전자거래학회:학술대회논문집
    • /
    • 한국전자거래학회 2004년도 e-Biz World Conference
    • /
    • pp.107-111
    • /
    • 2004
  • The purpose of the study is to develop an integrated knowledge management system for the domains of genome and nano-technology, in which terminology-based literature mining, knowledge acquisition, knowledge structuring, and knowledge retrieval are combined. The system supports integrating different types of databases (papers and patents, technologies and innovations) and retrieving different types of knowledge simultaneously. The main objective of the system is to facilitate knowledge acquisition from documents and new knowledge discovery through a terminology-based similarity calculation and a visualization of automatically structured knowledge. Implementation issue of the system is also mentioned.

  • PDF

From The Discovery Challenge on Thrombosis Data

  • Takabayashi, Katsuhiko;Tsumoto, Shusaku
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2001년도 The Pacific Aisan Confrence On Intelligent Systems 2001
    • /
    • pp.361-363
    • /
    • 2001
  • Although data mining promises a new paradigm to discover medical knowledge form a database, there are many problems to be solved before real application is feasible. We had the chance to provide a data set to be analyzed as a discovery challenge by using various data mining techniques at the PKDD conference. As data providers, we evaluated and discussed results and clarified problems.

  • PDF

A Proposal of Some Analysis Methods for Discovery of User Information from Web Data

  • Ahn, JeongYong;Han, Kyung Soo
    • Communications for Statistical Applications and Methods
    • /
    • 제8권1호
    • /
    • pp.281-289
    • /
    • 2001
  • The continuous growth in the use of the World Wide Web is creating the data with very large scale and different types. Analyzing such data can help to determine the life time value of users, evaluate the effectiveness of web sites, and design marketing strategies and services. In this paper, we propose some analysis methods for web data and present an example of a prototypical web data analysis.

  • PDF

A Perspective on Pharmaceutical Industrial Research on Antihypertensive drugs

  • Lee, Jang-Yun;John F. DeBernardis
    • Archives of Pharmacal Research
    • /
    • 제10권4호
    • /
    • pp.245-249
    • /
    • 1987
  • Cardiovascular disease is at present the leading cause of deth in the United States and other in dustrilized countries. A major contributing factor of cardiovascular disease is essential hypertension. Untreated, essential hypertension is considered a risk factor for sudden death due to myocardial infarctions, as well as a risk factor for cerebral vascular disease, renal failure and congestive heart failure. During the last decade, significant progress has been made in the basic knowledge of the pathogenesis of hypertension as well as in the development of new antihypertensive drugs.

  • PDF