• Title/Summary/Keyword: knowledge discovery process

Search Result 99, Processing Time 0.024 seconds

Discovery of promising business items by technology-industry concordance and keyword co-occurrence analysis of US patents. (기술-산업 연계구조 및 특허 분석을 통한 미래유망 아이템 발굴)

  • Cho Byoung-Youl;Rho Hyun-Sook
    • Journal of Korea Technology Innovation Society
    • /
    • v.8 no.2
    • /
    • pp.860-885
    • /
    • 2005
  • This study relates to develop a quantitative method through which promising technology-based business items can be discovered and selected. For this study, we utilized patent trend analysis, technology-industry concordance analysis, and keyword co-occurrence analysis of US patents. By analyzing patent trends and technology-industry concordance, we were able to find out the emerging industry trends : prevalence of bio industry, service industry, and B2C business. From the direct and co-occurrence analysis of newly discovered patent keywords in the year, 2000, 28 promising business item candidates were extracted. Finally, the promising item candidates were prioritized using 4 business attractiveness determinants; market size, product life cycle, degree of the technological innovation, and coincidence with the industry trends. This result implicates that reliable discovery and selection of promising technology-based business items can be performed by a quantitative, objective and low- cost process using knowledge discovery method from patent database instead of peer review.

  • PDF

A Study on a Statistical Matching Method Using Clustering for Data Enrichment

  • Kim Soon Y.;Lee Ki H.;Chung Sung S.
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.2
    • /
    • pp.509-520
    • /
    • 2005
  • Data fusion is defined as the process of combining data and information from different sources for the effectiveness of the usage of useful information contents. In this paper, we propose a data fusion algorithm using k-means clustering method for data enrichment to improve data quality in knowledge discovery in database(KDD) process. An empirical study was conducted to compare the proposed data fusion technique with the existing techniques and shows that the newly proposed clustering data fusion technique has low MSE in continuous fusion variables.

Discovering Temporal Work Transference Networks from Workflow Execution Logs

  • Pham, Dinh-Lam;Ahn, Hyun;Kim, Kwanghoon Pio
    • Journal of Internet Computing and Services
    • /
    • v.20 no.2
    • /
    • pp.101-108
    • /
    • 2019
  • Workflow management systems (WfMSs) automate and manage workflows, which are implementations of organizational processes operated in process-centric organizations. In this paper, wepropose an algorithm to discover temporal work transference networks from workflow execution logs. The temporal work transference network is a special type of enterprise social networks that consists of workflow performers, and relationships among them that are formed by work transferences between performers who are responsible in performing precedent and succeeding activities in a workflow process. In terms of analysis, the temporal work transference network is an analytical property that has significant value to be analyzed to discover organizational knowledge for human resource management and related decision-making steps for process-centric organizations. Also, the beginning point of implementinga human-centered workflow intelligence framework dealing with work transference networks is to develop an algorithm for discovering temporal work transference cases on workflow execution logs. To this end, we first formalize a concept of temporal work transference network, and next, we present a discovery algorithm which is for the construction of temporal work transference network from workflow execution logs. Then, as a verification of the proposed algorithm, we apply the algorithm to an XES-formatted log dataset that was released by the process mining research group and finally summarize the discovery result.

A Better Prediction for Higher Education Performance using the Decision Tree

  • Hilal, Anwar;Zamani, Abu Sarwar;Ahmad, Sultan;Rizwanullah, Mohammad
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.4
    • /
    • pp.209-213
    • /
    • 2021
  • Data mining is the application of specific algorithms for extracting patterns from data and KDD is the automated or convenient extraction of patterns representing knowledge implicitly stored or captured in large databases, data warehouses, the Web, other massive information repositories or data streams. Data mining can be used for decision making in educational system. But educational institution does not use any knowledge discovery process approach on these data; this knowledge can be used to increase the quality of education. The problem was happening in the educational management system, but to make education system more flexible and discover knowledge from it huge data, we will use data mining techniques to solve problem.

A Workflow-based Social Network Intelligence Discovery Algorithm (워크플로우 소셜네트워크 인텔리전스 발견 알고리즘)

  • Kim, Kwang-Hoon
    • Journal of Internet Computing and Services
    • /
    • v.13 no.2
    • /
    • pp.73-86
    • /
    • 2012
  • This paper theoretically derives an algorithm to discover a new type of social networks from workflow models, which is termed workflow-based social network intelligence. In general, workflow intelligence (or business process intelligence) technology consists of four types of techniques that discover, analyze, monitor and control, and predict from workflow models and their execution histories. So, this paper proposes an algorithm, which is termed ICN-based workflow-based social network intelligence discovery algorithm, to be classified into the type of discovery techniques, which are able to discover workflow-based social network intelligence that are formed among workflow performers through a series of workflow models and their executions, In order particularly to prove the correctness and feasibility of the proposed algorithm, this paper tries to apply the algorithm to a specific workflow model and to show that it is able to generate its corresponding workflow-based social network intelligence.

Pattern Discovery by Genetic Algorithm in Syntactic Pattern Based Chart Analysis for Stock Market

  • Kim, Hyun-Soo
    • The Journal of Information Systems
    • /
    • v.3
    • /
    • pp.147-169
    • /
    • 1994
  • This paper present s a pattern generation scheme from financial charts. The patterns constitute knowledge which consists of patterns as the conditional part and the impact of the pattern as the conclusion part. The patterns in charts are represented in a syntactic approach. If the pattern elements and the impact of patterns are defined, the patterns are synthesized from simple to the more highly credible by evaluating each intermediate pattern from the instances. The overall process is divided into primitive discovery by Genetic Algorithms and pattern synthesis from the discovered primitives by the Syntactic Pattern-based Inductive Learning (SYNPLE) algorithm which we have developed. We have applied the scheme to a chart : the trend lines of stock price in daily base. The scheme can generate very credible patterns from training data sets.

  • PDF

An Algorithm for Sequential Sampling Method in Data Mining (데이터 마이닝에서 샘플링 기법을 이용한 연속패턴 알고리듬)

  • 홍지명;김낙현;김성집
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.21 no.45
    • /
    • pp.101-112
    • /
    • 1998
  • Data mining, which is also referred to as knowledge discovery in database, means a process of nontrivial extraction of implicit, previously unknown and potentially useful information (such as knowledge rules, constraints, regularities) from data in databases. The discovered knowledge can be applied to information management, decision making, and many other applications. In this paper, a new data mining problem, discovering sequential patterns, is proposed which is to find all sequential patterns using sampling method. Recognizing that the quantity of database is growing exponentially and transaction database is frequently updated, sampling method is a fast algorithm reducing time and cost while extracting the trend of customer behavior. This method analyzes the fraction of database but can in general lead to results of a very high degree of accuracy. The relaxation factor, as well as the sample size, can be properly adjusted so as to improve the result accuracy while minimizing the corresponding execution time. The superiority of the proposed algorithm will be shown through analyzing accuracy and efficiency by comparing with Apriori All algorithm.

  • PDF

Design of Process Management System based on Data Mining and Artificial Modelling for the Etching Process (데이터 마이닝과 지능 모델링에 기반한 에칭공정의 공정관리시스템 설계)

  • Bae, Hyeon;Kim, Sung-shin;Woo, Kwang-Bang
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.14 no.4
    • /
    • pp.390-395
    • /
    • 2004
  • A semiconductor manufacturing process is the complicate and dynamic process, and consists of many sub-processes. An etching process is the most important process in the semiconductor fabrication. In this paper, the decision support system based upon data mining and knowledge discovery is an important factor to improve the productivity and yield. The proposed decision support system consists of a neural network model and an inference system based on fuzzy logic Firstly, the product results are predicted by the neural network model constructed by the product patterns that represent the quality of the etching process. And the product patters are classified by expert's knowledge. Finally, the product conditions are estimated by the fuzzy inference system using the rules extracted from the classified patterns. Prediction of product qualities can be linked to each input and process variables. We employ data mining and intelligent techniques to find the best condition of the etching process. The proposed decision support system is efficient and easy to be implemented for the process management based upon expert's knowledge.

Inferring Undiscovered Public Knowledge by Using Text Mining-driven Graph Model (텍스트 마이닝 기반의 그래프 모델을 이용한 미발견 공공 지식 추론)

  • Heo, Go Eun;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.31 no.1
    • /
    • pp.231-250
    • /
    • 2014
  • Due to the recent development of Information and Communication Technologies (ICT), the amount of research publications has increased exponentially. In response to this rapid growth, the demand of automated text processing methods has risen to deal with massive amount of text data. Biomedical text mining discovering hidden biological meanings and treatments from biomedical literatures becomes a pivotal methodology and it helps medical disciplines reduce the time and cost. Many researchers have conducted literature-based discovery studies to generate new hypotheses. However, existing approaches either require intensive manual process of during the procedures or a semi-automatic procedure to find and select biomedical entities. In addition, they had limitations of showing one dimension that is, the cause-and-effect relationship between two concepts. Thus;this study proposed a novel approach to discover various relationships among source and target concepts and their intermediate concepts by expanding intermediate concepts to multi-levels. This study provided distinct perspectives for literature-based discovery by not only discovering the meaningful relationship among concepts in biomedical literature through graph-based path interference but also being able to generate feasible new hypotheses.

Cell Death and Stress Signaling in Glycogen Storage Disease Type I

  • Kim, So Youn;Bae, Yun Soo
    • Molecules and Cells
    • /
    • v.28 no.3
    • /
    • pp.139-148
    • /
    • 2009
  • Cell death has been traditionally classified in apoptosis and necrosis. Apoptosis, known as programmed cell death, is an active form of cell death mechanism that is tightly regulated by multiple cellular signaling pathways and requires ATP for its appropriate process. Apoptotic death plays essential roles for successful development and maintenance of normal cellular homeostasis in mammalian. In contrast to apoptosis, necrosis is classically considered as a passive cell death process that occurs rather by accident in disastrous conditions, is not required for energy and eventually induces inflammation. Regardless of different characteristics between apoptosis and necrosis, it has been well defined that both are responsible for a wide range of human diseases. Glycogen storage disease type I (GSD-I) is a kind of human genetic disorders and is caused by the deficiency of a microsomal protein, glucose-6-phosphatase-${\alpha}$ ($G6Pase-{\alpha}$) or glucose-6-phosphate transporter (G6PT) responsible for glucose homeostasis, leading to GSD-Ia or GSD-Ib, respectively. This review summarizes cell deaths in GSD-I and mostly focuses on current knowledge of the neutrophil apoptosis in GSD-Ib based upon ER stress and redox signaling.