• Title/Summary/Keyword: Data Mining Tool Framework

Search Result 19, Processing Time 0.03 seconds

Framework for False Alarm Pattern Analysis of Intrusion Detection System using Incremental Association Rule Mining

  • Chon Won Yang;Kim Eun Hee;Shin Moon Sun;Ryu Keun Ho
    • Proceedings of the KSRS Conference
    • /
    • 2004.10a
    • /
    • pp.716-718
    • /
    • 2004
  • The false alarm data in intrusion detection systems are divided into false positive and false negative. The false positive makes bad effects on the performance of intrusion detection system. And the false negative makes bad effects on the efficiency of intrusion detection system. Recently, the most of works have been studied the data mining technique for analysis of alert data. However, the false alarm data not only increase data volume but also change patterns of alert data along the time line. Therefore, we need a tool that can analyze patterns that change characteristics when we look for new patterns. In this paper, we focus on the false positives and present a framework for analysis of false alarm pattern from the alert data. In this work, we also apply incremental data mining techniques to analyze patterns of false alarms among alert data that are incremental over the time. Finally, we achieved flexibility by using dynamic support threshold, because the volume of alert data as well as included false alarms increases irregular.

  • PDF

Smart Agents and Multimedia Systems

  • Kim, Steven H.
    • Proceedings of the Korea Database Society Conference
    • /
    • 1997.10a
    • /
    • pp.215-269
    • /
    • 1997
  • Outline $\textbullet$ Introduction $\textbullet$ Multimedia - Types of Data - Motivation - Key issue - Hardware Products - Application Areas $\textbullet$ Agents - Rationale for Agents - Sedentary vs. Mobile - Functional Categories - Application Areas $\textbullet$ Data Mining - 2-D Framework for Data Mining Tools - Classification of Tool - Application Areas - Learning Methodologies * Case Based Reasoning * Neural Networks * Statistical Learning: Orthogonal Arrays * Multi-strategy Learning $\textbullet$ Case Study - Finbot $\textbullet$ Conclusion

  • PDF

A case study of ECN data conversion for Korean and foreign ecological data integration

  • Lee, Hyeonjeong;Shin, Miyoung;Kwon, Ohseok
    • Journal of Ecology and Environment
    • /
    • v.41 no.5
    • /
    • pp.142-144
    • /
    • 2017
  • In recent decades, as it becomes increasingly important to monitor and research long-term ecological changes, worldwide attempts are being conducted to integrate and manage ecological data in a unified framework. Especially domestic ecological data in South Korea should be first standardized based on predefined common protocols for data integration, since they are often scattered over many different systems in various forms. Additionally, foreign ecological data should be converted into a proper unified format to be used along with domestic data for association studies. In this study, our interest is to integrate ECN data with Korean domestic ecological data under our unified framework. For this purpose, we employed our semi-automatic data conversion tool to standardize foreign data and utilized ground beetle (Carabidae) datasets collected from 12 different observatory sites of ECN. We believe that our attempt to convert domestic and foreign ecological data into a standardized format in a systematic way will be quite useful for data integration and association analysis in many ecological and environmental studies.

The Construction of Engineering Educational Statistics System in Korea (국내 공학 교육통계 시스템 구축)

  • An, Hye Jeong;Kim, Ji Hyeon;Hong, Sung-Jo
    • Journal of Engineering Education Research
    • /
    • v.19 no.2
    • /
    • pp.53-59
    • /
    • 2016
  • Along with the industry growth, engineering colleges in Korea has have a quantitative growth. Many of the policy promotions and budgets for engineering colleges from the government are supported. And the various monitoring methods to verify their achievement have demanded. This paper deals with the construction of engineering educational statistics system in Korea. It named Korea Engineering Data Management System(K-EDMS). This system is based on the data mining tool and supports data-based decision making for an advanced engineering education service. This paper presents related researches of case studies. Then, we have designed K-EDMS, and constructed 157 cases for engineering colleges of the year 2014.

Improving Process Mining with Trace Clustering (자취 군집화를 통한 프로세스 마이닝의 성능 개선)

  • Song, Min-Seok;Gunther, C.W.;van der Aalst, W.M.P.;Jung, Jae-Yoon
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.34 no.4
    • /
    • pp.460-469
    • /
    • 2008
  • Process mining aims at mining valuable information from process execution results (called "event logs"). Even though process mining techniques have proven to be a valuable tool, the mining results from real process logs are usually too complex to interpret. The main cause that leads to complex models is the diversity of process logs. To address this issue, this paper proposes a trace clustering approach that splits a process log into homogeneous subsets and applies existing process mining techniques to each subset. Based on log profiles from a process log, the approach uses existing clustering techniques to derive clusters. Our approach are implemented in ProM framework. To illustrate this, a real-life case study is also presented.

Prediction of User Preferred Cosmetic Brand Based on Unified Fuzzy Rule Inference

  • Kim, Jin-Sung
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2005.11a
    • /
    • pp.271-275
    • /
    • 2005
  • In this research, we propose a Unified Fuzzy rule-based knowledge Inference Systems UFIS) to help the expert in cosmetic brand detection. Users' preferred cosmetic product detection is very important in the level of CRM. To this Purpose, many corporations trying to develop an efficient data mining tool. In this study, we develop a prototype fuzzy rule detection and inference system. The framework used in this development is mainly based on two different mechanisms such as fuzzy rule extraction and RDB (Relational DB)-based fuzzy rule inference. First, fuzzy clustering and fuzzy rule extraction deal with the presence of the knowledge in data base and its value is presented with a value between $0\∼1$. Second, RDB and SQL(Structured Query Language)-based fuzzy rule inference mechanism provide more flexibility in knowledge management than conventional non-fuzzy value-based KMS(Knowledge Management Systems)

  • PDF

Prediction of User's Preference by using Fuzzy Rule & RDB Inference: A Cosmetic Brand Selection

  • Kim, Jin-Sung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.5 no.4
    • /
    • pp.353-359
    • /
    • 2005
  • In this research, we propose a Unified Fuzzy rule-based knowledge Inference Systems (UFIS) to help the expert in cosmetic brand detection. Users' preferred cosmetic product detection is very important in the level of CRM. To this purpose, many corporations trying to develop an efficient data mining tool. In this study, we develop a prototype fuzzy rule detection and inference system. The framework used in this development is mainly based on two different mechanisms such as fuzzy rule extraction and RDB (Relational DB)-based fuzzy rule inference. First, fuzzy clustering and fuzzy rule extraction deal with the presence of the knowledge in data base and its value is presented with a value between 0 -1. Second, RDB and SQL (Structured Query Language)-based fuzzy rule inference mechanism provide more flexibility in knowledge management than conventional non-fuzzy value-based KMS (Knowledge Management Systems).

Prediction of Length of ICU Stay Using Data-mining Techniques: an Example of Old Critically Ill Postoperative Gastric Cancer Patients

  • Zhang, Xiao-Chun;Zhang, Zhi-Dan;Huang, De-Sheng
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.1
    • /
    • pp.97-101
    • /
    • 2012
  • Objective: With the background of aging population in China and advances in clinical medicine, the amount of operations on old patients increases correspondingly, which imposes increasing challenges to critical care medicine and geriatrics. The study was designed to describe information on the length of ICU stay from a single institution experience of old critically ill gastric cancer patients after surgery and the framework of incorporating data-mining techniques into the prediction. Methods: A retrospective design was adopted to collect the consecutive data about patients aged 60 or over with a gastric cancer diagnosis after surgery in an adult intensive care unit in a medical university hospital in Shenyang, China, from January 2010 to March 2011. Characteristics of patients and the length their ICU stay were gathered for analysis by univariate and multivariate Cox regression to examine the relationship with potential candidate factors. A regression tree was constructed to predict the length of ICU stay and explore the important indicators. Results: Multivariate Cox analysis found that shock and nutrition support need were statistically significant risk factors for prolonged length of ICU stay. Altogether, eight variables entered the regression model, including age, APACHE II score, SOFA score, shock, respiratory system dysfunction, circulation system dysfunction, diabetes and nutrition support need. The regression tree indicated comorbidity of two or more kinds of shock as the most important factor for prolonged length of ICU stay in the studied sample. Conclusions: Comorbidity of two or more kinds of shock is the most important factor of length of ICU stay in the studied sample. Since there are differences of ICU patient characteristics between wards and hospitals, consideration of the data-mining technique should be given by the intensivists as a length of ICU stay prediction tool.

Development of Needs Assessment tool and Extraction Algorithm Fitting for Individuals in Care Management for the disabled in Home (재가장애인 사례관리의 욕구사정 정확도 향상을 위한 사정도구 개발과 욕구추출 알고리즘 과정 연구 - 데이터 마이닝 분석기법을 활용하여 -)

  • Kim, Young-Sook;Jung, Kook-In
    • Korean Journal of Social Welfare
    • /
    • v.60 no.2
    • /
    • pp.155-173
    • /
    • 2008
  • The study aims to develop a assessment tool to provide the in-home disabled in a local community with appropriate services in consideration of physical, emotional, social and environmental circumstances. After collection of assesment data of 200 in-home disabled through use of the tool, a desire-extracting algorithm was developed to provide a service to real needs through the use of decision tree analysis on data mining. The study was conducted for Five months from June 2006 through October 2006, and it is divided into development of an assessment tool and extraction of real needs through the use of the tool. The basic framework of the development of the tool was established through the examination of related literature, the subjective satisfaction of the assessment tool and items were developed through the use of a focus group and experts, and verification was implemented through the use of statistics to confirm the validity of the tool. As a result of the verification, the tool secured following validity and credibility as seen in

    and
    . In addition, real needs-extraction algorithm was established through the use of the assessment tool, and the algorithm according each desire was suggested as seen in . The assessment tool and algorithm suggested as a result of the study can be used as data to conduct systematic management of examples through the confirmation of objective desire of in-home disabled.

  • PDF
  • Effective Classification Framework Design and Implementation for Rural Regional Information using Principal Component Analysis and Cluster Analysis (주성분 분석 및 군집분석을 이용한 지역정보 유형화 프레임워크의 설계와 구현)

    • Suh, Kyo;Kim, Tae-Gon;Lee, Ji-Min;Lee, Jeong-Jae
      • Journal of The Korean Society of Agricultural Engineers
      • /
      • v.54 no.1
      • /
      • pp.73-81
      • /
      • 2012
    • For planning and developing rural regions, it is very important to understand and utilize regional characteristics including social, demographic, and economic aspects. The purpose of this study is to find effective analysis techniques and provide a procedure design for mining regional characteristics in South Korea through reviewing and analyzing 41 related studies. The engaged research methods can be classified into five categories (PCA+CA, PCA, CA, GIS, and PCA+GIS) with the combination of three methodologies: principal component analysis (PCA), cluster analysis (CA), and geographical information system (GIS). The combination of PCA and CA occupied about 40 % of research methods used in related studies. The analysis tool of Korean Rural Information Supporting System (KRISS) is designed based on the outcomes of this study and applied to classify the regional capacity of agriculture using agricultural census data (2000) for evaluating its applicability.


    (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.