• 제목/요약/키워드: Data Mining Tool Framework

검색결과 19건 처리시간 0.02초

Framework for False Alarm Pattern Analysis of Intrusion Detection System using Incremental Association Rule Mining

  • Chon Won Yang;Kim Eun Hee;Shin Moon Sun;Ryu Keun Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2004년도 Proceedings of ISRS 2004
    • /
    • pp.716-718
    • /
    • 2004
  • The false alarm data in intrusion detection systems are divided into false positive and false negative. The false positive makes bad effects on the performance of intrusion detection system. And the false negative makes bad effects on the efficiency of intrusion detection system. Recently, the most of works have been studied the data mining technique for analysis of alert data. However, the false alarm data not only increase data volume but also change patterns of alert data along the time line. Therefore, we need a tool that can analyze patterns that change characteristics when we look for new patterns. In this paper, we focus on the false positives and present a framework for analysis of false alarm pattern from the alert data. In this work, we also apply incremental data mining techniques to analyze patterns of false alarms among alert data that are incremental over the time. Finally, we achieved flexibility by using dynamic support threshold, because the volume of alert data as well as included false alarms increases irregular.

  • PDF

Smart Agents and Multimedia Systems

  • Kim, Steven H.
    • 한국데이타베이스학회:학술대회논문집
    • /
    • 한국데이타베이스학회 1997년도 International Conference MULTIMEDIA DATABASES on INTERNET
    • /
    • pp.215-269
    • /
    • 1997
  • Outline $\textbullet$ Introduction $\textbullet$ Multimedia - Types of Data - Motivation - Key issue - Hardware Products - Application Areas $\textbullet$ Agents - Rationale for Agents - Sedentary vs. Mobile - Functional Categories - Application Areas $\textbullet$ Data Mining - 2-D Framework for Data Mining Tools - Classification of Tool - Application Areas - Learning Methodologies * Case Based Reasoning * Neural Networks * Statistical Learning: Orthogonal Arrays * Multi-strategy Learning $\textbullet$ Case Study - Finbot $\textbullet$ Conclusion

  • PDF

A case study of ECN data conversion for Korean and foreign ecological data integration

  • Lee, Hyeonjeong;Shin, Miyoung;Kwon, Ohseok
    • Journal of Ecology and Environment
    • /
    • 제41권5호
    • /
    • pp.142-144
    • /
    • 2017
  • In recent decades, as it becomes increasingly important to monitor and research long-term ecological changes, worldwide attempts are being conducted to integrate and manage ecological data in a unified framework. Especially domestic ecological data in South Korea should be first standardized based on predefined common protocols for data integration, since they are often scattered over many different systems in various forms. Additionally, foreign ecological data should be converted into a proper unified format to be used along with domestic data for association studies. In this study, our interest is to integrate ECN data with Korean domestic ecological data under our unified framework. For this purpose, we employed our semi-automatic data conversion tool to standardize foreign data and utilized ground beetle (Carabidae) datasets collected from 12 different observatory sites of ECN. We believe that our attempt to convert domestic and foreign ecological data into a standardized format in a systematic way will be quite useful for data integration and association analysis in many ecological and environmental studies.

국내 공학 교육통계 시스템 구축 (The Construction of Engineering Educational Statistics System in Korea)

  • 안혜정;김지현;홍성조
    • 공학교육연구
    • /
    • 제19권2호
    • /
    • pp.53-59
    • /
    • 2016
  • Along with the industry growth, engineering colleges in Korea has have a quantitative growth. Many of the policy promotions and budgets for engineering colleges from the government are supported. And the various monitoring methods to verify their achievement have demanded. This paper deals with the construction of engineering educational statistics system in Korea. It named Korea Engineering Data Management System(K-EDMS). This system is based on the data mining tool and supports data-based decision making for an advanced engineering education service. This paper presents related researches of case studies. Then, we have designed K-EDMS, and constructed 157 cases for engineering colleges of the year 2014.

자취 군집화를 통한 프로세스 마이닝의 성능 개선 (Improving Process Mining with Trace Clustering)

  • 송민석;;;정재윤
    • 대한산업공학회지
    • /
    • 제34권4호
    • /
    • pp.460-469
    • /
    • 2008
  • Process mining aims at mining valuable information from process execution results (called "event logs"). Even though process mining techniques have proven to be a valuable tool, the mining results from real process logs are usually too complex to interpret. The main cause that leads to complex models is the diversity of process logs. To address this issue, this paper proposes a trace clustering approach that splits a process log into homogeneous subsets and applies existing process mining techniques to each subset. Based on log profiles from a process log, the approach uses existing clustering techniques to derive clusters. Our approach are implemented in ProM framework. To illustrate this, a real-life case study is also presented.

Prediction of User Preferred Cosmetic Brand Based on Unified Fuzzy Rule Inference

  • 김진성
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2005년도 추계학술대회 학술발표 논문집 제15권 제2호
    • /
    • pp.271-275
    • /
    • 2005
  • In this research, we propose a Unified Fuzzy rule-based knowledge Inference Systems UFIS) to help the expert in cosmetic brand detection. Users' preferred cosmetic product detection is very important in the level of CRM. To this Purpose, many corporations trying to develop an efficient data mining tool. In this study, we develop a prototype fuzzy rule detection and inference system. The framework used in this development is mainly based on two different mechanisms such as fuzzy rule extraction and RDB (Relational DB)-based fuzzy rule inference. First, fuzzy clustering and fuzzy rule extraction deal with the presence of the knowledge in data base and its value is presented with a value between $0\∼1$. Second, RDB and SQL(Structured Query Language)-based fuzzy rule inference mechanism provide more flexibility in knowledge management than conventional non-fuzzy value-based KMS(Knowledge Management Systems)

  • PDF

Prediction of User's Preference by using Fuzzy Rule & RDB Inference: A Cosmetic Brand Selection

  • Kim, Jin-Sung
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제5권4호
    • /
    • pp.353-359
    • /
    • 2005
  • In this research, we propose a Unified Fuzzy rule-based knowledge Inference Systems (UFIS) to help the expert in cosmetic brand detection. Users' preferred cosmetic product detection is very important in the level of CRM. To this purpose, many corporations trying to develop an efficient data mining tool. In this study, we develop a prototype fuzzy rule detection and inference system. The framework used in this development is mainly based on two different mechanisms such as fuzzy rule extraction and RDB (Relational DB)-based fuzzy rule inference. First, fuzzy clustering and fuzzy rule extraction deal with the presence of the knowledge in data base and its value is presented with a value between 0 -1. Second, RDB and SQL (Structured Query Language)-based fuzzy rule inference mechanism provide more flexibility in knowledge management than conventional non-fuzzy value-based KMS (Knowledge Management Systems).

Prediction of Length of ICU Stay Using Data-mining Techniques: an Example of Old Critically Ill Postoperative Gastric Cancer Patients

  • Zhang, Xiao-Chun;Zhang, Zhi-Dan;Huang, De-Sheng
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제13권1호
    • /
    • pp.97-101
    • /
    • 2012
  • Objective: With the background of aging population in China and advances in clinical medicine, the amount of operations on old patients increases correspondingly, which imposes increasing challenges to critical care medicine and geriatrics. The study was designed to describe information on the length of ICU stay from a single institution experience of old critically ill gastric cancer patients after surgery and the framework of incorporating data-mining techniques into the prediction. Methods: A retrospective design was adopted to collect the consecutive data about patients aged 60 or over with a gastric cancer diagnosis after surgery in an adult intensive care unit in a medical university hospital in Shenyang, China, from January 2010 to March 2011. Characteristics of patients and the length their ICU stay were gathered for analysis by univariate and multivariate Cox regression to examine the relationship with potential candidate factors. A regression tree was constructed to predict the length of ICU stay and explore the important indicators. Results: Multivariate Cox analysis found that shock and nutrition support need were statistically significant risk factors for prolonged length of ICU stay. Altogether, eight variables entered the regression model, including age, APACHE II score, SOFA score, shock, respiratory system dysfunction, circulation system dysfunction, diabetes and nutrition support need. The regression tree indicated comorbidity of two or more kinds of shock as the most important factor for prolonged length of ICU stay in the studied sample. Conclusions: Comorbidity of two or more kinds of shock is the most important factor of length of ICU stay in the studied sample. Since there are differences of ICU patient characteristics between wards and hospitals, consideration of the data-mining technique should be given by the intensivists as a length of ICU stay prediction tool.

재가장애인 사례관리의 욕구사정 정확도 향상을 위한 사정도구 개발과 욕구추출 알고리즘 과정 연구 - 데이터 마이닝 분석기법을 활용하여 - (Development of Needs Assessment tool and Extraction Algorithm Fitting for Individuals in Care Management for the disabled in Home)

  • 김영숙;정국인
    • 한국사회복지학
    • /
    • 제60권2호
    • /
    • pp.155-173
    • /
    • 2008
  • 본 연구는 지역사회 내에 거주하는 재가 장애인의 신체적, 심리적, 사회 환경적 상황을 종합적으로 평가하여 그에 적합한 서비스를 제공하기 위한 욕구 중심의 사정도구를 개발하고, 개발된 도구를 활용하여 재가 장애인 200명의 사정 데이터를 수집한 후 데이터마이닝의 의사결정 나무분석 기법을 활용하여 욕구에 적합한 서비스제공을 위한 욕구 추출 알고리즘을 구성하였다. 본 연구는 2006년 6월부터 10월까지 5개월간 이루어졌으며, 크게 사정도구 개발과 개발된 도구를 활용한 욕구추출 과정으로 나뉠 수 있다. 도구개발은 문헌고찰을 통하여 기본적인 틀을 구성하였고, 포커스집단과 전문가들을 통하여 사정도구의 주관적 호소와 욕구 문항을 개발하였으며, 도구의 타당도를 확인하기 위해 통계적인 검증과정을 거쳤다. 검증결과 본 도구는 <표 2>와 <표 3>의 결과처럼 타당도와 신뢰도를 확보하였으며, 이 도구를 활용하여 욕구추출 알고리즘 요약을 <표 5>와 같이 제시하였다. 본 연구의 결과로 제시한 사정도구와 알고리즘은 재가 장애인의 객관적 욕구를 사정하고 확인함으로써 체계적인 사례관리를 수행하는 자료로 활용될 수 있다.

  • PDF

주성분 분석 및 군집분석을 이용한 지역정보 유형화 프레임워크의 설계와 구현 (Effective Classification Framework Design and Implementation for Rural Regional Information using Principal Component Analysis and Cluster Analysis)

  • 서교;김태곤;이지민;이정재
    • 한국농공학회논문집
    • /
    • 제54권1호
    • /
    • pp.73-81
    • /
    • 2012
  • For planning and developing rural regions, it is very important to understand and utilize regional characteristics including social, demographic, and economic aspects. The purpose of this study is to find effective analysis techniques and provide a procedure design for mining regional characteristics in South Korea through reviewing and analyzing 41 related studies. The engaged research methods can be classified into five categories (PCA+CA, PCA, CA, GIS, and PCA+GIS) with the combination of three methodologies: principal component analysis (PCA), cluster analysis (CA), and geographical information system (GIS). The combination of PCA and CA occupied about 40 % of research methods used in related studies. The analysis tool of Korean Rural Information Supporting System (KRISS) is designed based on the outcomes of this study and applied to classify the regional capacity of agriculture using agricultural census data (2000) for evaluating its applicability.