• Title/Summary/Keyword: Decision Tree analysis


FMECA using Fault Tree Analysis (FTA) and Fuzzy Logic (결함수분석법과 퍼지논리를 이용한 FMECA 평가)

  • Kim, Dong-Jin;Shin, Jun-Seok;Kim, Hyung-Jun;Kim, Jin-O;Kim, Hyung-Chul
    • Proceedings of the KSR Conference / 2007.11a / pp.1529-1532 / 2007
  • Failure Mode, Effects, and Criticality Analysis (FMECA) is an extension of FMEA that includes a criticality analysis. The criticality analysis charts the probability of failure modes against the severity of their consequences. The result highlights failure modes with relatively high probability and severity of consequences, allowing remedial effort to be directed where it will produce the greatest value. However, there are several limitations. Measuring the severity of failure consequences is subjective and linguistic. Since the result of FMECA gives only qualitative and quantitative information, it must be re-analysed to prioritize critical units. Fuzzy set theory, introduced by Lotfi A. Zadeh (1965), greatly extended classical set theory. Fuzzy logic, built on fuzzy set theory, models the human reasoning process. An IF-THEN fuzzy rule-based assessment approach can model an expert's decision logic appropriately. Fault tree analysis (FTA) is one of the most common fault modeling techniques and is widely used in practice in many fields. In this paper, a simple fault tree analysis is proposed to measure the severity of components. A fuzzy rule-based assessment method interprets linguistic variables to determine critical unit priorities. A railway transforming system is analysed to illustrate the proposed method.

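As a rough illustration of the IF-THEN fuzzy rule assessment mentioned in the abstract above, the sketch below scores a unit's criticality from a failure probability and a severity rating. Everything specific here is an assumption made for illustration and is not taken from the paper: the 0-10 input scale, the triangular membership functions in `TERMS`, the rule table `RULES`, the output levels in `LEVEL`, and the example units. The paper derives severity from a fault tree; this sketch simply takes severity as a given number.

```python
def tri(x, a, b, c):
    """Triangular membership with feet a, c and peak b (a == b or b == c gives a shoulder)."""
    if x <= a or x >= c:
        return 1.0 if x == b else 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

# Hypothetical linguistic terms on an assumed 0-10 scale.
TERMS = {"low": (0, 0, 5), "medium": (0, 5, 10), "high": (5, 10, 10)}

# Hypothetical crisp value attached to each output term.
LEVEL = {"low": 1.0, "medium": 5.0, "high": 9.0}

# Hypothetical rule base: IF probability is P AND severity is S THEN criticality is C.
RULES = {
    ("low", "low"): "low",       ("low", "medium"): "low",       ("low", "high"): "medium",
    ("medium", "low"): "low",    ("medium", "medium"): "medium", ("medium", "high"): "high",
    ("high", "low"): "medium",   ("high", "medium"): "high",     ("high", "high"): "high",
}

def criticality(probability, severity):
    """Fire every rule (AND = min) and return the strength-weighted average output."""
    num = den = 0.0
    for (p_term, s_term), out in RULES.items():
        strength = min(tri(probability, *TERMS[p_term]), tri(severity, *TERMS[s_term]))
        num += strength * LEVEL[out]
        den += strength
    return num / den if den else 0.0

# Made-up units with (probability, severity) ratings, ranked by criticality.
units = {"transformer": (3, 9), "breaker": (6, 6), "relay": (8, 2)}
ranking = sorted(units, key=lambda u: criticality(*units[u]), reverse=True)
```

The weighted-average defuzzification is chosen here only because it is short; a centroid over output fuzzy sets would be an equally valid choice.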

Pattern Classification Model Design and Performance Comparison for Data Mining of Time Series Data (시계열 자료의 데이터마이닝을 위한 패턴분류 모델설계 및 성능비교)

  • Lee, Soo-Yong;Lee, Kyoung-Joung
    • Journal of the Korean Institute of Intelligent Systems / v.21 no.6 / pp.730-736 / 2011
  • In this paper, we design pattern classification models that can reflect the latest trends in time series. Fusion models based on statistical and AI methods have been shown to be superior to traditional ones as pattern classification models supporting decision making. In particular, the hit rates of pattern classification models combined with fuzzy theory are relatively higher. The statistical SVM models combined with a fuzzy membership function, and the models combining a neural network with FCM, have shown good performance. BPN, PNN, FNN, FCM, SVM, FSVM, Decision Tree, Time Series Analysis, and Regression Analysis were used as pattern classification models in the experiments of this paper. The data sets were an economic index database with time series properties from the financial market (Korea, KOSPI200 DB) and an electrocardiogram database of arrhythmia patients in hospital emergencies (USA, MIT-BIH DB).

A Study on the Prediction Models of Used Car Prices for Domestic Brands Using Machine Learning (머신러닝을 활용한 브랜드별 국내 중고차 가격 예측 모델에 관한 연구)

  • Seungjun Yim;Joungho Lee;Choonho Ryu
    • Journal of Service Research and Studies / v.13 no.3 / pp.105-126 / 2023
  • The domestic used car market continues to grow along with used car online platform services. These services disclose vehicle specifications, accident history, inspection history, and detailed options to consumers. Most preceding studies predicted used car prices from vehicle specifications and some vehicle options. Those studies confirmed a nonlinear relationship between used car prices and some specification variables. Accordingly, the researchers tried to solve the nonlinearity problem by applying machine learning models. In general, regression-based machine learning models had the advantage of revealing the actual influence and direction of variables, but the disadvantage of worse cost function figures than decision tree based machine learning models. This study attempted to predict used car prices for six domestic brands using both vehicle specifications and vehicle options, and in doing so to combine the advantages of the two types of machine learning model. To this end, we sequentially ran a regression-based machine learning model and a decision tree based machine learning model. As a result of the analysis, the practical influence and direction of the variables were identified for each brand, and the best tree-based machine learning model was selected. The implications of this study are as follows. It will help buyers and sellers who use used car online platform services to predict approximate used car prices. It is also hoped that it will help resolve the problems caused by information inequality among users of these services.

Analysis of Healthcare Quality Indicator using Data Mining and Decision Support System

  • Young M.Chae;Kim, Hye S.;Seung H. Ho
    • Proceedings of the Korea Intelligent Information System Society Conference / 2001.01a / pp.352-357 / 2001
  • This study presents an analysis of healthcare quality indicators using data mining for developing quality improvement strategies. Specifically, important factors influencing inpatient mortality were identified using a decision tree method for data mining, based on 8,405 patients who were discharged from the study hospital between December 1, 2000 and January 31, 2001. The important factors for inpatient mortality were length of stay, disease classes, discharge departments, and age groups. The optimum range of the target group for inpatient healthcare quality indicators was identified from the gains chart. In addition, a decision support system was developed in Visual Basic 6.0 to analyze and monitor trends in quality indicators. Guidelines and a tutorial for quality improvement activities were also included in the system. In the future, other quality indicators should be analyzed to effectively support hospital-wide continuous quality improvement (CQI) activity, and the decision support system should be well integrated with the hospital OCS (Order Communication System) to support concurrent review.

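The gains chart mentioned in the abstract above can be sketched as a cumulative gains calculation: rank cases by predicted risk, then report the share of all observed events captured in the top fraction of cases. This is a minimal sketch under assumptions; the function name `cumulative_gains`, the number of bins, and the toy scores and outcomes are all hypothetical and do not come from the study.

```python
def cumulative_gains(scores, outcomes, bins=10):
    """Return, for each of `bins` equal slices of the score-ranked cases,
    the cumulative fraction of positive outcomes captured so far."""
    total_positive = sum(outcomes)
    if total_positive == 0:
        raise ValueError("at least one positive outcome is required")
    # Indices of cases ordered from highest to lowest predicted score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    gains = []
    for k in range(1, bins + 1):
        cutoff = round(len(order) * k / bins)
        captured = sum(outcomes[i] for i in order[:cutoff])
        gains.append(captured / total_positive)
    return gains

# Toy data: ten cases with made-up risk scores and observed outcomes (1 = event).
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.0]
outcomes = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
gains = cumulative_gains(scores, outcomes, bins=5)
```

Reading the curve for the point where additional cases add little extra gain is one common way to pick a target group size.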

Data Mining Algorithm Based on Fuzzy Decision Tree for Pattern Classification (퍼지 결정트리를 이용한 패턴분류를 위한 데이터 마이닝 알고리즘)

  • Lee, Jung-Geun;Kim, Myeong-Won
    • Journal of KIISE:Software and Applications / v.26 no.11 / pp.1314-1323 / 1999
  • With the widespread use of computers, it has become easy to generate and collect data, and there is a need to acquire useful knowledge from data automatically. In data mining, the acquired knowledge needs to be both accurate and comprehensible. In this paper, we propose an efficient fuzzy rule generation algorithm based on a fuzzy decision tree for data mining. We combine the comprehensibility of rules generated by decision trees such as ID3 and C4.5 with the inference and expressive power of fuzzy set theory. In particular, fuzzy rules allow us to effectively classify patterns with non-axis-parallel decision boundaries, which are difficult to handle with methods that draw decision boundaries parallel to the attribute axes. In our algorithm, we first determine an appropriate set of membership functions for each attribute of the data using histogram analysis. Given this set of membership functions, we then construct a fuzzy decision tree in a way similar to ID3 and C4.5. We also apply a genetic algorithm to tune the initial set of membership functions. We tested our algorithm on several benchmark data sets, including the IRIS data, the Wisconsin breast cancer data, and the credit screening data. The experimental results show that our method is more efficient in performance and in comprehensibility of rules than other methods, including C4.5.
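To make the "ID3-like" tree construction over membership functions more concrete, the sketch below computes an information gain in which each sample contributes to a branch in proportion to its membership degree rather than being assigned to exactly one branch. It is only a sketch under assumptions: the helper names `tri`, `fuzzy_entropy`, and `fuzzy_gain` are hypothetical, the membership functions are fixed by hand (the abstract derives them from histograms and tunes them with a genetic algorithm), and the paper's exact gain formula may differ.

```python
import math

def tri(x, a, b, c):
    """Triangular membership with feet a, c and peak b (a == b or b == c gives a shoulder)."""
    if x <= a or x >= c:
        return 1.0 if x == b else 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)

def fuzzy_entropy(weights, labels):
    """Entropy of the class distribution, with each sample weighted by its membership."""
    total = sum(weights)
    if total == 0:
        return 0.0
    entropy = 0.0
    for cls in set(labels):
        p = sum(w for w, y in zip(weights, labels) if y == cls) / total
        if p > 0:
            entropy -= p * math.log2(p)
    return entropy

def fuzzy_gain(values, labels, terms):
    """Information gain of splitting one attribute into fuzzy terms.

    terms maps a term name to its (a, b, c) triangle parameters."""
    base = fuzzy_entropy([1.0] * len(values), labels)
    memberships = [[tri(v, *abc) for v in values] for abc in terms.values()]
    total = sum(sum(m) for m in memberships)
    if total == 0:
        return 0.0
    remainder = sum(sum(m) / total * fuzzy_entropy(m, labels) for m in memberships)
    return base - remainder

# Hand-picked terms on an assumed 0-10 attribute range.
terms = {"low": (0, 0, 5), "high": (5, 10, 10)}
informative = fuzzy_gain([1, 2, 8, 9], ["A", "A", "B", "B"], terms)
uninformative = fuzzy_gain([1, 9, 1, 9], ["A", "A", "B", "B"], terms)
```

A tree builder would call `fuzzy_gain` for every candidate attribute at a node and split on the attribute with the largest value, as ID3 does with crisp gain.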

Questionnaire Survey and Analysis Using Data Mining (데이터마이닝을 이용한 설문조사 및 분석)

  • 박만희;채화성;신완선
    • Journal of Korean Society of Industrial and Systems Engineering / v.25 no.5 / pp.46-52 / 2002
  • With the development of Internet-based information technology, today's database systems need to collect huge amounts of questionnaire data, so that data has to be manageable. However, it is difficult to find analytic data or useful information in a high-capacity database. Data mining can solve these problems and make use of the database. Questionnaire analysis using data mining has revealed relevant patterns that were previously invisible or tended to be overlooked. These patterns can be applied as new business rules. The purpose of this research is to analyze questionnaire results with data mining and to present results that make decision making easier. Recognition and analysis of these data mining techniques indicate the suitable type of questionnaire survey. This research focuses on the composition of current questionnaires and on a questionnaire model suited to analyzing their type. A comparison between the actual questionnaire results and conventional statistical analysis is also examined.

Analysis on the Usage of Internet Games for Children with Decision Tree Rules (의사결정규칙을 이용한 아동의 교육용 인터넷 게임 활용실태 분석)

  • Kim, Yong-Dae;Jung, Hui-Suk;Choi, Eun-Jeong;Park, Byung-Sun;Han, Jeong-Hye
    • Journal of The Korean Association of Information Education / v.5 no.3 / pp.389-400 / 2001
  • Internet games have spread quickly on the web, and many kinds of fun games are easy for users to play, so they can be applied to ICT (Information Communication Technology) education. In this paper, we analyze the usage of Internet games by children and teachers using the decision tree algorithm, one of the popular data mining techniques. The results show the patterns of children's and teachers' usage of Internet games.


Hybrid Learning Architectures for Advanced Data Mining:An Application to Binary Classification for Fraud Management (개선된 데이터마이닝을 위한 혼합 학습구조의 제시)

  • Kim, Steven H.;Shin, Sung-Woo
    • Journal of Information Technology Application / v.1 / pp.173-211 / 1999
  • The task of classification permeates all walks of life, from business and economics to science and public policy. In this context, nonlinear techniques from artificial intelligence have often proven to be more effective than the methods of classical statistics. The objective of knowledge discovery and data mining is to support decision making through the effective use of information. The automated approach to knowledge discovery is especially useful when dealing with large data sets or complex relationships. For many applications, automated software may find subtle patterns which escape the notice of manual analysis, or whose complexity exceeds the cognitive capabilities of humans. This paper explores the utility of a collaborative learning approach involving integrated models in the preprocessing and postprocessing stages. For instance, a genetic algorithm performs feature-weight optimization in a preprocessing module. Moreover, an inductive tree, artificial neural network (ANN), and k-nearest neighbor (kNN) techniques serve as postprocessing modules. More specifically, the postprocessors act as second-order classifiers which determine the best first-order classifier on a case-by-case basis. In addition to the second-order models, a voting scheme is investigated as a simple but efficient postprocessing model. The first-order models consist of statistical and machine learning models such as logistic regression (logit), multivariate discriminant analysis (MDA), ANN, and kNN. The genetic algorithm, inductive decision tree, and voting scheme act as kernel modules for collaborative learning. These ideas are explored against the background of a practical application relating to financial fraud management, which exemplifies a binary classification problem.

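The two postprocessing ideas in the abstract above, a voting scheme and a second-order classifier that picks the best first-order model case by case, can be sketched roughly as follows. This is an illustration under assumptions, not the paper's implementation: the function names, the tie rule, the nearest-neighbour selector, the toy threshold models, and the `history` of past cases are all hypothetical.

```python
def vote(predictions):
    """Majority vote over binary (0/1) first-order predictions.
    Assumption: a tie counts as 1, e.g. to flag the case for review."""
    return 1 if 2 * sum(predictions) >= len(predictions) else 0

def select_model(x, history):
    """Second-order step as a 1-nearest-neighbour lookup: return the index of the
    first-order model that was best on the most similar past case.
    history is a list of (features, best_model_index) pairs."""
    nearest = min(history, key=lambda h: sum((a - b) ** 2 for a, b in zip(x, h[0])))
    return nearest[1]

def second_order_predict(x, models, history):
    """Classify x with whichever first-order model the selector chooses."""
    return models[select_model(x, history)](x)

# Toy first-order classifiers on two made-up features (stand-ins for logit, MDA, ANN, ...).
models = [
    lambda x: int(x[0] > 0.5),
    lambda x: int(x[1] > 0.5),
    lambda x: int(x[0] + x[1] > 1.5),
]

# Made-up record of which model was most reliable on past cases.
history = [((0.9, 0.1), 0), ((0.1, 0.9), 1)]

case = (0.8, 0.2)
voted = vote([m(case) for m in models])
selected = second_order_predict(case, models, history)
```

On this toy case the two schemes can disagree, which is the situation where a case-by-case selector may add value over plain voting.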

A Study of the Integration of Individual Classification Model in Data Mining for the Credit Evaluation (신용평가를 위한 데이터마이닝 분류모형의 통합모형에 관한 연구)

  • Kim Kap Sik
    • The KIPS Transactions:PartD / v.12D no.2 s.98 / pp.211-218 / 2005
  • This study presents an integrated data mining model for the credit evaluation of the customers of a capital company. Based on customer information and financing processes in the capital market, we derived individual models from multi-layered perceptrons (MLP), multivariate discriminant analysis (MDA), and a decision tree. Further, the results from the existing models were compared with the results from the integrated model built using a genetic algorithm. The integrated model presented in this study turned out to be superior to the existing models. This study contributes not only to verifying the existing individual models but also to overcoming the limitations of the existing approaches.

Predictors of Suicide Ideation in Rural Residents: Based on Comparison Predictors of Suicide Ideation in Urban Residents (농촌 주민의 자살생각 예측요인 -도시 주민의 자살생각 예측요인과의 비교를 중심으로-)

  • Kim, Yun Jeong;Kang, Hyun Jeong
    • Journal of Agricultural Extension & Community Development / v.19 no.3 / pp.617-647 / 2012
  • The purpose of this study was to identify the predictors of suicidal ideation among rural residents, in comparison with the predictors of suicidal ideation among urban residents. The participants were adolescents, adults, and seniors sampled from 10 provinces across the country from May to August 2010. The data were analysed using decision tree analysis. The major results of the study were as follows. First, a main predictor of suicidal ideation for rural residents was high depression. Unlike rural residents, urban residents reporting high depression and influence of mass media showed high suicidal ideation. Second, the interaction of depression and family solidarity was an important predictor of suicidal ideation for both rural and urban residents, but the conditions under which it operated differed between the two groups. Rural residents reporting high depression and high family solidarity showed high suicidal ideation, whereas urban residents reporting low depression and high family solidarity showed low suicidal ideation. Stress also operated differently. Rural residents reporting moderate depression, low family solidarity, and high stress showed high suicidal ideation, but stress was not an important predictor of suicidal ideation for urban residents. Finally, rural residents reporting low depression and low stress showed the lowest level of suicidal ideation, while urban residents reporting low family solidarity and low depression showed the lowest level of suicidal ideation.