• Title/Summary/Keyword: 의사결정 나무

Search Result 563, Processing Time 0.028 seconds

A study for improving data mining methods for continuous response variables (연속형 반응변수를 위한 데이터마이닝 방법 성능 향상 연구)

  • Choi, Jin-Soo;Lee, Seok-Hyung;Cho, Hyung-Jun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.5
    • /
    • pp.917-926
    • /
    • 2010
  • It is known that bagging and boosting techniques improve the performance in classification problem. A number of researchers have proved the high performance of bagging and boosting through experiments for categorical response but not for continuous response. We study whether bagging and boosting improve data mining methods for continuous responses such as linear regression, decision tree, neural network through bagging and boosting. The analysis of eight real data sets prove the high performance of bagging and boosting empirically.

Measuring Pattern Recognition from Decision Tree and Geometric Data Analysis of Industrial CR Images (산업용 CR영상의 기하학적 데이터 분석과 의사결정나무에 의한 측정 패턴인식)

  • Hwang, Jung-Won;Hwang, Jae-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.45 no.5
    • /
    • pp.56-62
    • /
    • 2008
  • This paper proposes the use of decision tree classification for the measuring pattern recognition from industrial Computed Radiography(CR) images used in nondestructive evaluation(NDE) of steel-tubes. It appears that NDE problems are naturally desired to have machine learning techniques identify patterns and their classification. The attributes of decision tree are taken from NDE test procedure. Geometric features, such as radiative angle, gradient and distance, are estimated from the analysis of input image data. These factors are used to make it easy and accurate to classify an input object to one of the pre-specified classes on decision tree. This algerian is to simplify the characterization of NDE results and to facilitate the determination of features. The experimental results verify the usefulness of proposed algorithm.

의사결정나무를 이용한 개인휴대통신 해지자 분석

  • 최종후;서두성
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1998.10a
    • /
    • pp.377-380
    • /
    • 1998
  • 본 논문에서는 최근 데이터마이닝의 도구로 활발하게 소개되고 있는 의사결정나무 분석을 이용하여 개인휴대통신의 해지자 분석을 실시한다. 또한 로지스틱 회귀모형을 이용하여 가입고객의 해지 가능성에 대한 점수화를 시도한다.

  • PDF

Comparative Analysis of Predictors of Depression for Residents in a Metropolitan City using Logistic Regression and Decision Making Tree (로지스틱 회귀분석과 의사결정나무 분석을 이용한 일 대도시 주민의 우울 예측요인 비교 연구)

  • Kim, Soo-Jin;Kim, Bo-Young
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.12
    • /
    • pp.829-839
    • /
    • 2013
  • This study is a descriptive research study with the purpose of predicting and comparing factors of depression affecting residents in a metropolitan city by using logistic regression analysis and decision-making tree analysis. The subjects for the study were 462 residents ($20{\leq}aged{\angle}65$) in a metropolitan city. This study collected data between October 7, 2011 and October 21, 2011 and analyzed them with frequency analysis, percentage, the mean and standard deviation, ${\chi}^2$-test, t-test, logistic regression analysis, roc curve, and a decision-making tree by using SPSS 18.0 program. The common predicting variables of depression in community residents were social dysfunction, perceived physical symptom, and family support. The specialty and sensitivity of logistic regression explained 93.8% and 42.5%. The receiver operating characteristic (roc) curve was used to determine an optimal model. The AUC (area under the curve) was .84. Roc curve was found to be statistically significant (p=<.001). The specialty and sensitivity of decision-making tree analysis were 98.3% and 20.8% respectively. As for the whole classification accuracy, the logistic regression explained 82.0% and the decision making tree analysis explained 80.5%. From the results of this study, it is believed that the sensitivity, the classification accuracy, and the logistics regression analysis as shown in a higher degree may be useful materials to establish a depression prediction model for the community residents.

The Transfer Technique among Decision Tree Models for Distributed Data Mining (분산형 데이터마이닝 구현을 위한 의사결정나무 모델 전송 기술)

  • Kim, Choong-Gon;Woo, Jung-Geun;Baik, Sung-Wook
    • Journal of Digital Contents Society
    • /
    • v.8 no.3
    • /
    • pp.309-314
    • /
    • 2007
  • A decision tree algorithm should be modified to be suitable in distributed and collaborative environments for distributed data mining. The distributed data mining system proposed in this paper consists of several agents and a mediator. Each agent deals with a local data mining for data in each local site and communicates with one another to build the global decision tree model. The mediator helps several agents to efficiently communicate among them. One of advantages in distributed data mining is to save much time to analyze huge data with several agents. The paper focuses on a transfer technique among agents dealing with each local decision tree model to reduce huge overhead in communication among them.

  • PDF

Study on the Classification Methodology for DSRC Travel Speed Patterns Using Decision Trees (의사결정나무 기법을 적용한 DSRC 통행속도패턴 분류방안)

  • Lee, Minha;Lee, Sang-Soo;Namkoong, Seong;Choi, Keechoo
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.2
    • /
    • pp.1-11
    • /
    • 2014
  • In this paper, travel speed patterns were deducted based on historical DSRC travel speed data using Decision Tree technique to improve availability of the massive amount of historical data. These patterns were designed to reflect spatio-temporal vicissitudes in reality by generating pattern units classified by months, time of day, and highway sections. The study area was from Seoul TG to Ansung IC sections on Gyung-bu highway where high peak time of day frequently occurs in South Korea. Decision Tree technique was applied to categorize travel speed according to day of week. As a result, five different pattern groups were generated: (Mon)(Tue Wed Thu)(Fri)(Sat)(Sun). Statistical verification was conducted to prove the validity of patterns on nine different highway sections, and the accuracy of fitting was found to be 93%. To reduce travel pattern errors against individual travel speed data, inclusion of four additional variables were also tested. Among those variables, 'traffic condition on previous month' variable improved the pattern grouping accuracy by reducing 50% of speed variance in the decision tree model developed.

Prediction Model of Construction Safety Accidents using Decision Tree Technique (의사결정나무기법을 이용한 건설재해 사전 예측모델 개발)

  • Cho, Yerim;Kim, Yeon-Choel;Shin, Yoonseok
    • Journal of the Korea Institute of Building Construction
    • /
    • v.17 no.3
    • /
    • pp.295-303
    • /
    • 2017
  • Over the past 7 years, the number of victims of construction disasters has been gradually increasing. Compared with projects in other industries, construction projects are highly exposed to safety risks. For this reason, the research methods of predicting and managing the risk of construction disasters are urgently needed that can be applied to a construction site. This study aims to propose a prediction model for a construction disaster using the decision tree technique. The developed the model is reviewed the applicability by evaluating its accuracy based on disaster data. The top three of the prediction values obtained from the proposed model were enumerated, and then the cumulative accuracy were also calculated. The prediction accuracy was 40 percent for the first value, but the cumulative accuracy was 80 percent. Thus, as more disaster data was accumulated, the cumulative accuracy appeared to be higher. If utilized in construction sites, the model proposed in this study would contribute to a reduction in the rate of construction disasters.

Decision Tree Techniques with Feature Reduction for Network Anomaly Detection (네트워크 비정상 탐지를 위한 속성 축소를 반영한 의사결정나무 기술)

  • Kang, Koohong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.29 no.4
    • /
    • pp.795-805
    • /
    • 2019
  • Recently, there is a growing interest in network anomaly detection technology to tackle unknown attacks. For this purpose, diverse studies using data mining, machine learning, and deep learning have been applied to detect network anomalies. In this paper, we evaluate the decision tree to see its feasibility for network anomaly detection on NSL-KDD data set, which is one of the most popular data mining techniques for classification. In order to handle the over-fitting problem of decision tree, we select 13 features from the original 41 features of the data set using chi-square test, and then model the decision tree using TensorFlow and Scik-Learn, yielding 84% and 70% of binary classification accuracies on the KDDTest+ and KDDTest-21 of NSL-KDD test data set. This result shows 3% and 6% improvements compared to the previous 81% and 64% of binary classification accuracies by decision tree technologies, respectively.

Pattern Analysis of Clinical Signs in Cultured Olive Flounder, Paralichthys Olivaceus, with Edwardsielosis using the Decision Tree Technique (의사결정 나무 기법을 이용한 양식넙치의 에드워드병 증상 패턴 분석)

  • Kim, Kyeong-Im;Jung, Sung-Ju;Kim, Sung-Hyun;Han, Soon-Hee;Ceong, Hee-Taek;Kim, Tae-Ho;Park, Jeong-Seon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.4
    • /
    • pp.661-674
    • /
    • 2021
  • Edwardsiellosis is difficult to treat in cultured olive flounder, Paralichthys olivaceus. It is present in the fish for a long period during all growth stages, and it often leads to mass mortalites. In this paper, the clinical patterns of Edwardsiellosis were analyzed by dividing the data into the whole-water temperature, low-water temperature, low-high water temperature, high-water temperature, and high-low water temperature groups based on various clinical signs of diseased cultured olive flounder using a decision tree technique. In the clinical sign patterns in the decision trees analyzed in the experiment, clinical signs in the liver, such as liver nodules, liver hemorrhages, and liver degeneration, were selected as the criteria for determining Edwardsiellosis. The selected clinical signs were known as the major clinical signs of Edwardsiellosis, and through consultation with fishery disease experts, the analysis confirmed that the clinical signs of Edwardsiellosis were successfully found in this study.

Analysis of Korean Adolescents' Life Satisfaction based on Public Database and Data Mining Techniques: Emphasis on Decision Tree (공공 DB 데이터마이닝 기법을 활용한 국내 청소년 삶의 만족도 분석에 관한 실증연구: 의사결정나무 기법을 중심으로)

  • Jo, Hyun Jin;Ko, Geo Nu;Lee, Kun Chang
    • Journal of Digital Convergence
    • /
    • v.18 no.6
    • /
    • pp.297-309
    • /
    • 2020
  • This study focuses on the application of the data mining technique logistic regression analysis and decision tree analysis to the domestic public database called Korean Children Youth Panel Survey (KCYPS) to derive a series of important factors affecting the enhancement of life satisfaction of domestic youth. As a result, the general impact factors on life satisfaction for each grade were derived from logistic regression. Using decision tree analysis, we came to conclusions that those factors such as depression, overall grade satisfaction, household economic level, and school adaptation play crucial roles in affecting high school adolesscents' life satisfaction.