• Title/Summary/Keyword: 의사결정나무기법

Search Result 252, Processing Time 0.029 seconds

Flood Mitigation Planing for a Basin Using a Decision Tree Model (의사결정나무모형을 이용한 유역내 구조적 홍수방어 대안 도출)

  • Byeon, Sungho;Kang, Hyunjin;Han, Jeongwoo;Kim, Tae-Woong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.1B
    • /
    • pp.33-40
    • /
    • 2008
  • Intensive rainfalls in wet season (June~September) result in serious flood damage which is about 95% of natural hazard in Korea. Recently, in order to cope with repeated flood hazard, comprehensive flood control plans have been carried out in large basins in Korea. The plans suggest structural alternative plans for flood mitigation as well as non-structural plans. In this study, a practical method using a decision tree was developed to systematically allocate structural facilities for flood control, which maximizes the flood control capacity in a basin. This study also presents a practical guidance to organize structural defensive alternatives for a comprehensive flood control plan in a large basin.

Development to Prediction Technique of Slope Hazards in Gneiss Area using Decision Tree Model (의사결정나무모형을 이용한 편마암 지역에서의 급경사지재해 예측기법 개발)

  • Song, Young-Suk;Chae, Byung-Gon
    • The Journal of Engineering Geology
    • /
    • v.18 no.1
    • /
    • pp.45-54
    • /
    • 2008
  • Based on the data obtained from field investigation and soil testing to slope hazards occurrence section and non-occurrence section in gneiss area, a prediction technique was developed by the use of a decision tree model, which is one of the statistical analysis methods. The slope hazards data of Seoul and Kyonggi Province, which were induced by heavy rainfall in 1998, were 104 sections in gneiss area. The number of data applied in developing prediction model was 61 sections except a vacant value. Among these data, the number of data occurred slope hazards was 34 sections and the number of data non-occurred slope hazards was 27 sections. The statistical analyses using the decision tree model were applied to chi-square statistics, gini index and entrophy index. As the results of analyses, a slope angle, a degree of saturation and an elevation were selected as the classification standard. The prediction model of decision tree using entrophy index is most likely accurate. The classification standard of the selected prediction model is composed of the slope angle, the degree of saturation and the elevation from the first choice stage. The classification standard values of the slope angle, the degree of saturation and elevation are $17.9^{\circ}$, 52.1% and 320 m, respectively.

데이터마이닝 기법을 활용한 스팸메일 분류 및 예측모형 구축에 관한 연구

  • 안수산;신경식
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2000.11a
    • /
    • pp.359-366
    • /
    • 2000
  • 기업의 환경에서 이-메일(e-mail)은 회사내의 업무흐름을 완전히 뒤바꾸며 혁명적인 변화를 이끌고 있다. 업무 공간의 극복, 사내 커뮤니케이션의 극대화 등 이-메일이 제공하는 장점이 매우 많다. 그러나 최근 사회적 문제가 되고 있는 스팸 메일(spam mail)의 등장은 이러한 장점의 커다란 반대급부를 제공한다. 스팸메일이란 인터넷이용자들에게 원하지도 않았는데 무작위로 발송되는 광고성 이-메일을 일컫는 말로, 벌크(bulk)메일, 정크(junk)메일, 언솔리시티드(Unsolicited)메일과도 유사한 의미로 사용된다. 스팸메일은 사용자들로 하여금 스트레쓰의 요인이 되게 함은 물론, 이를 발신하고 수신하는 과정에서 이용되는 서버에 엄청난 부하를 줄 뿐만 아니라, 공공의 성격을 지니는 네트웍 자원을 아무런 비용의 지불 없이 독점하게 되는 좋지 않은 결과를 가져오게 된다. 본 연구에서는 데이터마이닝의 기법 중 분류(classification tack) 문제에 적웅이 활발한 인공신경망 (artificial neural networks)과 의사결정나무(decision tree)기법을 이용하여 스팸메일의 분류와 예측을 가능케 하는 모형을 구축한다.

  • PDF

Convergence outpatient medical service patient experience research using data mining (데이터마이닝 기법을 이용한 융복합 외래 의료서비스 환자경험조사 연구)

  • Yoo, Jin-Yeong
    • Journal of Digital Convergence
    • /
    • v.18 no.7
    • /
    • pp.299-306
    • /
    • 2020
  • The purpose of this study is to find out specific measures that can help the management strategy of patient-centered medical institutions by conducting research on patient experience surveys of convergence outpatient medical services using data mining techniques according to changes in patient-centered medical culture. Using the raw data of the 2018 Medical Service Experience Survey, 8,843 people over the age of 15 who had patient experience in outpatient medical services were analyzed. Decision tree analysis was performed. The determinants of satisfaction with outpatient medical services patient experience were the doctor's area and patient's rights protection area, and the determinants of intention to recommend outpatient medical services were the doctor's area and facilities comfort. Women evaluated the experience positively in overall satisfaction as compared to men, and those over the age of 60 positively evaluated the overall satisfaction and intention to recommend. It is significant that the outpatient experience decision-making model is presented, and that the doctor's area, patient's rights protection area, and facility comfort are important factors. Long-term research on the 'Medical Service Experience Survey' is needed, and research on the inpatient medical service experience is needed.

Application of a Decision Tree to Alternative Plans for the Urban Flood Mitigation (Decision Tree를 이용한 도시유역홍수방어 대안 도출)

  • Byeon, Sung-Ho;Kang, Hyun-Jik;Han, Jeong-Woo;Ahn, Jae-Hyun;Kim, Tae-Woong
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.726-730
    • /
    • 2007
  • 우리나라는 6월부터 9월까지의 우기에 강우가 집중 발생하는 기상특성으로 인해 자연재해의 95% 이상이 집중호우와 태풍에 의한 풍수해로 집계되고 있을 만큼 홍수피해에 취약하며, 오래전부터 홍수방어에 대한 구조적 대책이 시행되어왔다. 본 연구의 목적은 의사결정기법인 Decision Tree(의사결정나무)를 활용하여 유역종합치수계획의 구조적 홍수방어 최적대안 선정을 위한 후보대안들을 제시하여 홍수저감능력을 효율적으로 극대화 하는데 그 목적이 있다. 본 연구는 유역이 가지고 있는 치수적 기능을 최대한 살리고 상 하류의 유기적인 방어 기능을 도모하고자 하였으며, 또한 도시유역 홍수방어 대안 조합 지침을 마련하여 실무에 적용가능한 안을 제시하였다.

  • PDF

An Empirical Comparison of Bagging, Boosting and Support Vector Machine Classifiers in Data Mining (데이터 마이닝에서 배깅, 부스팅, SVM 분류 알고리즘 비교 분석)

  • Lee Yung-Seop;Oh Hyun-Joung;Kim Mee-Kyung
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.343-354
    • /
    • 2005
  • The goal of this paper is to compare classification performances and to find a better classifier based on the characteristics of data. The compared methods are CART with two ensemble algorithms, bagging or boosting and SVM. In the empirical study of twenty-eight data sets, we found that SVM has smaller error rate than the other methods in most of data sets. When comparing bagging, boosting and SVM based on the characteristics of data, SVM algorithm is suitable to the data with small numbers of observation and no missing values. On the other hand, boosting algorithm is suitable to the data with number of observation and bagging algorithm is suitable to the data with missing values.

Analysis of Characteristics of the Cancelled Districts of Housing Redevelopment Project - Focusing on Decision Tree Analysis - (재정비사업 해제구역 의사결정 특성 연구 - 의사결정나무기법 중심으로 -)

  • Lee, Do-Ghil
    • Journal of the Korean Regional Science Association
    • /
    • v.37 no.4
    • /
    • pp.49-59
    • /
    • 2021
  • This study aims to identify the characteristics of the cancelled districts of housing redevelopment and housing reconstruction project. The subject of this study is 189 project districts(121 promoted districts, 68 cancelled districts). Both 121 promoted districts and 68 cancelled districts were analyzed by Decision Tree Analysis. The first separation of the release zone influencing factors was made by the Development Actors. In other words, the most important independent variable for determining the release zone influence factor was shown to be the presence or absence of propulsion actors. Of the 89 districts without propellers, 41 were lifted and 48 were promoted, and 9 out of 100 districts with propellers were lifted and 91 were promoted. The second separation of the impact factors on the zone was then made by Land Owners, and the probability of cancellation increased if the number of landowners was less than 468 and 37 out of 62 were removed. On the other hand, four out of 27 districts with more than 468 landowners were lifted and 23 districts were promoted. The third separation was made by the Average Land Assessment, and 35 zones were lifted below the standard of KRW 269.64 million/m2 approximately KRW 8.91 million per pyeong, and two zones were lifted at higher official prices. In the second division, the number of landowners was 468 or more, and in node4, four areas were removed from areas with a public land area ratio of 29.43% or more, and no areas less were released. This study used SPSS Statistics 26 S/W for analysis.

Empirical Analysis of Influential Factors Affecting Domestic Workers' Turnover Intention: Emphasis on Public Database and Decision Tree Method (근로자들의 이직 의도에 영향을 주는 요인에 관한 실증연구: 공공 데이터베이스와 의사결정나무 기법을 중심으로)

  • Geo Nu Ko;Hyun Jin Jo;Kun Chang Lee
    • Information Systems Review
    • /
    • v.22 no.4
    • /
    • pp.41-58
    • /
    • 2020
  • This study addresses the issue of which factors make domestic works have turnover intention. To pursue this research issue, we utilized a public database "2017 Occupational Migration Path Survey", administerd by Korea Employment Information Service (KEIS). Decision tree method was applied to extract crucial factors influencing workers' turnover intention. They include 'the degree of matching the level of education with the level of work', 'the possibility of individual development', 'the job-related education and training', 'the promotion system', 'wage and income', 'social reputation for work' and 'the stability of employment'.

실시간 CRM을 위한 분류 기법과 연관성 규칙의 통합적 활용;신용카드 고객 이탈 예측에 활용

  • Lee, Ji-Yeong;Kim, Jong-U
    • 한국경영정보학회:학술대회논문집
    • /
    • 2007.06a
    • /
    • pp.135-140
    • /
    • 2007
  • 이탈 고객 예측은 데이터 마이닝에서 다루는 주요한 문제 중에 하나이다. 이탈 고객 예측은 일종의 분류(classification) 문제로 의사결정나무추론, 로지스틱 회귀분석, 인공신경망 등의 기법이 많이 활용되어왔다. 일반적으로 이탈 고객 예측을 위한 모델은 고객의 인구통계학적 정보와 계약이나 거래 정보를 입력변수로 하여 이탈 여부를 목표변수로 보는 형태로 분류 모델을 생성하게 된다. 본 연구에서는 고객과의 지속적인 접촉으로 발생되는 추가적인 사건 정보를 활용하여 연관성 규칙을 생성하고 이 결과를 기존의 방식으로 생성된 분류 모델과 결합하는 이탈 고객 예측 방법을 제시한다. 제시한 방법의 유용성을 확인하기 위해서 특정 국내 신용카드사의 실제 데이터를 활용하여 실험을 수행하였다. 실험 결과 제시된 방법이 기존의 전통적인 분류 모델에 비해서 향상된 성능을 보이는 것을 확인할 수 있었다. 제시된 예측 방법의 장점은 기존의 이탈 예측을 위한 입력 변수들 이외에 고객과 회사간의 접촉을 통해서 생성된 동적 정보들을 통합적으로 활용하여 예측 정확도를 높이고 실시간으로 이탈 확률을 갱신할 수 있다는 점이다.

  • PDF