• Title/Summary/Keyword: 의사결정나무알고리즘

Search Result 106, Processing Time 0.028 seconds

A Simulation-based Optimization for Scheduling in a Fab: Comparative Study on Different Sampling Methods (시뮬레이션 기반 반도체 포토공정 스케줄링을 위한 샘플링 대안 비교)

  • Hyunjung Yoon;Gwanguk Han;Bonggwon Kang;Soondo Hong
    • Journal of the Korea Society for Simulation
    • /
    • v.32 no.3
    • /
    • pp.67-74
    • /
    • 2023
  • A semiconductor fabrication facility(FAB) is one of the most capital-intensive and large-scale manufacturing systems which operate under complex and uncertain constraints through hundreds of fabrication steps. To improve fab performance with intuitive scheduling, practitioners have used weighted-sum scheduling. Since the determination of weights in the scheduling significantly affects fab performance, they often rely on simulation-based decision making for obtaining optimal weights. However, a large-scale and high-fidelity simulation generally is time-intensive to evaluate with an exhaustive search. In this study, we investigated three sampling methods (i.e., Optimal latin hypercube sampling(OLHS), Genetic algorithm(GA), and Decision tree based sequential search(DSS)) for the optimization. Our simulation experiments demonstrate that: (1) three methods outperform greedy heuristics in performance metrics; (2) GA and DSS can be promising tools to accelerate the decision-making process.

Development and Application of the Butterfly Algorithm Based on Decision Making Tree for Contradiction Problem Solving (모순 문제 해결을 위한 의사결정트리 기반 나비 알고리즘의 개발과 적용)

  • Hyun, Jung Suk;Ko, Ye June;Kim, Yung Gyeol;Jean, Seungjae;Park, Chan Jung
    • The Journal of Korean Association of Computer Education
    • /
    • v.22 no.1
    • /
    • pp.87-98
    • /
    • 2019
  • It is easy to assume that contradictions are logically incorrect or empty sets that have no solvability. This dilemma, which can not be done, is difficult to solve because it has to solve the contradiction hidden in it. Paradoxically, therefore, contradiction resolution has been viewed as an innovative and creative problem-solving. TRIZ, which analyzes the solution of the problem from the perspective of resolving contradictions, has been used for people rather than computers. The Butterfly model, which analyzes the problem from the perspective of solving the contradiction like TRIZ, analyzed the type of contradiction problem using symbolic logic. In order to apply an appropriate concrete solution strategy for a given contradiction problems, we designed the Butterfly algorithm based on decision making tree. We also developed a visualization tool based on Python tkInter to find concrete solution strategies for given contradiction problems. In order to verify the developed tool, the third grade students of middle school learned the Butterfly algorithm, analyzed the contradiction of the wooden support, and won the grand prize at an invention contest in search of a new solution. The Butterfly algorithm developed in this paper systematically reduces the solution space of contradictory problems in the beginning of problem solving and can help solve contradiction problems without trial and errors.

The Prediction Model for Self-Reported Voice Problem Using a Decision Tree Model (의사결정나무 모형을 이용한 주관적 음성장애 예측모형)

  • Byeon, Haewon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.7
    • /
    • pp.3368-3373
    • /
    • 2013
  • The purpose of this study was to analyze the risk factors of self-reported voice problem. Data were from the Korea National Health and Nutritional Examination Survey 2008. Subjects were 3,600 persons (1,501 men, 2,099 women) aged 19 years and older. A prediction model was developed by the use of a exhaustive CHAID (Chi Squared Automatic Interaction Detection) algorism of decision tree model. In the decision tree analysis, pain and discomfort during the last 2 weeks, age, the longest occupation and thyroid disorders was significantly associated with self-reported voice problem. The findings of associated factors suggest potential ways of targeting counseling and prevention efforts to control self-reported voice problem.

Automated Scoring of Scientific Argumentation Using Expert Morpheme Classification Approaches (전문가의 형태소 분류를 활용한 과학 논증 자동 채점)

  • Lee, Manhyoung;Ryu, Suna
    • Journal of The Korean Association For Science Education
    • /
    • v.40 no.3
    • /
    • pp.321-336
    • /
    • 2020
  • We explore automated scoring models of scientific argumentation. We consider how a new analytical approach using a machine learning technique may enhance the understanding of spoken argumentation in the classroom. We sampled 2,605 utterances that occurred during a high school student's science class on molecular structure and classified the utterances into five argumentative elements. Next, we performed Text Preprocessing for the classified utterances. As machine learning techniques, we applied support vector machines, decision tree, random forest, and artificial neural network. For enhancing the identification of rebuttal elements, we used a heuristic feature-engineering method that applies experts' classification of morphemes of scientific argumentation.

A Study on Sensor Data Analysis and Product Defect Improvement for Smart Factory (스마트 팩토리를 위한 센서 데이터 분석과 제품 불량 개선 연구)

  • Hwang, Sewong;Kim, Jonghyuk;Hwangbo, Hyunwoo
    • The Journal of Bigdata
    • /
    • v.3 no.1
    • /
    • pp.95-103
    • /
    • 2018
  • In recent years, many people in the manufacturing field have been making efforts to increase efficiency while analyzing manufacturing data generated in the process according to the development of ICT technology. In this study, we propose a data mining based manufacturing process using decision tree algorithm (CHAID) as part of a smart factory. We used 432 sensor data from actual manufacturing plant collected for about 5 months to find out the variables that show a significant difference between the stable process period with low defect rate and the unstable process period with high defect rate. We set the range of the stable value of the variable to determine whether the selected final variable actually has an effect on the defect rate improvement. In addition, we measured the effect of the defect rate improvement by adjusting the process set-point so that the sensor did not deviate from the stable value range in the 14 day process. Through this, we expect to be able to provide empirical guidelines to improve the defect rate by utilizing and analyzing the process sensor data generated in the manufacturing industry.

Analysis of the Factors and Patterns Associated with Death in Aircraft Accidents and Incidents Using Data Mining Techniques (데이터 마이닝 기법을 활용한 항공기 사고 및 준사고로 인한 사망 발생 요인 및 패턴 분석)

  • Kim, Jeong-Hun;Kim, Tae-Un;Yoo, Dong-Hee
    • Journal of Digital Convergence
    • /
    • v.17 no.9
    • /
    • pp.79-88
    • /
    • 2019
  • This study analyzes the influential factors and patterns associated with death from aircraft accidents and incidents using data mining techniques. To this end, we used two datasets for aircraft accidents and incidents, one from the National Transportation Safety Board (NTSB) and the other from the Federal Aviation Administration (FAA). We developed our prediction models using the decision tree classifier to predict death from aircraft accidents or aircraft incidents and thereby derive the main cause factors and patterns that can cause death based on these prediction models. In the NTSB data, deaths occurred frequently when the aircraft was destroyed or people were performing dangerous missions or maneuver. In the FAA data, deaths were mainly caused by pilots who were less skilled or less qualified when their aircraft were partially destroyed. Several death-related patterns were also found for parachute jumping and aircraft ascending and descending phases. Using the derived patterns, we proposed helpful strategies to prevent death from the aircraft accidents or incidents.

A Study on the Prediction Model for Sales of Women's Golfwear with Data Mining: Focus on Macroeconomic Factors and Consumer Sales Price (데이터마이닝을 적용한 여성 골프웨어 판매 예측 모델 연구: 거시경제요인과 소비자판매가격을 중심으로)

  • Han, Ki-Hyang
    • Journal of Digital Convergence
    • /
    • v.19 no.11
    • /
    • pp.445-456
    • /
    • 2021
  • The purpose of this study is to identify the importance of variables affecting women's golf wear sales with macroeconomic variables and consumer selling prices that affect consumers' purchasing behavior, and to propose a price strategy to increase sales of golf wear. Data of domestic women's golf wear brands were analyzed using decision tree algorithms and ensemble. Consumer selling price is the most significant factors in terms of sales volume for T-shirt, pants and knit, while categories were found to be the most important factors in addition to consumer sales prices for skirt and one piece dress. These findings suggest that items have different economic variables that affect consumers' purchasing behavior, suggesting that sales and profits can be maximized through appropriate price strategies.

A Study of the Integration of Individual Classification Model in Data Mining for the Credit Evaluation (신용평가를 위한 데이터마이닝 분류모형의 통합모형에 관한 연구)

  • Kim Kap Sik
    • The KIPS Transactions:PartD
    • /
    • v.12D no.2 s.98
    • /
    • pp.211-218
    • /
    • 2005
  • This study presents an integrated data mining model for the credit evaluation of the customers of a capital company. Based on customer information and financing processes in capital market, we derived individual models from multi-layered perceptrons(MLP), multivariate discrimination analysis(MDA), and decision tree. Further, the results from the existing models were compared with the results from the integrated model using genetic algorithm. The integrated model presented by this study turned out to be superior to the existing models. This study contributes not only to verifying the existing individual models but also to overcoming the limitations of the existing approaches.

Correlated variable importance for random forests (랜덤포레스트를 위한 상관예측변수 중요도)

  • Shin, Seung Beom;Cho, Hyung Jun
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.2
    • /
    • pp.177-190
    • /
    • 2021
  • Random forests is a popular method that improves the instability and accuracy of decision trees by ensembles. In contrast to increasing the accuracy, the ease of interpretation is sacrificed; hence, to compensate for this, variable importance is provided. The variable importance indicates which variable plays a role more importantly in constructing the random forests. However, when a predictor is correlated with other predictors, the variable importance of the existing importance algorithm may be distorted. The downward bias of correlated predictors may reduce the importance of truly important predictors. We propose a new algorithm remedying the downward bias of correlated predictors. The performance of the proposed algorithm is demonstrated by the simulated data and illustrated by the real data.

Development of Needs Extraction Algorithm Fitting for Individuals in Care Management for the Elderly in Home (재가노인 사례관리의 욕구사정 정확도 향상을 위한 욕구추출 알고리즘 개발 - 데이터 마이닝 분석기법을 활용하여 -)

  • Kim, Young-Sook;Jung, Kook-In;Park, So-Rah
    • Korean Journal of Social Welfare
    • /
    • v.60 no.1
    • /
    • pp.187-209
    • /
    • 2008
  • The authors developed 28 needs assessment tools for integrated assessment centered on needs, which is the core element in care management for the elderly in home. Also, the authors collected the assessment data of 676 elderly persons in home from 120 centers under the Korea Association of Senior Welfare Centers by using the needs assessment tools, and finally developed needs extraction algorithm through decision tree analysis in data mining to identify their actual needs and provide social welfare service suitable for such needs. The needs extraction algorithm for 28 needs of the elderly in home are summarized in

    . The Need No. 8 "Having need of help in going out" of the decision-making model, for example, was divided into 80.3% of asking for help and 11.4% not asking for help with Appeal No. 23 as a major variable. The need increased by 87.9% when the elderly appealed for help to go out and they had a caregiver but decreased by 47.4% when they had no caregiver. When the elderly asked for help in going out, they had a caregiver, and they needed complete help in cleaning, their need of help in going out was shown as 94.2%. However, seen from their answer that they needed complete help in bathing of ADL even if they did not ask for help in going out, it was found that the need of help in going out sharply increased from 11.4% to 80.0%. On the other hand, when they needed partial help or self-supported in bathing, the potential for them to be classified as asking for help in going out was shown to be low as 7.7%. In the said decision-making model, the number of cases for parent node and child node was designated as 50 and 25, respectively, with level 5 of the maximum tree depth as stopping rule. By this, it was shown that their decision-making was found to be effective as 182.13% for the need "Having need of help in going out". The algorithm presented in this study can be useful as systematic and scientific fundamental data in assessment of needs of the elderly in home.

  • PDF

  • (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.