• 제목/요약/키워드: Decision Tree analysis

Search Result 725, Processing Time 0.027 seconds

Using Data Mining Techniques for Analysis of the Impacts of COVID-19 Pandemic on the Domestic Stock Prices: Focusing on Healthcare Industry (데이터 마이닝 기법을 통한 COVID-19 팬데믹의 국내 주가 영향 분석: 헬스케어산업을 중심으로)

  • Kim, Deok Hyun;Yoo, Dong Hee;Jeong, Dae Yul
    • The Journal of Information Systems
    • /
    • v.30 no.3
    • /
    • pp.21-45
    • /
    • 2021
  • Purpose This paper analyzed the impacts of domestic stock market by a global pandemic such as COVID-19. We investigated how the overall pattern of the stock market changed due to the impact of the COVID-19 pandemic. In particular, we analyzed in depth the pattern of stock price, as well, tried to find what factors affect on stock market index(KOSPI) in the healthcare industry due to the COVID-19 pandemic. Design/methodology/approach We built a data warehouse from the databases in various industrial and economic fields to analyze the changes in the KOSPI due to COVID-19, particularly, the changes in the healthcare industry centered on bio-medicine. We collected daily stock price data of the KOSPI centered on the KOSPI-200 about two years before and one year after the outbreak of COVID-19. In addition, we also collected various news related to COVID-19 from the stock market by applying text mining techniques. We designed four experimental data sets to develop decision tree-based prediction models. Findings All prediction models from the four data sets showed the significant predictive power with explainable decision tree models. In addition, we derived significant 10 to 14 decision rules for each prediction model. The experimental results showed that the decision rules were enough to explain the domestic healthcare stock market patterns for before and after COVID-19.

Determinant of the Elderly Poverty Using Decision Tree Analysis (의사결정나무분석을 활용한 노인빈곤 결정요인 분석)

  • Park, Mi-Young
    • Journal of Digital Convergence
    • /
    • v.16 no.7
    • /
    • pp.63-69
    • /
    • 2018
  • This study is to examine the determinants of the elderly poverty by using the Decision-tree analysis. In line with this perspective, this study includes individual characteristics, family characteristics, working characteristics, and periodic income characteristics after retirement as determinants for senior poverty. The study uses data from the Korean Retirement and Income Study based on panel survey and employs the Decision-tree analysis to explain the causes of the elderly poverty. As the result of analysis, earned wage has the greatest effect on the elderly poverty. Depending on status of the earned wage, there are 2 different variable groups. One with no earned wage includes public pension, education, and residence, paid employee and gender in the other with earned wage. Based on the analytical results, the study suggests measures to address the elderly poverty.

A Study on Factors of Internet Overdependence for Adults Using the Decision Tree Analysis Model (성인층의 인터넷 과의존 영향요인: 의사결정나무분석을 활용하여)

  • Seo, Hyung-Jun;Shin, Ji-Woong
    • Informatization Policy
    • /
    • v.25 no.2
    • /
    • pp.20-45
    • /
    • 2018
  • This study aims to find the factors of Internet overdependence in adults, through the decision tree analysis model, which is a data mining method using National Information Society Agency's raw data from the survey on Internet overdependence in 2016. As a result of the decision tree analysis, a total 16 nodes of Internet overdependence risk groups were identified. The main predicated variables were the amount of time spent per smart media usage in weekdays; amount of time spent per smart media usage in weekends; experiences of purchasing cash items; percentage of using smart media for leisure; negative personality; percentage of using smart media for information search and utilization; and awareness on good functions of the Internet, all of which in order had greater impact on the risk groups. Users in the highest risk node spent the smart media for more than 5 minutes per use and less than 5~10 minutes in weekdays, had experiences of cash item purchase, and had lower level of awareness on the good functions of the Internet. The analysis led to the following recommendations: First, even a short-time use has higher chances of causing Internet overdependence, and therefore, guidelines need to be developed based on research on the usage behavior rather than the usage time. Second, self-regulation is required because factors that affect overindulgence in games, such as the cash items, increase Internet overdependence. Third, using the Internet for leisure causes higher risk of overdependence and therefore, other means of leisure should be recommended.

Human Normalization Approach based on Disease Comparative Prediction Model between Covid-19 and Influenza

  • Janghwan Kim;Min-Yong Jung;Da-Yun Lee;Na-Hyeon Cho;Jo-A Jin;R. Young-Chul Kim
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.3
    • /
    • pp.32-42
    • /
    • 2023
  • There are serious problems worldwide, such as a pandemic due to an unprecedented infection caused by COVID-19. On previous approaches, they invented medical vaccines and preemptive testing tools for medical engineering. However, it is difficult to access poor medical systems and medical institutions due to disparities between countries and regions. In advanced nations, the damage was even greater due to high medical and examination costs because they did not go to the hospital. Therefore, from a software engineering-based perspective, we propose a learning model for determining coronavirus infection through symptom data-based software prediction models and tools. After a comparative analysis of various models (decision tree, Naive Bayes, KNN, multi-perceptron neural network), we decide to choose an appropriate decision tree model. Due to a lack of data, additional survey data and overseas symptom data are applied and built into the judgment model. To protect from thiswe also adapt human normalization approach with traditional Korean medicin approach. We expect to be possible to determine coronavirus, flu, allergy, and cold without medical examination and diagnosis tools through data collection and analysis by applying decision trees.

Using Predictive Analytics to Profile Potential Adopters of Autonomous Vehicles

  • Lee, Eun-Ju;Zafarzon, Nordirov;Zhang, Jing
    • Asia Marketing Journal
    • /
    • v.20 no.2
    • /
    • pp.65-83
    • /
    • 2018
  • Technological advances are bringing autonomous vehicles to the ever-evolving transportation system. Anticipating adoption of these technologies by users is essential to vehicle manufacturers for making more precise production and marketing strategies. The research investigates regulatory focus and consumer innovativeness with consumers' adoption of autonomous vehicles (AVs) and to consumers' subsequent willingness to pay for AVs. An online questionnaire was fielded to confirm predictions, and regression analysis was conducted to verify the model's validity. The results show that a promotion focus does not have a significantly positive effect on the automation level at which consumers will adopt AVs, but a prevention focus has a significantly positive effect on conditional AV adoption. Consumer innovativeness, consumers' novelty-seeking have a significantly positive relationship with high and full AV adoption, and consumers' independent decision-making has a significantly positive effect on full AV adoption. The higher the level of automation at which a consumer adopts AVs, the higher the willingness to pay for them. Finally, using a neural network and decision tree analyses, we show methods with which to describe three categories for potential adopters of AVs.

Design Analysis of Current Density in Lithium Secondary Battery Using Data Mining Techniques (데이터 마이닝을 이용한 리튬 이차전지의 전류밀도 영향인자 분석)

  • Jeong, Dong Ho;Lee, Jongsoo;Choi, Ha-Young
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.38 no.6
    • /
    • pp.677-682
    • /
    • 2014
  • In the present study, a decision tree and artificial neural network were used to determine critical design parameters for lithium ion batteries and compare their performances. First, a design method that used a decision tree-artificial neural network model was used to determine the major design factors among early pole plate design factors that showed nonlinearity. Then, the artificial neural network was used to implement a weighted value analysis of the importance of the design factors and their effect on the current density. The second method involved the use of an artificial neural network model to construct artificial networks without separate determinations of the major early design factors to analyze the connections and weighted values related to the current density.

Application of Data Mining Techniques to Explore Predictors of HCC in Egyptian Patients with HCV-related Chronic Liver Disease

  • Omran, Dalia Abd El Hamid;Awad, AbuBakr Hussein;Mabrouk, Mahasen Abd El Rahman;Soliman, Ahmad Fouad;Aziz, Ashraf Omar Abdel
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.16 no.1
    • /
    • pp.381-385
    • /
    • 2015
  • Background:Hepatocellular carcinoma (HCC) is the second most common malignancy in Egypt. Data mining is a method of predictive analysis which can explore tremendous volumes of information to discover hidden patterns and relationships. Our aim here was to develop a non-invasive algorithm for prediction of HCC. Such an algorithm should be economical, reliable, easy to apply and acceptable by domain experts. Methods: This cross-sectional study enrolled 315 patients with hepatitis C virus (HCV) related chronic liver disease (CLD); 135 HCC, 116 cirrhotic patients without HCC and 64 patients with chronic hepatitis C. Using data mining analysis, we constructed a decision tree learning algorithm to predict HCC. Results: The decision tree algorithm was able to predict HCC with recall (sensitivity) of 83.5% and precession (specificity) of 83.3% using only routine data. The correctly classified instances were 259 (82.2%), and the incorrectly classified instances were 56 (17.8%). Out of 29 attributes, serum alpha fetoprotein (AFP), with an optimal cutoff value of ${\geq}50.3ng/ml$ was selected as the best predictor of HCC. To a lesser extent, male sex, presence of cirrhosis, AST>64U/L, and ascites were variables associated with HCC. Conclusion: Data mining analysis allows discovery of hidden patterns and enables the development of models to predict HCC, utilizing routine data as an alternative to CT and liver biopsy. This study has highlighted a new cutoff for AFP (${\geq}50.3ng/ml$). Presence of a score of >2 risk variables (out of 5) can successfully predict HCC with a sensitivity of 96% and specificity of 82%.

Predictive Analysis of Problematic Smartphone Use by Machine Learning Technique

  • Kim, Yu Jeong;Lee, Dong Su
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.2
    • /
    • pp.213-219
    • /
    • 2020
  • In this paper, we propose a classification analysis method for diagnosing and predicting problematic smartphone use in order to provide policy data on problematic smartphone use, which is getting worse year after year. Attempts have been made to identify key variables that affect the study. For this purpose, the classification rates of Decision Tree, Random Forest, and Support Vector Machine among machine learning analysis methods, which are artificial intelligence methods, were compared. The data were from 25,465 people who responded to the '2018 Problematic Smartphone Use Survey' provided by the Korea Information Society Agency and analyzed using the R statistical package (ver. 3.6.2). As a result, the three classification techniques showed similar classification rates, and there was no problem of overfitting the model. The classification rate of the Support Vector Machine was the highest among the three classification methods, followed by Decision Tree and Random Forest. The top three variables affecting the classification rate among smartphone use types were Life Service type, Information Seeking type, and Leisure Activity Seeking type.

Study on Classification Function into Sasang Constitution Using Data Mining Techniques (데이터마이닝 기법을 이용한 사상체질 판별함수에 관한 연구)

  • Kim Kyu Kon;Kim Jong Won;Lee Eui Ju;Kim Jong Yeol;Choi Sun-Mi
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.18 no.6
    • /
    • pp.1938-1944
    • /
    • 2004
  • In this study, when we make a diagnosis of constitution using QSCC Ⅱ(Questionnaire of Sasang Constitution Classification). data mining techniques are applied to seek the classification function for improving the accuracy. Data used in the analysis are the questionnaires of 1051 patients who had been treated in Dong Eui Oriental Medical Hospital and Kyung Hee Oriental Medical Hospital. The criteria for data cleansing are the response pattern in the opposite questionnaires and the positive proportion of specific questionnaires in each constitution. And the criteria for variable selection are the test of homogeneity in frequency analysis and the coefficients in the linear discriminant function. Discriminant analysis model and decision tree model are applied to seek the classification function into Sasang constitution. The accuracy in learning sample is similar in two models, the higher accuracy in test sample is obtained in discriminant analysis model.