• Title/Summary/Keyword: dropout prediction

Search Result 28, Processing Time 0.022 seconds

Development of the Drop-outs Prediction Model for Intelligent Drop-outs Prevention System

  • Song, Mi-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.10
    • /
    • pp.9-17
    • /
    • 2017
  • The student dropout prediction is an indispensable for many intelligent systems to measure the educational system and success rate of all university. Therefore, in this paper, we propose an intelligent dropout prediction system that minimizes the situation by adopting the proactive process through an effective model that predicts the students who are at risk of dropout. In this paper, the main data sets for students dropout predictions was used as questionnaires and university information. The questionnaire was constructed based on theoretical and empirical grounds about factor affecting student's performance and causes of dropout. University Information included student grade, interviews, attendance in university life. Through these data sets, the proposed dropout prediction model techniques was classified into the risk group and the normal group using statistical methods and Naive Bays algorithm. And the intelligence dropout prediction system was constructed by applying the proposed dropout prediction model. We expect the proposed study would be used effectively to reduce the students dropout in university.

A Study of Freshman Dropout Prediction Model Using Logistic Regression with Shift-Sigmoid Classification Function (시프트 시그모이드 분류함수를 가진 로지스틱 회귀를 이용한 신입생 중도탈락 예측모델 연구)

  • Kim Donghyung
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.19 no.4
    • /
    • pp.137-146
    • /
    • 2023
  • The dropout of university freshmen is a very important issue in the financial problems of universities. Moreover, the dropout rate is one of the important indicators among the external evaluation items of universities. Therefore, universities need to predict dropout students in advance and apply various dropout prevention programs targeting them. This paper proposes a method to predict such dropout students in advance. This paper is about a method for predicting dropout students. It proposes a method to select dropouts by applying logistic regression using a shift sigmoid classification function using only quantitative data from the first semester of the first year, which most universities have. It is based on logistic regression and can select the number of prediction subjects and prediction accuracy by using the shift sigmoid function as an classification function. As a result of the experiment, when the proposed algorithm was applied, the number of predicted dropout subjects varied from 100% to 20% compared to the actual number of dropout subjects, and it was found to have a prediction accuracy of 75% to 98%.

A Study on the Development of University Students Dropout Prediction Model Using Ensemble Technique (앙상블 기법을 활용한 대학생 중도탈락 예측 모형 개발)

  • Park, Sangsung
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.17 no.1
    • /
    • pp.109-115
    • /
    • 2021
  • The number of freshmen at universities is decreasing due to the recent decline in the school-age population, and the survival of many universities is threatened. To overcome this situation, universities are seeking ways to use big data within the school to improve the quality of education. A study on the prediction of dropout students is a representative case of using big data in universities. The dropout prediction can prepare a systematic management plan by identifying students who will drop out of school due to reasons such as dropout or expulsion. In the case of actual on-campus data, a large number of missing values are included because it is collected and managed by various departments. For this reason, it is necessary to construct a model by effectively reflecting the missing values. In this study, we propose a university student dropout prediction model based on eXtreme Gradient Boost that can be applied to data with many missing values and shows high performance. In order to examine the practical applicability of the proposed model, an experiment was performed using data from C University in Chungbuk. As a result of the experiment, the prediction performance of the proposed model was found to be excellent. The management strategy of dropout students can be established through the prediction results of the model proposed in this paper.

Post-Examination Analysis on the Student Dropout Prediction Index (학생 중도탈락 예측지수에 관한 사후검증 연구)

  • Lee, Ji-Eun
    • The Journal of Bigdata
    • /
    • v.4 no.2
    • /
    • pp.175-183
    • /
    • 2019
  • Drop-out issue is one of the challenges of cyber university. There are about 130,000 students enrolled in cyber universities, but the dropout rate is also very high. To lower the dropout rate, cyber universities invest heavily in learning analytics. Some cyber universities analyze the possibility of dropout and actively support students who are more likely to drop out. The purpose of this paper is to identify the learning data affecting the dropout prediction index. As a result of the analysis, it is confirmed that number of lessons(progress), credits, achievement and leave of absence have a significant effect on dropout rate. It is necessary to increase the accuracy of the prediction model through post-test on the student dropout prediction index.

  • PDF

A Comparative Study of Prediction Models for College Student Dropout Risk Using Machine Learning: Focusing on the case of N university (머신러닝을 활용한 대학생 중도탈락 위험군의 예측모델 비교 연구 : N대학 사례를 중심으로)

  • So-Hyun Kim;Sung-Hyoun Cho
    • Journal of The Korean Society of Integrative Medicine
    • /
    • v.12 no.2
    • /
    • pp.155-166
    • /
    • 2024
  • Purpose : This study aims to identify key factors for predicting dropout risk at the university level and to provide a foundation for policy development aimed at dropout prevention. This study explores the optimal machine learning algorithm by comparing the performance of various algorithms using data on college students' dropout risks. Methods : We collected data on factors influencing dropout risk and propensity were collected from N University. The collected data were applied to several machine learning algorithms, including random forest, decision tree, artificial neural network, logistic regression, support vector machine (SVM), k-nearest neighbor (k-NN) classification, and Naive Bayes. The performance of these models was compared and evaluated, with a focus on predictive validity and the identification of significant dropout factors through the information gain index of machine learning. Results : The binary logistic regression analysis showed that the year of the program, department, grades, and year of entry had a statistically significant effect on the dropout risk. The performance of each machine learning algorithm showed that random forest performed the best. The results showed that the relative importance of the predictor variables was highest for department, age, grade, and residence, in the order of whether or not they matched the school location. Conclusion : Machine learning-based prediction of dropout risk focuses on the early identification of students at risk. The types and causes of dropout crises vary significantly among students. It is important to identify the types and causes of dropout crises so that appropriate actions and support can be taken to remove risk factors and increase protective factors. The relative importance of the factors affecting dropout risk found in this study will help guide educational prescriptions for preventing college student dropout.

Prediction of golden time for recovering SISs using deep fuzzy neural networks with rule-dropout

  • Jo, Hye Seon;Koo, Young Do;Park, Ji Hun;Oh, Sang Won;Kim, Chang-Hwoi;Na, Man Gyun
    • Nuclear Engineering and Technology
    • /
    • v.53 no.12
    • /
    • pp.4014-4021
    • /
    • 2021
  • If safety injection systems (SISs) do not work in the event of a loss-of-coolant accident (LOCA), the accident can progress to a severe accident in which the reactor core is exposed and the reactor vessel fails. Therefore, it is considered that a technology that provides recoverable maximum time for SIS actuation is necessary to prevent this progression. In this study, the corresponding time was defined as the golden time. To achieve the objective of accurately predicting the golden time, the prediction was performed using the deep fuzzy neural network (DFNN) with rule-dropout. The DFNN with rule-dropout has an architecture in which many of the fuzzy neural networks (FNNs) are connected and is a method in which the fuzzy rule numbers, which are directly related to the number of nodes in the FNN that affect inference performance, are properly adjusted by a genetic algorithm. The golden time prediction performance of the DFNN model with rule-dropout was better than that of the support vector regression model. By using the prediction result through the proposed DFNN with rule-dropout, it is expected to prevent the aggravation of the accidents by providing the maximum remaining time for SIS recovery, which failed in the LOCA situation.

Performance Comparison of Machine Learning based Prediction Models for University Students Dropout (머신러닝 기반 대학생 중도 탈락 예측 모델의 성능 비교)

  • Seok-Bong Jeong;Du-Yon Kim
    • Journal of the Korea Society for Simulation
    • /
    • v.32 no.4
    • /
    • pp.19-26
    • /
    • 2023
  • The increase in the dropout rate of college students nationwide has a serious negative impact on universities and society as well as individual students. In order to proactive identify students at risk of dropout, this study built a decision tree, random forest, logistic regression, and deep learning-based dropout prediction model using academic data that can be easily obtained from each university's academic management system. Their performances were subsequently analyzed and compared. The analysis revealed that while the logistic regression-based prediction model exhibited the highest recall rate, its f-1 value and ROC-AUC (Receiver Operating Characteristic - Area Under the Curve) value were comparatively lower. On the other hand, the random forest-based prediction model demonstrated superior performance across all other metrics except recall value. In addition, in order to assess model performance over distinct prediction periods, we divided these periods into short-term (within one semester), medium-term (within two semesters), and long-term (within three semesters). The results underscored that the long-term prediction yielded the highest predictive efficacy. Through this study, each university is expected to be able to identify students who are expected to be dropped out early, reduce the dropout rate through intensive management, and further contribute to the stabilization of university finances.

A Machine Learning-Based Vocational Training Dropout Prediction Model Considering Structured and Unstructured Data (정형 데이터와 비정형 데이터를 동시에 고려하는 기계학습 기반의 직업훈련 중도탈락 예측 모형)

  • Ha, Manseok;Ahn, Hyunchul
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.1
    • /
    • pp.1-15
    • /
    • 2019
  • One of the biggest difficulties in the vocational training field is the dropout problem. A large number of students drop out during the training process, which hampers the waste of the state budget and the improvement of the youth employment rate. Previous studies have mainly analyzed the cause of dropouts. The purpose of this study is to propose a machine learning based model that predicts dropout in advance by using various information of learners. In particular, this study aimed to improve the accuracy of the prediction model by taking into consideration not only structured data but also unstructured data. Analysis of unstructured data was performed using Word2vec and Convolutional Neural Network(CNN), which are the most popular text analysis technologies. We could find that application of the proposed model to the actual data of a domestic vocational training institute improved the prediction accuracy by up to 20%. In addition, the support vector machine-based prediction model using both structured and unstructured data showed high prediction accuracy of the latter half of 90%.

Performance Comparison of Neural Network and Gradient Boosting Machine for Dropout Prediction of University Students

  • Hyeon Gyu Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.8
    • /
    • pp.49-58
    • /
    • 2023
  • Dropouts of students not only cause financial loss to the university, but also have negative impacts on individual students and society together. To resolve this issue, various studies have been conducted to predict student dropout using machine learning. This paper presents a model implemented using DNN (Deep Neural Network) and LGBM (Light Gradient Boosting Machine) to predict dropout of university students and compares their performance. The academic record and grade data collected from 20,050 students at A University, a small and medium-sized 4-year university in Seoul, were used for learning. Among the 140 attributes of the collected data, only the attributes with a correlation coefficient of 0.1 or higher with the attribute indicating dropout were extracted and used for learning. As learning algorithms, DNN (Deep Neural Network) and LightGBM (Light Gradient Boosting Machine) were used. Our experimental results showed that the F1-scores of DNN and LGBM were 0.798 and 0.826, respectively, indicating that LGBM provided 2.5% better prediction performance than DNN.

Design of the Management System for Students at Risk of Dropout using Machine Learning (머신러닝을 이용한 학업중단 위기학생 관리시스템의 설계)

  • Ban, Chae-Hoon;Kim, Dong-Hyun;Ha, Jong-Soo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.6
    • /
    • pp.1255-1262
    • /
    • 2021
  • The proportion of students dropping out of universities is increasing year by year, and they are trying to identify risk factors and eliminate them in advance to prevent dropouts. However, there is a problem in the management of students at risk of dropping out and the forecast is inaccurate because crisis students are managed through the univariable analysis of specific risk factors. In this paper, we identify risk factors for university dropout and analyze multivariables through machine learning method to predict university dropout. In addition, we derive the optimization method by evaluation performance for various prediction methods and evaluate the correlation and contribution between risk factors that cause university dropout.