Browse > Article
http://dx.doi.org/10.5859/KAIS.2020.29.2.287

Study on the Prediction Model for Employment of University Graduates Using Machine Learning Classification  

Lee, Dong Hun (단국대학교 대학원 데이터지식서비스공학과)
Kim, Tae Hyung (단국대학교 대학원 데이터지식서비스공학과)
Publication Information
The Journal of Information Systems / v.29, no.2, 2020 , pp. 287-306 More about this Journal
Abstract
Purpose Youth unemployment is a social problem that continues to emerge in Korea. In this study, we create a model that predicts the employment of college graduates using decision tree, random forest and artificial neural network among machine learning techniques and compare the performance between each model through prediction results. Design/methodology/approach In this study, the data processing was performed, including the acquisition of the college graduates' vocational path survey data first, then the selection of independent variables and setting up dependent variables. We use R to create decision tree, random forest, and artificial neural network models and predicted whether college graduates were employed through each model. And at the end, the performance of each model was compared and evaluated. Findings The results showed that the random forest model had the highest performance, and the artificial neural network model had a narrow difference in performance than the decision tree model. In the decision-making tree model, key nodes were selected as to whether they receive economic support from their families, major affiliates, the route of obtaining information for jobs at universities, the importance of working income when choosing jobs and the location of graduation universities. Identifying the importance of variables in the random forest model, whether they receive economic support from their families as important variables, majors, the route to obtaining job information, the degree of irritating feelings for a month, and the location of the graduating university were selected.
Keywords
Machine Learning; Youth unemployment; Decision-Tree; Randome Forest; Artificial neural network;
Citations & Related Records
Times Cited By KSCI : 5  (Citation Analysis)
연도 인용수 순위
1 Oh, J. Y, Lee, W. G., Lee, J. M. and Park, M. S., "Big Data Use Cases of Last Mile Logistics," The Korea Institute of Information and Communication Engineering., 2019, pp. 121-123
2 Oh, S. G., "An analysis of the determinants of youth employment probability in Korea," Master's thesis, 2003, Yonsei University.
3 Park, G. H., "To foster leaders of the 4th Industrial Revolution Training market research," Ministry of Employment and Labor. 2018, pp. 167-171
4 Pal, M., "Random forest classifier for remote sensing classification," International Journal of Remote Sensing, Vol. 26, No. 1, 2005, pp. 2-3.
5 Park, K. Y. and Cheon, Y. M., "A Factor analysis of employment for college graduates," Employment survey Conference 2016, 2016.
6 Podgorelec, V., Kokol, P., Stiglic, B. and Rozman, I., "Decision trees: an overview and their use in medicine," Journal of Medical Systems, Kluwer Academic/ Plenum Press, Vol. 26, No. 5, 2002, pp. 9-10.   DOI
7 Son, J. S., "A study on transportation policy using big data: Focusing Seoul city late night bus case," Korean Policy Studies Review. 2019. pp. 19-34
8 Sutton, R. and Barto, A., "Reinforcement Learning: An Introduction," A Bradford Book, USA, England. 2015, pp. 2-4
9 Zhao, Y. and Zhang, Y., "Comparison of decision tree methods for finding active objects," Advances in Space Research, Vol. 41, No. 12, 2007, pp. 3-4.
10 Zhu, W., Zeng, W. and Wang, N., "Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS${(R)}$," Health Care and Life Sciences, Implementations, 2010, pp. 1-2.
11 Zurada, J., "Introduction to Arificial Neural Systems," West Publishing Company, USA. 1992, pp. 37-38
12 Gil, H. Y. and Choi, Y. M., "Analysis of Determinants of Employment Types of Graduates," The Journal of Vocational Education Research, Vol. 33, No. 6. 2014, pp. 13-19.
13 Choi, P. S. and Min, I. S., "Employment prediction model for college graduates using machine learning techniques," Vocational competency development research. Vol. 21, No. 1, 2018, pp. 32-38.
14 Chung, T. Y., and Lee, K. Y., "Determinants of Job Finding among College Graduates - with Emphasis on the Effects of GPA -," Korea Business Review, Vol. 8, No. 2, 2006, pp. 159-184
15 Geron, A. "Hands-On Machine Learning with Scikit-Learn and TensorFlow," O'Reilly, USA. 2017, pp. 10-12.
16 Hong, G. S., "Study on determinants of youth unemployment," Analysis of the Korean economy. Vol. 24, No. 2, 2018, pp. 91-93.
17 Hssina, B., Merbouha, A., Ezzikouri, H., and Erritali, M., "A comparative study of decision tree ID3 and C4.5," International Journal of Advanced Computer Science and Applications, Special Issue on Advances in Vehicular Ad Hoc Networking and Applications: 2014, pp. 13-18.
18 Jung, M. N. and Yim, Y. S., "Path analysis of variables related to entering the labor market among college graduates," The Journal of Career Education Research, Vol. 23, No. 2, 2010, pp. 143-146.
19 Kang, H. Y., "Big data application examples and utilization strategies," Magazine of the SAREK, Vol. 45, No. 1, 2016, pp. 32-33.
20 Kaisler, S., Armour, F., Espinosa, J., and Monet, W., "Big Data: Issues and Challenges Moving Forward," Hawaii International Conference on System Sciences, 46, 2016, pp. 996-997
21 Kim, D. A., Kang, D. A., and Song, J. W., "Classification Analysis for Unbalanced Data," The Korean Journal of Applied Statistics, Vol. 28, No. 3, 2015, pp. 495.   DOI
22 Kwak, H. and Lee, S. W., "Competitiveness Analysis for Artificial Intelligence Technology through Patent Analysis," Journal of Information Systems, Vol 28, No. 3., 2019, pp. 141-158.   DOI
23 Kim, J. K., "Domestic and foreign big data trends and success stories," Industrial Engineering Magazine, Vol. 23, No. 1, 2016. pp. 48-49.
24 Kim, J. S., "Big data analysis technology and use cases," Journal of Contents, Vol. 12, No. 1. 2014, pp. 18-19.
25 Kim, S. H., "The effect of job preparation activities of young college graduates on entering the labor market: focusing on whether and when to get a full-time job," Education Culture Research, Vol. 24, 2014, pp. 313-318.
26 Magerman, D., "Statistical Decision-Tree Models for Parsing," ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics. 1995, pp. 276-279.
27 Lee, J. H., Lee, C. K. and Lee, H. H., "An Analysis of College Graduates Employment factors using Data Mining: The Importance of Volunteer Services," Logos Management Review, Vol. 17, No. 2., 2019, pp. 143-156.
28 Lee, H. W., Lee, S. R., and Chung, K, K., "The impact of Artificial Intelligence Adoption in Canadidates Screening and Joub Interview on Intentions to Apply," Journal of Information Systems, Vol. 28, No. 2, 2019, pp. 25-52.   DOI
29 Lee, Y. J., Lee, S. H., and Lee, J. S., "KB Card's marketing activities and bigdata utilization," Korea Business Review, Vol. 18, No. 1, 2014, pp. 162-163.
30 Marsland, S., "Machine Learning Algorithmic Perspective," CRC Press, USA. 2015. pp. 6-9.
31 Mingers, J., "An empirical comparison of pruning methods for decision tree induction," Machine Learning, 4, 1989, pp. 241-242.   DOI
32 Noh, K. R. and Heo, S. J., "Analysis of Factors Influencing Achievement of Employment Goals," The Journal of Vocational Education Research, Vol. 1, No. 22, 2015, pp. 10-14.
33 Breiman, L., Friedman, J., Olshen, L., and Stone, J., "Classification and Regression trees," CHAPMAN and HALL/CRC, USA. 1989. pp. 55-58.
34 Agatonovic, S. and Beresford, R. "Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research," Journal of Pharmaceutical and Biomedical Analysis, Vol. 22. 1999, pp. 718-720.
35 Alpaydin, E. "Introduction to Machine Learning. second edition," The MIT Press, Cambridge, Massachusetts, USA. 2010. pp. 20-24.
36 Anyanwu. M. and Shiva. S., "Comparative Analysis of Serial Decision Tree Classification Algorithms," International Journal of Computer Science and Security, Vol. 3, No. 3, 2009, pp. 233.
37 Breiman, L. "RANDOM FORESTS," 1999. pp. 2-3.
38 Breiman, L. "RANDOM FORESTS," 2001. pp. 7-8.
39 Chae, C. G. and Kim, T. G., "Analysis of Determinants of Employment Performance of Young College Students," The Journal of Vocational Education Research, Vol. 28, No. 2., 2009. pp. 96-99.
40 Chawla, N. V., Bower, K. W., Hall, L. O., and Kegelmeyer, W. P., "SMOTE: Synthetic Minority Over-sampling Technique," Journal of Artificial Intelligence Research, Vol. 16, 2002, pp. 321-357.   DOI
41 Choi, I. S. and Shin, E. J., "An Empirical Study of the Determinants of Successful Job Seeking of College Students - Focusing on the Impacts of Job Education Programs," Economic Education Research, Vol. 23, No. 1. 2016, pp. 23-49.
42 Choi, J. H. and Seo, D. S., "Application of data mining decision tree," Statistical Analysis Research. Vol. 4, No. 1. 1999, pp. 62, 63-67.