http://dx.doi.org/10.13088/jiis.2018.24.4.001

Corporate Default Prediction Model Using Deep Learning Time Series Algorithm, RNN and LSTM  

Cha, Sungjae (AIZEN GLOBAL)
Kang, Jungseok (AIZEN GLOBAL)
Publication Information
Journal of Intelligence and Information Systems / v.24, no.4, 2018, pp. 1-32
Abstract
Corporate defaults affect not only the stakeholders of the bankrupt company, including managers, employees, creditors, and investors, but also ripple through the local and national economy. Before the Asian financial crisis, the Korean government analyzed only SMEs and tried to improve the forecasting power of a single default prediction model rather than developing a variety of corporate default models. As a result, even large corporations, the so-called 'chaebol enterprises', went bankrupt. Even afterward, analysis of past corporate defaults focused on specific variables, and when the government carried out restructuring immediately after the global financial crisis, it focused only on certain main variables such as the 'debt ratio'. A multifaceted study of corporate default prediction models is essential to protect diverse interests and to avoid a sudden total collapse like the 'Lehman Brothers Case' of the global financial crisis. The key variables used in predicting corporate defaults vary over time. Comparing the analyses of Beaver (1967, 1968) and Altman (1968) with Deakin's (1972) study confirms that the major factors affecting corporate failure have changed. Grice's (2001) study reached a similar finding on the importance of predictive variables using the models of Zmijewski (1984) and Ohlson (1980). However, past studies use static models, and most do not account for changes that occur over time. Therefore, to construct consistent prediction models, it is necessary to compensate for time-dependent bias with a time series analysis algorithm that reflects dynamic change. Centered on the global financial crisis, which had a significant impact on Korea, this study uses 10 years of annual corporate data from 2000 to 2009. The data are divided into training, validation, and test sets covering 7, 2, and 1 years, respectively.
To construct a bankruptcy model that remains consistent over time, we first train a time series deep learning model on data from before the financial crisis (2000~2006). The parameters of the existing models and of the deep learning time series algorithm are tuned on validation data that include the financial crisis period (2007~2008). The resulting model shows a pattern similar to its results on the training data and excellent predictive power. Each bankruptcy prediction model is then rebuilt on the combined training and validation data (2000~2008), applying the optimal parameters found in validation. Finally, each corporate default prediction model, trained on these nine years, is evaluated and compared on the test data (2009). This demonstrates the usefulness of the corporate default prediction model based on the deep learning time series algorithm. In addition, by adding Lasso regression to the existing variable selection methods (multiple discriminant analysis and the logit model), the study shows that the deep learning time series model based on the three bundles of variables is useful for robust corporate default prediction. The definition of bankruptcy follows Lee (2015). Independent variables include financial information such as the financial ratios used in previous studies. Multivariate discriminant analysis, the logit model, and the Lasso regression model are used to select the optimal variable group. The multivariate discriminant analysis model proposed by Altman (1968), the logit model proposed by Ohlson (1980), the non-time series machine learning algorithms, and the deep learning time series algorithms are compared. Corporate data have the limitations of 'nonlinear variables', 'multi-collinearity' among variables, and 'lack of data'.
The logit model is nonlinear, the Lasso regression model resolves the multi-collinearity problem, and the deep learning time series algorithm, using the variable data generation method, compensates for the lack of data. Big data technology, a leading technology of the future, is moving from simple human analysis to automated AI analysis and, eventually, toward intertwined AI applications. Although the study of corporate default prediction models using time series algorithms is still in its early stages, the deep learning algorithm is much faster than regression analysis for corporate default prediction modeling and also has greater predictive power. With the Fourth Industrial Revolution, the current Korean government and governments overseas are working hard to integrate such systems into the everyday life of their nations and societies, yet deep learning time series research for the financial industry remains insufficient. This is an initial study on deep learning time series analysis of corporate defaults. It is therefore hoped that it will serve as comparative analysis material for non-specialists beginning research that combines financial data with deep learning time series algorithms.
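The year-based split described in the abstract (train on 2000~2006, validate on 2007~2008, refit on 2000~2008, test on 2009) can be sketched as follows. This is only a minimal illustration under stated assumptions: the record layout (`firm_id`, `year`, `default`), the function names, and the toy data are hypothetical and are not taken from the paper.

```python
# Hypothetical sketch of a chronological train/validation/test split.
# Field names and helper names are illustrative assumptions, not the authors' code.

TRAIN_YEARS = range(2000, 2007)   # 2000~2006, before the financial crisis
VALID_YEARS = range(2007, 2009)   # 2007~2008, includes the crisis period
TEST_YEARS = range(2009, 2010)    # 2009, held out for the final comparison


def split_by_year(records):
    """Assign each firm-year record to a split based only on its year."""
    splits = {"train": [], "valid": [], "test": []}
    for rec in records:
        year = rec["year"]
        if year in TRAIN_YEARS:
            splits["train"].append(rec)
        elif year in VALID_YEARS:
            splits["valid"].append(rec)
        elif year in TEST_YEARS:
            splits["test"].append(rec)
        # Records outside 2000~2009 are ignored.
    return splits


def refit_set(splits):
    """Combine training and validation data (2000~2008) for the final refit."""
    return splits["train"] + splits["valid"]


# Toy data: three hypothetical firms observed once per year, 2000 to 2009.
records = [
    {"firm_id": firm, "year": year, "default": 0}
    for firm in ("A", "B", "C")
    for year in range(2000, 2010)
]

splits = split_by_year(records)
refit = refit_set(splits)
```

Splitting by calendar year rather than at random keeps every validation and test observation strictly later than the data used for fitting, which is the property a time-aware default model needs.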
Keywords
Optimal Feature Selection; Lasso Regression; Deep Learning Time Series Algorithm; Corporate Bankruptcy; RNN; LSTM
Citations & Related Records
Times Cited By KSCI: 4
1 Deakin, E. B., "A Discriminant Analysis of Predictors of Business Failure", Journal of Accounting Research, Vol.10, No.1, (1972), 167-179.
2 Grice, J. S. and M. T. Dugan, "The Limitations of Bankruptcy Prediction Models: Some Cautions for the Researcher", Review of Quantitative Finance and Accounting, Vol.17, No.2, (2001), 151-166.
3 Hong, S. H. and K. S. Shin, "Using GA based Input Selection Method for Artificial Neural Network Modeling; Application to Bankruptcy Prediction", Journal of Intelligence and Information Systems, Vol.9, No.1, (2003), 227-249.
4 Jo, N. O., H. J. Kim and K. S. Shin, "Bankruptcy Type Prediction Using A Hybrid Artificial Neural Networks Model", Journal of Intelligence and Information Systems, Vol.21, No.3, (2015), 79-99.
5 Jo, N. O. and K. S. Shin, "Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics", Journal of Intelligence and Information Systems, Vol.22, No.2, (2016), 33-56.
6 Kapinos, P. and O. A. Mitnik, "A Top-Down Approach to Stress-Testing Banks", Journal of Financial Services Research, Vol.49, No.2, (2016), 229-264.
7 Kim, G. P., H. K. Lee, J. H. Kim and H. J. Kwon, "The Fourth Industrial Revolution in Major Countries and Growth Strategy of Korea: U.S., Germany and Japan Cases", Korea Institute for International Economic Policy, Policy Analysis, (2017).
8 Kim, J. B. and J. S. Lee, "Usability of Cash Flow Data in Predicting Bankruptcy Using Artificial Intelligence Techniques: The Case of Small and Medium Sized Firms", Korean Journal of Business Administration, No.26, (2000), 229-250.
9 Kim, M. J., "Ensemble Learning for Solving Data Imbalance in Bankruptcy Prediction", Journal of Intelligence and Information Systems, Vol.15, No.3, (2009), 1-15.
10 Kim, M. J., H. B. Kim and D. K. Kang, "Optimizing SVM Ensembles Using Genetic Algorithms in Bankruptcy Prediction", Journal of information and communication convergence engineering, Vol.8, No.4, (2010), 370-376.
11 Kim, M. J., "Ensemble Learning with Support Vector Machines for Bond Rating", Journal of Intelligence and Information Systems, Vol.18, No.2, (2012), 29-45.
12 Kim, S. B., P. Ji and K. J. Jo, "The Analysis on the Causes of Corporate Bankruptcy with the Bankruptcy Prediction Model", Journal of Market Economy, Vol.40, No.1, (2011), 85-106.
13 Kim, S. J. and H. C. Ahn, "Estimation Model applied Random Forest for Corporate Bond Ratings", Journal of Intelligence and Information Systems, Spring Conference, (2014), 371-376.
14 Kim, Y. D., C. H. Jun and H. S. Lee, "A new classification method using penalized partial Least squares", Journal of the Korean Data and Information Science Society, Vol.22, No.5, (2011), 931-940.
15 Kim, Y. T. and M. H. Kim, "An Artificial Neural Network Model for Business Failure Prediction", Korean Journal of Accounting Research, Vol.6, No.1, (2001), 275-294.
16 Kwon, H. K., D. K. Lee and M. S. Shin, "Dynamic forecasts of bankruptcy with Recurrent Neural Network model", Journal of Intelligence and Information Systems, Vol.23, No.3, (2017), 139-153.
17 Lee, I. R. and D. C. Kim, "Evaluation of Bankruptcy Prediction Model Using Accounting Information and Market Information", Journal of Korean Finance Association, Vol.28, No.4, (2015), 626-666.
18 Lee, J. S. and J. H. Han, "Test of Non-Financial Information in Bankruptcy Prediction using Artificial Neural Network - The Case of Small and Medium-Sized Firms", Journal of Intelligence and Information Systems, Vol.1, No.1, (1995), 123-134.
19 Min, S. H., "Bankruptcy prediction using an improved bagging ensemble", Journal of Intelligence and Information Systems, Vol.20, No.4, (2014), 121-139.
20 Lee, K. C., "Comparative Study on the Bankruptcy Prediction Power of Statistical Model and AI Models: MDA, Inductive Learning, Neural Network", Journal of the Korean Operations Research and Management Science Society, Vol.18, No.2, (1993), 57-81.
21 Min, S. H., "Simultaneous optimization of KNN ensemble model for bankruptcy prediction", Journal of Intelligence and Information Systems, Vol.22, No.1, (2016), 139-157.
22 No, G. M. and W. G. Han, "ICT Policy Direction After 100-days Moon Jae-in government launched.", National Information Society Agency, Hot Issue Report, (2017).
23 Ohlson, J. A., "Financial Ratios and the Probabilistic Prediction of Bankruptcy", Journal of Accounting Research, (1980), 109-131.
24 Park, J. Y., Y. W. Kim and M. Y. Lee, "A Prediction Model of Small Business Bankruptcy", Journal of Korean Logos Management, Conference, (2007), 202-204.
25 Presidential Committee on the Fourth Industrial Revolution, "Data Industry Promotion Strategy - I-KOREA 4.0 Data Field Plan, I-DATA+", (2017).
26 Shapiro, S. S. and M. B. Wilk, "An analysis of variance test for normality (complete samples)", Biometrika, Vol.52, (1965), 591-611.
27 Swedberg, R., "The Structure of Confidence and the Collapse of Lehman Brothers", Research in the Sociology of Organizations, (2009).
28 Yeh, S., C. Wang and M. Tsai, "Corporate default prediction via deep learning", Wireless and Optical Communication Conference, Vol.24, 1-8.
29 Wang, H., Q. Xu and L. Zhou, "Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble", PLoS One, San Francisco, Vol.10, No.2, (2015).
30 Welch, B. L., "'Student' and Small Sample Theory", Journal of the American Statistical Association, Vol.53, No.284, (1958), 777-788.
31 Zmijewski, M. E., "Methodological issues related to the estimation of financial distress prediction models", Studies on Current Econometric Issues in Accounting Research, Vol.22, (1984), 59-82.
32 Tibshirani, R., "Regression Shrinkage and Selection via the Lasso", Journal of the Royal Statistical Society, Series B (Methodological), Vol.58, No.1, (1996), 267-288.
33 Bae, J. K., "An Integrated Approach to Predict Corporate Bankruptcy with Voting Algorithms and Neural Networks", Korean Business Review, Vol.3, No.2, (2010), 79-101.
34 Addal, S., "Financial forecasting using machine learning", African Institute for Mathematical Science, (2016), 1-32.
35 Ahn, S. M., and J. W. Park, "Corporate Bankruptcy Prediction Using Financial Ratios: Focused on the Korean Manufacturing Companies Audited by External Auditors", Korean Management Review, Vol.43, No.3, (2014), 639-669.
36 Altman, E. I., "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy", Journal of Finance, Vol.23, No.4, (1968), 589-609.
37 Beaver, W. H., "Financial ratios as predictors of bankruptcy", Journal of Accounting Research, Supplement, (1966), 71-102.