DOI QR코드

DOI QR Code

An Application of Support Vector Machines to Personal Credit Scoring: Focusing on Financial Institutions in China

Support Vector Machines을 이용한 개인신용평가 : 중국 금융기관을 중심으로

  • Ding, Xuan-Ze (Dept. of Business Administration, Dongguk University) ;
  • Lee, Young-Chan (Dept. of Business Administration, Dongguk University)
  • Received : 2018.11.26
  • Accepted : 2018.12.14
  • Published : 2018.12.31

Abstract

Personal credit scoring is an effective tool for banks to properly guide decision profitably on granting loans. Recently, many classification algorithms and models are used in personal credit scoring. Personal credit scoring technology is usually divided into statistical method and non-statistical method. Statistical method includes linear regression, discriminate analysis, logistic regression, and decision tree, etc. Non-statistical method includes linear programming, neural network, genetic algorithm and support vector machine, etc. But for the development of the credit scoring model, there is no consistent conclusion to be drawn regarding which method is the best. In this paper, we will compare the performance of the most common scoring techniques such as logistic regression, neural network, and support vector machines using personal credit data of the financial institution in China. Specifically, we build three models respectively, classify the customers and compare analysis results. According to the results, support vector machine has better performance than logistic regression and neural networks.

개인신용평가는 은행이 대출을 승인할 때 수익성 있는 의사결정을 적절히 유도할 수 있는 효과적인 도구이다. 최근 많은 분류 알고리즘 및 모델이 개인신용평가에 사용되고 있다. 개인신용평가 기법은 대체로 통계적 방법과 비 통계적 방법으로 구분된다. 통계적 방법에는 선형회귀분석, 판별분석, 로지스틱 회귀분석, 의사결정나무 등이 포함된다. 비 통계적 방법에는 선형계획법, 신경망, 유전자 알고리즘 및 Support Vector Machines 등이 포함된다. 그러나 신용평가모형 개발을 위해 어떠한 방법이 최선인지에 관해서는 일관된 결론을 내리기는 어렵다. 본 논문에서는 중국 금융기관의 개인 신용 데이터를 사용하여 가장 대표적인 신용평가 기법인 로지스틱 회귀분석, 신경망 그리고 Support Vector Machines의 성능을 비교하고자 한다. 구체적으로, 세 가지 모형을 각각 구축하여 고객을 분류하고 분석 결과를 비교하였다. 분석결과에 따르면, Support Vector Machines이 로지스틱 회귀분석과 신경망보다 더 나은 성능을 가지는 것으로 나타났다.

Keywords

References

  1. Marques, A. I., Garcia, V., and Sanchez, J. S. (2013), "On the suitability of resampling techniques for the class imbalance problem in credit scoring", Journal of the Operational Research Society, 64(7), 1060-1070. https://doi.org/10.1057/jors.2012.120
  2. Thomas, L. C., Edelman, D. B., and Crook, L. N. (2002), Credit Scoring and I ts Applications, Philadelphia: Society for Industrial and Applied Mathematics.
  3. West, D. (2002), "Neural network credit scoring models", Computers and Operations Research, 27(12), 1131-1152.
  4. Guardia, N. (2002), "Consumer credit in the European Union", ECRI Research Report 1, 1-39.
  5. Richard, D., and John, G. (2013), "Financial literacy and consumer credit portfolios", Journal of Banking & Finance, 37(7), 2246-2254. https://doi.org/10.1016/j.jbankfin.2013.01.013
  6. Fisher, R. A., "The Use of Multiple Measurements in Taxonomic Problems", Annals of Eugenics, Vol. 7, No. 2, 1936, pp. 179-188. https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
  7. West, D. (2000), "Neural network credit scoring models", Computers & Operations Research, 27(12), 1131-1152. https://doi.org/10.1016/S0305-0548(99)00149-5
  8. Pavlidis, N., Tasoulis, D., Adams, N., and Hand, D. (2012), "Adaptive consumer credit classification", Journal of the Operational Research Society, 63(12), 1645-1654. https://doi.org/10.1057/jors.2012.15
  9. Yap, B., Ong, S., and Husain, N. (2011), "Using data mining to improve assessment of credit worthiness via credit scoring models", Expert Systems with Applications, 38(10), 13274-13283. https://doi.org/10.1016/j.eswa.2011.04.147
  10. Cock, M. D., Dowsley, R., Horst, C., Katti, R., Nascimento, A., & Poon, W. S. (2017)., "Efficient and private scoring of decision trees, support vector machines and logistic regression models based on pre-computation", IEEE Transactions on Dependable & Secure Computing, 16(2), 217-230.
  11. Ripley, B. D. (1996), Pattern Recognition and Neural Networks, Cambridge University Press.
  12. Abdou, H., Pointon, J., and El-Masry, A. (2008), "Neural nets versus conventional techniques in credit scoring in egyptian banking", Expert Systems with Applications, 35(3), 1275-1292. https://doi.org/10.1016/j.eswa.2007.08.030
  13. Marcano-Cedeno, A., Marin-De-La-Barcena, A., Jimenez-Trillo, J., Pinuela, J., and Andina, D. (2011), "Artificial metaplasticity neural network applied to credit scoring", International Journal of Neural Systems, 21(4), 311-317. https://doi.org/10.1142/S0129065711002857
  14. Pang, S.-L. (2005), "Study on credit scoring model and forecasting based on probabilistic neural network", System Engineering Theory and Practice, 25(5), 43-48.
  15. Ayouche, S., Aboulaich, R., & Ellaia, R. (2017). "Partnership credit scoring classification problem: a neural network approach", International Journal of Applied Engineering Research, 12(5), 693-704.
  16. Chi, G., Abedin, MZ., and Fahmida, E.M. (2017), "Chinese Small Business Credit Scoring: Application of Multiple Hybrids Neural Network", International Journal of Database Theory and Application, 10(2), 1-22. https://doi.org/10.14257/ijdta.2017.10.2.01
  17. Cristianini, N., and Shawe-Taylor, J. (2000), An introduction to support vector machines, Cambridge, England: Cambridge University Press.
  18. Gunn, S. R. (1998), "Support vector machines for classification and regression", Technical Report, University of Southampton.
  19. Hearst, M. A., Dumais, S. T., Osman, E., Platt, J., and Scholkopf, B. (1998), "Support vector machines", IEEE Intelligent System, 13(4), 18-28.
  20. Vapnik, V. (1998), Statistical learning theory, New York: Springer.
  21. Lee, Y. C. (2006), "Application of support vector machines to corporate credit rating prediction", Expert Systems with Applications, 33(1), 67-74. https://doi.org/10.1016/j.eswa.2006.04.018
  22. Chen, W., Ma, C., and Ma, L. (2009), "Mining the customer credit using hybrid support vector machine technique", Expert Systems with Applications, 36(4), 7611-7616. https://doi.org/10.1016/j.eswa.2008.09.054
  23. Zhou, L., Lai, K., Yu, L. (2010), "Least squares support vector machines ensemble models for credit scoring", Expert Systems with Applications, 37(1), 127-133. https://doi.org/10.1016/j.eswa.2009.05.024
  24. Li, Z., Tian, Y., Li, K., Zhou, F., and Yang, W. (2017), "Reject inference in credit scoring using semi-supervised support vector machines", Expert Systems with Applications, 74, 105-114. https://doi.org/10.1016/j.eswa.2017.01.011
  25. Shi, J., and Xu, B., "Credit scoring by fuzzy support vector machines with a novel membership function", Journal of Risk and Financial Management, 9(4), 13-23. https://doi.org/10.3390/jrfm9040013