Browse > Article
http://dx.doi.org/10.3745/JIPS.04.0119

Classification of Imbalanced Data Based on MTS-CBPSO Method: A Case Study of Financial Distress Prediction  

Gu, Yuping (School of Management Science and Engineering, Anhui University of Finance and Economics)
Cheng, Longsheng (School of Economics and Management, Nanjing University of Science and Technology)
Chang, Zhipeng (School of Business, Anhui University of Technology)
Publication Information
Journal of Information Processing Systems / v.15, no.3, 2019 , pp. 682-693 More about this Journal
Abstract
The traditional classification methods mostly assume that the data for class distribution is balanced, while imbalanced data is widely found in the real world. So it is important to solve the problem of classification with imbalanced data. In Mahalanobis-Taguchi system (MTS) algorithm, data classification model is constructed with the reference space and measurement reference scale which is come from a single normal group, and thus it is suitable to handle the imbalanced data problem. In this paper, an improved method of MTS-CBPSO is constructed by introducing the chaotic mapping and binary particle swarm optimization algorithm instead of orthogonal array and signal-to-noise ratio (SNR) to select the valid variables, in which G-means, F-measure, dimensionality reduction are regarded as the classification optimization target. This proposed method is also applied to the financial distress prediction of Chinese listed companies. Compared with the traditional MTS and the common classification methods such as SVM, C4.5, k-NN, it is showed that the MTS-CBPSO method has better result of prediction accuracy and dimensionality reduction.
Keywords
Chaotic Binary Particle Swarm Optimization (CBPSO); Financial Distress Prediction; Mahalanobis-Taguchi System (MTS); Variable Selection;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 R. Geng, I. Bose, and X. Chen, "Prediction of financial distress: an empirical study of listed Chinese companies using data mining," European Journal of Operational Research, vol. 241, no. 1, pp. 236-247, 2015.   DOI
2 S. Maldonado and J. Lopez, "Imbalanced data classification using second-order cone programming support vector machines," Pattern Recognition, vol. 47, no. 5, pp. 2070-2079, 2014.   DOI
3 G. Taguchi and R. Jugulum, The Mahalanobis-Taguchi strategy: A Pattern Technology System. New York, NY: John Wiley & Sons, 2002.
4 P. Shakya, M. S. Kulkarni, and A. K. Darpe, "Bearing diagnosis based on Mahalanobis-Taguchi-Gram-Schmidt method," Journal of Sound and Vibration, vol. 337, pp. 342-362, 2015.   DOI
5 B. John, "Application of Mahalanobis-Taguchi system and design of experiments to reduce the field failures of splined shafts," International Journal of Quality & Reliability Management, vol. 31, no. 6, pp. 681-697, 2014.   DOI
6 X. Jin and T. W. Chow, "Anomaly detection of cooling fan and fault classification of induction motor using Mahalanobis-Taguchi system," Expert Systems with Applications, vol. 40, no. 15, pp. 5787-5795, 2013.   DOI
7 B. Valarmathi and V. Palanisamy, "Opinion mining of customer reviews using Mahalanobis-Taguchi system," European Journal of Scientific Research, vol. 62, no. 1, pp. 95-100, 2011.
8 S. E. Abbasi, A. Aaghaie, and M. Fazlali, "Applying Mahalanobis-Tagouchi system in detection of high risk customers: a case-based study in an insurance company," Journal of Industrial Engineering, vol. 45, no. 2, pp. 1-12, 2011.
9 C. L. Huang, Y. H. Chen, and T. L. J. Wan, "The Mahalanobis Taguchi system: adaptive resonance theory neural network algorithm for dynamic product designs," Journal of Information and Optimization Sciences, vol. 33, no. 6, pp. 623-635, 2012.   DOI
10 W. H. Woodall, R. Koudelik, K. L. Tsui, S. B. Kim, Z. G. Stoumbos, and C. P. Carvounis, "A review and analysis of the Mahalanobis-Taguchi system," Technometrics, vol. 45, no. 1, pp. 1-15, 2003.   DOI
11 R. Jugulum, G. Taguchi, S. Taguchi, and J. O. Wilkins, "Discussion: a review and analysis of the Mahalanobis-Taguchi system," Technometrics, vol. 45, no. 1, pp. 16-21, 2003.   DOI
12 M. Sokolova and G. Lapalme, "A systematic analysis of performance measures for classification tasks," Information Processing & Management, vol. 45, no. 4, pp. 427-437, 2009.   DOI
13 R. Eberhart and J. Kennedy, "A new optimizer using particle swarm theory," in Proceedings of the 6th International Symposium on Micro Machine and Human Science, Nagoya, Japan, 1995, pp. 39-43.
14 J. Kennedy and R. C. Eberhart, "A discrete binary version of the particle swarm algorithm," in Proceedings of 1997 IEEE International Conference on Systems, Man, and Cybernetics: Computational Cybernetics and Simulation, Orlando, FL, 1997, pp. 4104-4108.
15 X. Xu and Z. Xiao, "Soft set theory oriented forecast combination method for business failure prediction," Journal of Information Processing Systems, vol. 12, no. 1, pp. 109-128, 2016.   DOI
16 L. Y. Chuang, C. H. Yang, and J. C. Li, "Chaotic maps based on binary particle swarm optimization for feature selection," Applied Soft Computing, vol. 11, no. 1, pp. 239-248, 2011.   DOI