Browse > Article
http://dx.doi.org/10.5859/KAIS.2017.26.4.17

An Application of Support Vector Machines to Customer Loyalty Classification of Korean Retailing Company Using R Language  

Nguyen, Phu-Thien (International Business Cooperative Course, Graduate School of Dongguk University)
Lee, Young-Chan (Dept. of Business Administration, Dongguk University)
Publication Information
The Journal of Information Systems / v.26, no.4, 2017 , pp. 17-37 More about this Journal
Abstract
Purpose Customer Loyalty is the most important factor of customer relationship management (CRM). Especially in retailing industry, where customers have many options of where to spend their money. Classifying loyal customers through customers' data can help retailing companies build more efficient marketing strategies and gain competitive advantages. This study aims to construct classification models of distinguishing the loyal customers within a Korean retailing company using data mining techniques with R language. Design/methodology/approach In order to classify retailing customers, we used combination of support vector machines (SVMs) and other classification algorithms of machine learning (ML) with the support of recursive feature elimination (RFE). In particular, we first clean the dataset to remove outlier and impute the missing value. Then we used a RFE framework for electing most significant predictors. Finally, we construct models with classification algorithms, tune the best parameters and compare the performances among them. Findings The results reveal that ML classification techniques can work well with CRM data in Korean retailing industry. Moreover, customer loyalty is impacted by not only unique factor such as net promoter score but also other purchase habits such as expensive goods preferring or multi-branch visiting and so on. We also prove that with retailing customer's dataset the model constructed by SVMs algorithm has given better performance than others. We expect that the models in this study can be used by other retailing companies to classify their customers, then they can focus on giving services to these potential vip group. We also hope that the results of this ML algorithm using R language could be useful to other researchers for selecting appropriate ML algorithms.
Keywords
Support Vector Machines, SVMs; Customer Relationship Management, CRM; Recursive Feature Elimination, RFE; Random Forest, RF; R Language; Loyalty; Korean Retailing; Customer Classification;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Felzenszwalb, P. F., Girshick, R. B., McAllester, D., and Ramanan, D., "Object detection with discriminatively trained part-based models," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, 2010, pp. 1627-1645.   DOI
2 Fisher, R. A., "The use of multiple measurements in taxonomic problems," Annals of Human Genetics, Vol. 7, No. 2, 1936, pp. 179-188.
3 Goodhue, D. L., Wixom, B. H., and Watson, H. J., "Realizing business benefits through CRM: hitting the right target in the right way." MIS Quarterly Executive, Vol. 1, No. 2, 2002, pp. 79-94.
4 Granitto, P. M., Furlanello, C., Biasioli, F., and Gasperi, F., "Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products," Chemometrics and Intelligent Laboratory Systems, Vol. 83, No. 2, 2006, pp. 83-90.   DOI
5 Gremler, D. D., and Brown, S. W., "Service loyalty: its nature, importance, and implications," Advancing Service Quality: A Global Perspective, Vol. 5, 1996, pp. 171-181.
6 Guyon, I., and Elisseeff, A., "An introduction to variable and feature selection," Journal of Machine Learning Research, Vol. 3, 2003, pp. 1157-1182.
7 He, Z., Xu, X., Huang, J. Z., and Deng, S., "Mining class outliers: concepts, algorithms and applications in CRM," Expert Systems with Applications, Vol. 27, No. 4, 2004, pp. 681-697.   DOI
8 Ho, T. K., "Random decision forests," Proceedings of the Third International Conference on, 1995, pp. 278-282.
9 Hosseini, S. M. S., Maleki, A., and Gholamian, M. R., "Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty," Expert Systems with Applications, Vol. 37, No. 7, 2010, pp. 5259-5264.   DOI
10 Hsu, C. W., Chang, C. C., and Lin, C. J., "A practical guide to support vector classification," 2003.
11 Deakin, E. B., "A discriminant analysis of predictors of business failure," Journal of Accounting Research, Vol. 10, 1972, pp. 167-179.   DOI
12 Breiman, L., "Random forests," Machine Learning, Vol. 45, No. 1, 2001, pp. 5-32.   DOI
13 Coussement, K., and Van den Poel, D., "Churn prediction in subscription services: An application of support vector machines while comparing two parameterselection techniques," Expert Systems with Applications, Vol. 34, No. 1, 2008, pp. 313-327.   DOI
14 Cui, D., and Curry, D., "Prediction in marketing using the support vector machine," Marketing Science, Vol. 24, No. 4, 2005, pp. 595-615.   DOI
15 Delen, D., "A comparative analysis of machine learning techniques for student retention management," Decision Support Systems, Vol. 49, No. 4, 2010, pp. 498-506.   DOI
16 Dudyala, A. K. and Ravi, V., "Predicting credit card customer churn in banks using data mining," International Journal of Data Analysis Techniques and Strategies, Vol. 1, No. 1, 2008, pp 4-28.   DOI
17 Farquad, M. A. H., Ravi, V., and Raju, S. B., "Data mining using rules extracted from SVM: an application to churn prediction in bank credit cards," Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, Vol. 5908, 2009, pp. 390-397.
18 Bensic, M., Sarlija, N., and Zekic-Susac, M., "Modelling small-business credit scoring by using logistic regression, neural networks and decision trees," Intelligent Systems in Accounting, Finance and Management, Vol. 13, No. 3, 2005, pp. 133-150.   DOI
19 Altman, E. I., "Financial ratios, discriminant analysis and the prediction of corporate bankruptcy," The Journal of Finance, Vol. 23, No. 4, 1968, pp. 589-609.   DOI
20 Ball, D., Coelho, P. S., and Machas, A., "The role of communication and trust in explaining customer loyalty: An extension to the ECSI model," European Journal of Marketing, Vol. 38, No. 9/10, 2004, pp. 1272-1293.   DOI
21 Blum, A., and Mitchell, T., "Combining labeled and unlabeled data with co-training," Proceedings of the Eleventh Annual Conference on Computational Learning Theory, ACM, 1998.
22 Breiman, L., "Bagging predictors," Machine Learning, Vol. 24, No. 2, 1996, pp. 123-140.   DOI
23 Johannes, M., Brase, J. C., Frohlich, H., Gade, S., Gehrmann, M., Falth, M., Sultmann, H., and BeiBbarth, T., "Integration of pathway knowledge into a reweighted recursive feature elimination approach for risk stratification of cancer patients," Bioinformatics, Vol. 26, No. 17, 2010, pp. 2136-2144.   DOI
24 Hu, C., Wang, J., Zheng, C., Xu, S., Zhang, H., Liang, Y., Bi, L., Fan, Z., Han, B., and Xu, W., "Raman spectra exploring breast tissues: Comparison of principal component analysis and support vector machine recursive feature elimination," Medical Physics, Vol. 40, No. 6, 2013, pp. 063501.   DOI
25 Hung, S. Y., Yen, D. C., and Wang, H. Y., "Applying data mining to telecom churn management," Expert Systems with Applications, Vol. 31, No. 3, 2006, pp. 515-524.   DOI
26 Joachims, T., "Text categorization with support vector machines: Learning with many relevant features," Machine Learning: ECML-98, 1998, pp. 137-142.
27 Keiningham, T. L., Cooil, B., Aksoy, L., Andreassen, T. W., and Weiner, J., "The value of different customer satisfaction and loyalty metrics in predicting customer retention, recommendation, and share-of-wallet," Managing Service Quality: An International Journal, Vol. 17, No. 4, 2007, pp. 361-384.   DOI
28 Kumar, V., Customer Relationship Management. John Wiley & Sons, Ltd, 2010.
29 Kim, S. A., Kim, J. W., Won, D. Y., and Choi, Y. R., "A Halal Food Classification Framework Using Machine Learning Method for Enhancing Muslim Tourists," The Journal of Information Systems, Vol. 26, No. 3, 2017, pp. 273-293.
30 Kotler, P., and Armstrong, G., Principles of Marketing. Pearson education, 2010.
31 Lee, M. H., "Loyalty of On-line Stock Trading Customers," The Journal of Information Systems, Vol. 14, No. 2, 2005, pp. 155-172.   DOI
32 Louw, N., and Steel, S. J., "Variable selection in kernel Fisher discriminant analysis by means of recursive feature elimination," Computational Statistics & Data Analysis, Vol. 51, No. 3, 2006, pp. 2043-2055.   DOI
33 Leslie, C., Eskin, E., and Noble, W. S., "The spectrum kernel: A string kernel for SVM protein classification," Pacific Symposium on Biocomputing, Vol. 7, 2002, pp. 566-575
34 Li, H., and Sun, J., "Empirical research of hybridizing principal component analysis with multivariate discriminant analysis and logistic regression for business failure prediction," Expert Systems with Applications, Vol. 38, No. 5, 2011, pp. 6244-6253.   DOI
35 Lin, H. H., and Wang, Y. S., "An examination of the determinants of customer loyalty in mobile commerce contexts," Information & Management, Vol. 43, No. 3, 2006, pp. 271-282.   DOI
36 Shmueli, G., Patel, N. R., and Bruce, P. C., Data Mining for Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. John Wiley & Sons, 2008.
37 Michel, P., and El Kaliouby, R., "Real time facial expression recognition in video using support vector machines," Proceedings of the 5th International Conference on Multimodal Interfaces. ACM, 2003, pp. 258-264
38 Min, J. H., and Lee, Y. C., "Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters," Expert Systems with Applications, Vol. 28, No. 4, 2005, pp. 603-614.   DOI
39 Samuel, A. L. "Some studies in machine learning using the game of checkers," IBM Journal of Research and Development, Vol. 3, No. 3, 1959, pp. 210-229.   DOI
40 So, S. H., Ryu, I., Cho, G., and Park, Y. S., "Structural Relationships of Logistics Service Quality, Relationship Orientation, Customer Satisfaction and Customer Loyalty in Electronic Commerce," The Journal of Information Systems, Vol. 16, No. 4, 2007, pp. 107-129.
41 Zaki, M., Kandeil, D., Neely, A., and McColl-Kennedy, J. R., The Fallacy of the Net Promoter Score: Customer Loyalty Predictive Model, Cambridge Service Alliance, University of Cambridge, 2016.
42 Song, F., Mei, D., and Li, H., "Feature selection based on linear discriminant analysis," Intelligent System Design and Engineering Application (ISDEA), 2010 International Conference on, Vol. 1, 2010, pp. 746-749.
43 Stuhlsatz, A., Lippel, J., and Zielke, T., "Feature extraction with deep neural networks by a generalized discriminant analysis," IEEE transactions on neural networks and learning systems, Vol. 23, No. 4, 2012, pp. 596-608.   DOI
44 Teo, T. S., Devadoss, P., and Pan, S. L., "Towards a holistic perspective of customer relationship management (CRM) implementation: A case study of the Housing and Development Board, Singapore," Decision Support Systems, Vol. 42, No. 3, 2006, pp. 1613-1627.   DOI
45 Tu, J. V., "Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes," Journal of Clinical Epidemiology, Vol. 49, No. 11, 1996, pp. 1225-1231.   DOI
46 Wood, E. H., "The internal predictors of business performance in small firms: A logistic regression analysis," Journal of Small Business and Enterprise Development, Vol. 13, No. 3, 2006, pp. 441-453.   DOI
47 Zhao, W., Chellappa, R., and Krishnaswamy, A., "Discriminant analysis of principal components for face recognition," Automatic Face and Gesture Recognition, 1998. Proceedings. Third IEEE International Conference on, IEEE, 1998.