http://dx.doi.org/10.13088/jiis.2018.24.2.221

Customer Behavior Prediction of Binary Classification Model Using Unstructured Information and Convolution Neural Network: The Case of Online Storefront  

Kim, Seungsoo (Dept. of Business Administration, Graduate School, Hanyang University)
Kim, Jongwoo (School of Business, Hanyang University)
Publication Information
Journal of Intelligence and Information Systems / v.24, no.2, 2018, pp. 221-241
Abstract
Deep learning has recently attracted considerable attention. The convolutional neural network (CNN) is the deep learning technique behind winning entries in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) and behind AlphaGo. A CNN divides the input image into small sections, recognizes partial features in each, and combines them to recognize the whole. Deep learning technologies are expected to bring many changes to our lives, but so far their applications have been largely limited to image recognition and natural language processing. The use of deep learning for business problems is still at an early research stage. If its performance is demonstrated, it can be applied to traditional business problems such as marketing response prediction, fraudulent transaction detection, and bankruptcy prediction. It is therefore a meaningful experiment to assess whether deep learning can solve business problems, using the case of online shopping companies, which hold big data, whose customer behavior is relatively easy to identify, and for which the results have high practical value. In particular, the competitive environment of online shopping companies is changing rapidly and becoming more intense, so analyzing customer behavior to maximize profit is becoming increasingly important for them. In this study, we propose the 'CNN model of Heterogeneous Information Integration', which uses a CNN to improve the prediction of customer behavior in online shopping enterprises.
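The idea that a CNN recognizes partial features in small sections and then combines them can be illustrated on text with a one-dimensional convolution followed by max-over-time pooling. The following is a minimal sketch under stated assumptions, not the authors' implementation: the toy embeddings, the filter weights, and the function names `conv1d` and `max_over_time` are all hypothetical and chosen only for illustration.

```python
from typing import List

Vector = List[float]


def conv1d(embeddings: List[Vector], kernel: List[Vector], bias: float = 0.0) -> List[float]:
    """Slide a kernel spanning len(kernel) tokens over the sequence.

    Uses no padding and applies ReLU to each window response.
    """
    width = len(kernel)
    outputs = []
    for start in range(len(embeddings) - width + 1):
        window = embeddings[start:start + width]
        # Dot product between the token window and the kernel.
        total = bias
        for token_vec, kernel_vec in zip(window, kernel):
            total += sum(t * k for t, k in zip(token_vec, kernel_vec))
        outputs.append(max(0.0, total))
    return outputs


def max_over_time(feature_map: List[float]) -> float:
    """Combine the local responses into one feature by taking the maximum."""
    return max(feature_map) if feature_map else 0.0


# Toy 2-dimensional embeddings for a 4-token sentence (made-up values).
sentence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
# One hypothetical filter that spans 2 consecutive tokens.
kernel = [[1.0, 0.0], [0.0, 1.0]]

feature_map = conv1d(sentence, kernel)  # one response per 2-token window
pooled = max_over_time(feature_map)     # single feature for the whole sentence
print(feature_map, pooled)
```

Each entry of `feature_map` is the response of the filter to one local window, and the pooled value summarizes the sentence as a whole; a real model would learn many such filters rather than use fixed weights.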
The proposed model learns from a convolutional neural network combined with a multi-layer perceptron structure, taking both structured and unstructured information as input. To optimize its performance, we evaluate three architectural components, 'heterogeneous information integration', 'unstructured information vector conversion', and 'multi-layer perceptron design', and finalize the proposed model based on the results. The target variables for predicting customer behavior are defined as six binary classification problems: re-purchaser, churn, frequent shopper, frequent refund shopper, high amount shopper, and high discount shopper. To verify the usefulness of the proposed model, we conducted experiments using actual transaction, customer, and voice of customer (VOC) data from an online shopping company in Korea. The data cover the 47,947 customers who registered at least one VOC during the one month of January 2011. For these customers we use their customer profiles, 19 months of transaction data from September 2010 to March 2012, and the VOCs posted during that month. The experiment is divided into two stages. In the first stage, we evaluate the three architectural components that affect the performance of the proposed model and select the optimal parameters. In the second stage, we evaluate the performance of the proposed model. Experimental results show that the proposed model, which combines structured and unstructured information, outperforms NBC (Naïve Bayes classification), SVM (support vector machine), and ANN (artificial neural network). These results are significant in two respects: the use of unstructured information contributes to predicting customer behavior, and CNN can be applied to business problems as well as to image recognition and natural language processing problems.
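One way to picture the combination of structured and unstructured information is to concatenate a structured feature vector with a text feature vector and pass the result through a small multi-layer perceptron with a sigmoid output for a single binary target. The sketch below assumes this concatenation design; the abstract does not specify the actual integration method, and all weights, feature values, and function names here are hypothetical.

```python
import math
from typing import List


def dense(inputs: List[float], weights: List[List[float]], biases: List[float]) -> List[float]:
    """Fully connected layer with one weight row per output unit."""
    return [sum(x * w for x, w in zip(inputs, row)) + b
            for row, b in zip(weights, biases)]


def relu(values: List[float]) -> List[float]:
    return [max(0.0, v) for v in values]


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def predict(structured, text_features, hidden_w, hidden_b, out_w, out_b) -> float:
    """Return the predicted probability of the positive class."""
    # Assumed integration step: concatenate both kinds of features.
    combined = structured + text_features
    hidden = relu(dense(combined, hidden_w, hidden_b))
    logit = dense(hidden, [out_w], [out_b])[0]
    return sigmoid(logit)


# Hypothetical inputs: two scaled structured features (for example purchase
# count and purchase amount) and one pooled feature derived from VOC text.
structured = [0.5, 1.0]
text_features = [2.0]

# Hypothetical fixed weights; a trained model would learn these.
hidden_w = [[1.0, 0.0, 0.5], [0.0, -1.0, 0.0]]
hidden_b = [0.0, 0.0]
out_w = [1.0, 1.0]
out_b = -1.0

probability = predict(structured, text_features, hidden_w, hidden_b, out_w, out_b)
label = int(probability >= 0.5)  # 1 = positive class, e.g. "re-purchaser"
print(round(probability, 4), label)
```

Under this design, each of the six target variables would be handled as its own binary problem, with a separately trained output of this kind.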
The experiments also confirm that CNN is effective in understanding and interpreting contextual meaning in text VOC data. The empirical research, based on actual data from an e-commerce company, further shows that meaningful information for predicting customer behavior can be extracted from VOC data written directly by customers in text form. Finally, the various experiments on the proposed model provide useful information for future research on parameter selection and its effect on performance.
Keywords
Customer Behavior Prediction; Deep Learning; Convolution Neural Network (CNN); Voice of Customer (VOC)