Browse > Article
http://dx.doi.org/10.9708/jksci.2014.19.9.125

Response Modeling with Semi-Supervised Support Vector Regression  

Kim, Dong-Il (System Engineering Team, Samsung Electronics, Co. Ltd.)
Abstract
In this paper, I propose a response modeling with a Semi-Supervised Support Vector Regression (SS-SVR) algorithm. In order to increase the accuracy and profit of response modeling, unlabeled data in the customer dataset are used with the labeled data during training. The proposed SS-SVR algorithm is designed to be a batch learning to reduce the training complexity. The label distributions of unlabeled data are estimated in order to consider the uncertainty of labeling. Then, multiple training data are generated from the unlabeled data and their estimated label distributions with oversampling to construct the training dataset with the labeled data. Finally, a data selection algorithm, Expected Margin based Pattern Selection (EMPS), is employed to reduce the training complexity. The experimental results conducted on a real-world marketing dataset showed that the proposed response modeling method trained efficiently, and improved the accuracy and the expected profit.
Keywords
Response Modeling; Semi-Supervised Learning; Support Vector Regression; Customer Relationship Management; Data Mining;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 M. Sun, Z.Y. Chen, and Z.P. Fan, "A Multi-task Multi-Kernel Transfer Learning Method for Customer Response Modeling in Social Media," Procedia Computer Science, Vol. 31, pp.221-230, Jun. 2014.   DOI   ScienceOn
2 H. Risselada, P.C. Verhoef, and T.H.A. Bijmolt, "Dynamic Effects of Social Influence and Direct Marketing on the Adoption of High-Technology Products," Journal of Marketing, Vol. 78, No. 2, pp.99-118, Mar. 2014.   DOI   ScienceOn
3 A. Blum, and T. Mitchell, "Combining Labeled and Unlabeled Data with Co-Training," In Proceedings of the Workshop on Computational Learning Theory, pp.92-100, New York, NY, USA, 1998 Jul.
4 T. Mitchell, "The Role of Unlabeled Data in Supervised Learning," In Proceedings of the 6th International Colloquium on Cognitive Science, San Sebastian, Spain, 1999 May.
5 Z.H. Zhou, and M. Li, "Semisupervised Regression with Cotraining-Style Algorithms," IEEE Transactions on Knowledge and Data Engineering, Vol. 19, No. 11, pp.1479-1493, Nov. 2007.   DOI   ScienceOn
6 X. Wang, L. Fu, and L. Ma, "Semi-Supervised Support Vector Regression Model for Remote Sensing Water Quality Retrieving," Chinese Geographical Science, Vol. 21, No. 1, pp.57-64, Feb. 2011.   DOI
7 D. Kim, P. Kang, and S. Cho, "Semi-Supervised Support Vector Regression Considering Labeling Uncertainty with Label Distribution Estimation and Oversampling," Neurocomputing, Submitted, Jun. 2014.
8 S.K. Lee, P. Kang, and S. Cho, "Probabilistic Local Reconstruction in k-NN Regression and Its Applications to Virtual Metrology in Semiconductor Manufacturing," Neurocomputing, Vol. 131, pp.427-439, May. 2014.   DOI   ScienceOn
9 D. de Ridder, and R. Duin, "Locally Linear Embedding for Classification," Technical Report PH-2002-01, Delft University of Technology, 2002.
10 E.C. Malthouse, "Performance-based Variable Selection for Scoring Models," Journal of Interactive Marketing, Vol. 16, No. 4, pp.37-50, Nov. 2002.   DOI   ScienceOn
11 D.L. Olson, and B. C, "Direct Marketing Decision Support Through Predictive Customer Response Modeling," Decision Support Systems, Vol. 51, No. 1, pp.443-451, Dec. 2012.
12 E.H. Suh, K.C. Noh, and C.K. Suh, "Customer List Segmentation using the Combined Response Model," Expert Systems with Applications, Vol. 17, No. 2, pp.89-97, Aug. 1999.   DOI   ScienceOn
13 H. Lee, and S. Cho, "Focusing on Non-Respondents: Response Modeling with Novelty Detectors," Expert Systems with Applications 33(2), pp.522-530, Feb. 2007.   DOI   ScienceOn
14 Y. Bentz, and D. Merunka, "Neural Networks and the Multinomial Logit for Brand Choice Moldeing: a Hybrid Approach," Journal of Forecasting, Vol. 19, No. 3, pp.177-200, Apr. 2000.   DOI
15 K. Ha, S. Cho, and D. MacLachlan, "Response Models based on Bagging Neural Networks," Journal of Interactive Marketing, Vol. 19, No. 1, pp.17-30, Feb. 2005.   DOI   ScienceOn
16 E. Yu, and S. Cho, "Constructing Response Model using Ensemble based on Feature Subset Selection," Expert Systems with Applications, Vol. 30, No. 2, pp.352-360, Feb. 2006.   DOI   ScienceOn
17 M. Daneshmandi, and M. Ahmadzadeh, "A Hybrid Data Mining Model to Improve Customer Response Modeling in Direct Marketing," Indian Journal of Computer Science and Engineering, Vol. 3, No. 6, pp.844-855, Dec. 2012.
18 A.N. Aliabadi, "Hybrid Model of Customer Response Modeling Through Combination of Neural Networks and Data Preprocessing," In Proceedings of the 2013 IEEE International Conference on Fuzzy Systems (FUZZ), pp.1-4, Hyderabad, India, 2013 Jul.
19 P. Kang, S. Cho, and D. MacLachlan, "Improved Response Modeling based on Clustering, Under-Sampling, and Ensemble," Expert Systems with Applications, Vol. 39, No. 8, pp.6738-6753, Jun. 2012.   DOI   ScienceOn
20 H. Lee, H. Shin, S. Hwang, S. Cho, and D. MacLachlan, "Semi-Supervised Response Modeling," Journal of Interactive Marketing, Vol. 24, No. 1, pp.42-54, Feb. 2010.   DOI   ScienceOn
21 R.C. Blatberg, B.D. Kim, and S.A. Neslin, "Database Marketing: Analyzing and Managing Customers," Springer, pp.245-287, 2008.
22 A. Smola, and B. Scholkopf, "A Tutorial on Support Vector Regression," NeuroCOLT Technical Report NC-TR-98-030, University of London, 2002.
23 K. Wang, S. Zhou, Q. Yang, and J.M.S. Yeung, "Mining Customer Value from Association Rules to Direct Marketing," Data Mining and Knowledge Discovery, Vol. 11, pp.57-79, Jul. 2005.   DOI   ScienceOn
24 D. Kim, H.J. Lee, and S. Cho, "Response Modeling with Support Vector Regression," Expert Systems with Applications, Vol. 34, No. 2, pp.1102-1108, Feb. 2008.   DOI   ScienceOn
25 D. Kim, and S. Cho, "Pattern Selection for Support Vector Regression based Response Modeling," Expert Systems with Applications, Vol. 39, No. 10, pp.8975-8985, Aug. 2012.   DOI   ScienceOn
26 V. Vapnik, "The Natural of Statistical Learning Theory," Springer, pp.549-557, 1995.
27 B. Choi, and K. Cho, "Comparison of HMM and SVM Schemes in Detecting Mobile Botnet," Journal of the Korea Society of Computer and Information, vol.19, no.4, pp.81-90, 2014 Apr.   과학기술학회마을   DOI   ScienceOn
28 K. Huh, and S. Kim, "Context-Aware Fusion with Support Vector Machine," Journal of the Korea Society of Computer and Information, vol.19, no.6, pp.19-26, 2014 Jun.   과학기술학회마을   DOI   ScienceOn
29 X. Zhu, "Semi-Supervised Learning Literature Survey," Technical Report 1350, University of Wisconsin at Madison, 2006.
30 E.C. Malthouse, "Ridge Regression and Direct Marketing Scoring Models," Journal of Interactive Marketing, Vol. 19, No. 4, pp.10-23, Nov. 1999.
31 D. Haughton, and S. Oulabi, "Direct Marketing Modeling with CART and CHAID," Journal of Direct Marketing, Vol. 11, No. 4, pp.42-52, Nov. 1997.   DOI
32 H. Shin, and S. Cho, "Response Modeling with Support Vector Machines," Expert Systems with Applications, Vol. 30, No. 4, pp.746-760, May. 2006.   DOI   ScienceOn
33 F.F. Gonul, B.D. Kim, and M. Shi, "Mailing Smarter to Catalog Customer," Journal of Interactive Marketing, Vol. 14, No. 2, pp.2-6, Apr. 2000.
34 H. Drucker, C.J.C. Burges, L. Kaufman, A. Smola, and V. Vapnik, "Support Vector Regression Machines," Advances in Neural Information Processing System, Vol. 9, pp.155-161, May. 1997.