http://dx.doi.org/10.52255/smarttourism.2021.1.1.7

Will You Buy It Now?: Predicting Passengers that Purchase Premium Promotions Using the PAX Model  

Al Emadi, Noora (Qatar Computing Research Institute, Hamad Bin Khalifa University)
Thirumuruganathan, Saravanan (Qatar Computing Research Institute, Hamad Bin Khalifa University)
Robillos, Dianne Ramirez (School of Statistics, University of the Philippines)
Jansen, Bernard Jim (Qatar Computing Research Institute, Hamad Bin Khalifa University)
Publication Information
Journal of Smart Tourism / v.1, no.1, 2021, pp. 53-64
Abstract
Upselling is often a critical factor in revenue generation for businesses in the tourism and travel industry. Utilizing passenger data from a major international airline company, we develop the PAX (Passenger, Airline, eXternal) model to predict the passengers who are most likely to accept an upgrade offer from economy to premium. Formulating the problem as an extremely unbalanced, cost-sensitive, supervised binary classification, we predict whether a customer will take an upgrade offer. We use a feature vector created from the historical data of 3 million passenger records from 2017 to 2019, in which passengers received approximately 635,000 upgrade offers worth more than US$422,000,000. The model has an F1-score of 0.75, outperforming the airline's current rule-based approach. Findings have several practical applications, including identifying promising customers for upselling and minimizing the number of indiscriminate emails sent to customers. Accurately identifying the few customers who will react positively to upgrade offers is of paramount importance given the airline industry's razor-thin margins. Research results have significant real-world impacts because there is the potential to improve targeted upselling to customers in the airline and related industries.
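To make the evaluation metric concrete, the following is a minimal sketch of how an F1-score is computed for an unbalanced binary classification, and why plain accuracy can mislead in that setting. It is not the authors' implementation: the function name `f1_from_counts` is hypothetical, and all counts are invented for illustration rather than taken from the airline data.

```python
def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """Return the F1-score, the harmonic mean of precision and recall."""
    if tp == 0:
        # No true positives: precision and recall are both 0 (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up example: 1,000 upgrade offers, of which only 20 are accepted.
total, positives = 1000, 20

# Baseline that predicts "will not upgrade" for every passenger.
baseline_accuracy = (total - positives) / total            # high accuracy
baseline_f1 = f1_from_counts(tp=0, fp=0, fn=positives)     # but F1 is zero

# Hypothetical model: finds 12 of the 20 buyers, with 4 false alarms.
tp, fp, fn = 12, 4, 8
model_accuracy = (total - fp - fn) / total
model_f1 = f1_from_counts(tp, fp, fn)                      # roughly 0.667

print(f"baseline: accuracy={baseline_accuracy:.3f}, F1={baseline_f1:.3f}")
print(f"model:    accuracy={model_accuracy:.3f}, F1={model_f1:.3f}")
```

In this toy setting the two accuracies differ by less than one percentage point, while the F1-scores separate the trivial baseline from the model clearly, which is one reason F1 is commonly reported for rare-event prediction.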
Keywords
upselling; price elasticity; aviation company data; prediction; airline travel