DOI QR코드

DOI QR Code

Will You Buy It Now?: Predicting Passengers that Purchase Premium Promotions Using the PAX Model

  • Received : 2021.01.12
  • Accepted : 2021.03.10
  • Published : 2021.03.31

Abstract

Upselling is often a critical factor in revenue generation for businesses in the tourism and travel industry. Utilizing passenger data from a major international airline company, we develop the PAX (Passenger, Airline, eXternal) model to predict passengers that are most likely to accept an upgrade offer from economy to premium. Formulating the problem as an extremely unbalanced, cost-sensitive, supervised binary classification, we predict if a customer will take an upgrade offer. We use a feature vector created from the historical data of 3 million passenger records from 2017 to 2019, in which passengers received approximately 635,000 upgrade offers worth more than $422,000,000 U.S. dollars. The model has an F1-score of 0.75, outperforming the airline's current rule-based approach. Findings have several practical applications, including identifying promising customers for upselling and minimizing the number of indiscriminate emails sent to customers. Accurately identifying the few customers who will react positively to upgrade offers is of paramount importance given the airline 'industry's razor-thin margins. Research results have significant real-world impacts because there is the potential to improve targeted upselling to customers in the airline and related industries.

Keywords

Acknowledgement

This publication was made possible by GSRA grant, I.D. # GSRA5-2-0602-18131, from the Qatar National Research Fund (a member of Qatar Foundation).

References

  1. Abdella, J. A., Zaki, N., Shuaib, K., & Khan, F. (2019). Airline ticket price and demand prediction: A survey. Journal of King Saud University - Computer and Information Sciences.
  2. Abdollahi, M., Khaleghi, T., & Yang, K. (2020). An integrated feature learning approach using deep learning for travel time prediction. Expert Systems with Applications, 139, 112864. https://doi.org/10.1016/j.eswa.2019.112864
  3. Aboelmaged, M., & Mouakket, S. (2020). Influencing models and determinants in big data analytics research: A bibliometric analysis. Information Processing & Management, 57(4), 102234. https://doi.org/10.1016/j.ipm.2020.102234
  4. An, M., & Noh, Y. (2009). Airline customer satisfaction and loyalty: Impact of in-flight service quality. Service Business, 3(3), 293-307. https://doi.org/10.1007/s11628-009-0068-4
  5. Aydin, G., & Ziya, S. (2008). Pricing promotional products under upselling. Manufacturing & Service Operations Management, 10(3), 360-376. https://doi.org/10.1287/msom.1070.0187
  6. Benesty, J., Chen, J., Huang, Y., & Cohen, I. (2009). Pearson correlation coefficient. In I. Cohen, Y. Huang, J. Chen, & J. Benesty (Eds.), Noise reduction in speech processing (pp. 1-4). Berlin: Springer.
  7. Bodrunova, S. S. (2018). Internet science. Proceedings of the 5th International Conference, INSCI 2018, St. Petersburg, Russia, October 24-26, 2018. Springer.
  8. Borenstein, S., & Rose, N. L. (1994). Competition and price dispersion in the U.S. airline industry. Journal of Political Economy, 102(4), 653-683. https://doi.org/10.1086/261950
  9. Bradley, A. P. (1997). The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition, 30(7), 1145-1159. https://doi.org/10.1016/S0031-3203(96)00142-2
  10. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324
  11. Bryan, D. L., & O'Kelly, M. E. (1999). Hub-and-spoke networks in air transportation: An analytical review. Journal of Regional Science, 39(2), 275-295. https://doi.org/10.1111/1467-9787.00134
  12. Caruana, R., Lou, Y., Gehrke, J., Koch, P., Sturm, M., & Elhadad, N. (2015). Intelligible models for HealthCare: Predicting pneumonia risk and hospital 30-day readmission. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1721-1730.
  13. Castillo, J. C., Knoepfle, D., & Weyl, G. (2017). Surge pricing solves the wild goose chase. Proceedings of the 2017 ACM Conference on Economics and Computation, 241-242.
  14. Chen, Y. L., Yeh, Y. H., & Ma, M. R. (2021). A movie recommendation method based on users' positive and negative profiles. Information Processing & Management, 58(3), 102531. https://doi.org/10.1016/j.ipm.2021.102531
  15. Cheng, W., & Hullermeier, E. (2009). Combining instance-based learning and logistic regression for multilabel classification. Machine Learning, 76(2), 211-225. https://doi.org/10.1007/s10994-009-5127-5
  16. Chiang, C. T., Yang, M. H., Koo, T. L., & Liao, C. H. (2020). What drives customer engagement behavior? The impact of user participation from a sociotechnical perspective. Journal of Electronic Commerce Research, 21(3), 197-214.
  17. Clemes, M. D., & Choong, M. (2008). An empirical analysis of customer satisfaction in international air travel. Innovative Marketing, 4(2), 15.
  18. Cui, Y., Orhun, A. Y., & Duenyas, I. (2018). How price dispersion changes when upgrades are introduced: Theory and empirical evidence from the airline industry. Management Science, 65(8), 3835-3852. https://doi.org/10.1287/mnsc.2018.3117
  19. Dai, B. T. (2014). How can consumer preferences be leveraged for targeted upselling in cable T.V. services? Pacific Telecommunications Council (PTC'14). Singapore: Research Collection School of Information Systems.
  20. Denizci Guillet, B. (2020). Online upselling: Moving beyond offline upselling in the hotel industry. International Journal of Hospitality Management, 84, 102322. https://doi.org/10.1016/j.ijhm.2019.102322
  21. Duda, R. O., Hart, P. E., & Stork, D. G. (2012). Pattern classification. Hoboken: John Wiley & Sons.
  22. Eti, S., & Mizrak, F. (2020). Analysing customer satisfaction of civil aviation companies of Turkey with text mining. In H. Dincer & S. Yuksel (Eds.), Strategic outlook for innovative work behaviours: Interdisciplinary and multidimensional perspectives (pp. 21-41). New York: Springer International Publishing.
  23. Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189-1232. https://doi.org/10.1214/aos/1013203451
  24. Fu, H., Manogaran, G., Wu, K., Cao, M., Jiang, S., & Yang, A. (2020). Intelligent decision-making of online shopping behavior based on internet of things. International Journal of Information Management, 50, 515-525. https://doi.org/10.1016/j.ijinfomgt.2019.03.010
  25. Greenstein-Messica, A., & Rokach, L. (2020). Machine learning and operation research-based method for promotion optimization of products with no price elasticity history. Electronic Commerce Research and Applications, 40, 100914. https://doi.org/10.1016/j.elerap.2019.100914
  26. Gupta, N. (2018). Influence of demographic variables: Customers' perception about cross-selling and up-selling for eBanking. International Journal of Electronic Customer Relationship Management, 11(2), 126-141. https://doi.org/10.1504/IJECRM.2018.090195
  27. Ham, J., Koo, C., & Chung, N. (2020). Configurational patterns of competitive advantage factors for smart tourism: An equifinality perspective. Current Issues in Tourism, 23(9), 1066-1072. https://doi.org/10.1080/13683500.2019.1566303
  28. Hamari, J., Hanner, N., & Koivisto, J. (2020). "Why pay premium in freemium services?" A study on perceived value, continued use and purchase intentions in free-to-play games. International Journal of Information Management, 51, 102040. https://doi.org/10.1016/j.ijinfomgt.2019.102040
  29. Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29-36. https://doi.org/10.1148/radiology.143.1.7063747
  30. Hanrahan, B. V., Willamowski, J. K., Swaminathan, S., & Martin, D. B. (2015). TurkBench: Rendering the market for Turkers. Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 1613-1616.
  31. Hara, K., Adams, A., Milland, K., Savage, S., Callison-Burch, C., & Bigham, J. P. (2018). A data-driven analysis of workers' earnings on Amazon Mechanical Turk. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1-14.
  32. Havlicek, L. L., & Peterson, N. L. (1976). Robustness of the Pearson correlation against violations of assumptions. Perceptual and Motor Skills, 43(3_suppl), 1319-1334. https://doi.org/10.2466/pms.1976.43.3f.1319
  33. Hlee, S., Lee, H., Koo, C., & Chung, N. (2020). Will the relevance of review language and destination attractions be helpful? A data-driven approach. Journal of Vacation Marketing, 27(1), 61-81. https://doi.org/10.1177/1356766720950356
  34. Hu, W. H., Lin, B. T., Lu, F. S., & Jeng, J. Y. (2016). An up-selling pricing model using SDP-based rating mechanism with dynamic weight. Proceedings of the The 3rd Multidisciplinary International Social Networks Conference on SocialInformatics 2016, Data Science 2016, 1-6.
  35. Huang, C., Li, Y., Loy, C. C., & Tang, X. (2016). Learning deep representation for imbalanced classification. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5375-5384.
  36. Hussain, R., Al Nasser, A., & Hussain, Y. K. (2015). Service quality and customer satisfaction of a UAE-based airline: An empirical investigation. Journal of Air Transport Management, 42, 167-175. https://doi.org/10.1016/j.jairtraman.2014.10.001
  37. Johnson, J. S., & Friend, S. B. (2015). Contingent cross-selling and upselling relationships with performance and job satisfaction: An MOAtheoretic examination. Journal of Personal Selling & Sales Management, 35(1), 51-71. https://doi.org/10.1080/08853134.2014.940962
  38. Kleinbaum, D. G., Dietz, K., Gail, M., Klein, M., & Klein, M. (2002). Logistic regression. Berlin: Springer.
  39. Kubiak, B. F., & Weichbroth, P. (1970). Cross and up-selling techniques in e-commerce activities. The Journal of Internet Banking and Commerce, 15(3), 1-7.
  40. Lengerich, B., Tan, S., Chang, C. H., Hooker, G., & Caruana, R. (2020). Purifying interaction effects with the functional ANOVA: An efficient algorithm for recovering identifiable additive models. Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020, Palermo, Italy, 2402-2412.
  41. Lou, Y., Caruana, R., & Gehrke, J. (2012). Intelligible models for classification and regression. Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 150-158.
  42. Lou, Y., Caruana, R., Gehrke, J., & Hooker, G. (2013). Accurate intelligible models with pairwise interactions. Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 623-631.
  43. Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, & R. Garnett (Eds.), Advances in neural information processing systems 30 (pp. 4765-4774). Red Hook, NY: Curran Associates, Inc.
  44. Luttmann, A. (2019). Evidence of directional price discrimination in the U.S. airline industry. International Journal of Industrial Organization, 62, 291-329. https://doi.org/10.1016/j.ijindorg.2018.03.013
  45. Ma, N. F., Yuan, C. W., Ghafurian, M., & Hanrahan, B. V. (2018). Using stakeholder theory to examine drivers' stake in Uber. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems - CHI '18, 1-12.
  46. Ma, Y., Mao, J., Ba, Z., & Li, G. (2020). Location recommendation by combining geographical, categorical, and social preferences with location popularity. Information Processing & Management, 57(4), 102251. https://doi.org/10.1016/j.ipm.2020.102251
  47. Manchanayake, S. M. A. M., Samarasinghe, D. P., Perera, L. P. J., Bandara, H. M. M. T., Kumaradasa, K. C., Premadasa, N., & Samarasinghe, A. P. (2019). Potential upselling customer prediction through user behavior analysis based on CDR data. 2019 14th Conference on Industrial and Information Systems (ICIIS), 46-51.
  48. Mayer, V. F., Santos, G. E. de O., & Marques, O. R. B. (2020). Option framing for upselling tourism services: Does cognitive availability prevent irrational choices? Tourism Economics.
  49. Murphy, K. P. (2012). Machine learning: A probabilistic perspective. Cambridge, MA: MIT Press.
  50. Nadeau, D., & Turney, P. D. (2005). A supervised learning approach to acronym identification. In B. Kegl & G. Lapalme (Eds.), Advances in artificial intelligence (pp. 319-329). Berlin: Springer.
  51. Nicolini, G., & Salini, S. (2006). Customer satisfaction in the airline industry: The case of British Airways. Quality and Reliability Engineering International, 22(5), 581-589. https://doi.org/10.1002/qre.763
  52. Nori, H., Jenkins, S., Koch, P., & Caruana, R. (n.d.). InterpretML: A unified framework for machine learning interpretability. Ithaca, NY: Cornell University.
  53. Norvell, T., Kumar, P., & Contractor, S. (2018). Assessing the customerbased impact of up-selling versus down-selling. Cornell Hospitality Quarterly, 59(3), 215-227. https://doi.org/10.1177/1938965518762836
  54. Oh, H., Parks, S. C., & Demicco, F. J. (2002). Age- and gender-based market segmentation. International Journal of Hospitality & Tourism Administration, 3(1), 1-20. https://doi.org/10.1300/J149v03n01_01
  55. Opitz, D., & Maclin, R. (1999). Popular ensemble methods: An empirical study. Journal of Artificial Intelligence Research, 11, 169-198. https://doi.org/10.1613/jair.614
  56. Ostrowski, P. L., O'Brien, T. V., & Gordon, G. L. (1993). Service quality and customer loyalty in the commercial airline industry. Journal of Travel Research, 32(2), 16-24. https://doi.org/10.1177/004728759303200203
  57. Potdar, K., S., Pardawala, T. S., & Pai, C. D. (2017). A comparative study of categorical variable encoding techniques for neural network classifiers. International Journal of Computer Applications, 175(4), 7-9. https://doi.org/10.5120/ijca2017915495
  58. Renjith, S., Sreekumar, A., & Jathavedan, M. (2020). An extensive study on the evolution of context-aware personalized travel recommender systems. Information Processing & Management, 57(1), 102078. https://doi.org/10.1016/j.ipm.2019.102078
  59. Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why Should I Trust You?": Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135-1144.
  60. Royston, P. (2004). Multiple imputation of missing values. The Stata Journal, 4(3), 227-241. https://doi.org/10.1177/1536867x0400400301
  61. Safavian, S. R., & Landgrebe, D. (1991). A survey of decision tree classifier methodology. IEEE Transactions on Systems, Man, and Cybernetics, 21(3), 660-674. https://doi.org/10.1109/21.97458
  62. Sarker, I. H., & Kayes, A. S. M. (2020). ABC-RuleMiner: User behavioral rule-based machine learning method for context-aware intelligent services. Journal of Network and Computer Applications, 168, 102762. https://doi.org/10.1016/j.jnca.2020.102762
  63. Song, W. K., & Lee, H. C. (2020). An analysis of traveler need for and willingness to purchase airline dynamic packaging: A Korean case study. Journal of Air Transport Management, 82, 101735. https://doi.org/10.1016/j.jairtraman.2019.101735
  64. Southern, C., Cheng, Y., Zhang, C., & Abowd, G. D. (2017). Understanding the cost of driving trips. Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 430-434.
  65. Squires, J., Wilder, D. A., Fixsen, A., Hess, E., Rost, K., Curran, R., & Zonneveld, K. (2007). The effects of task clarification, visual prompts, and graphic feedback on customer greeting and up-selling in a restaurant. Journal of Organizational Behavior Management, 27(3), 1-13. https://doi.org/10.1300/J075v27n03_01
  66. Stavins, J. (2001). Price discrimination in the airline market: The effect of market concentration. Review of Economics and Statistics, 83(1), 200-202. https://doi.org/10.1162/rest.2001.83.1.200
  67. Steffen, A., Weibel, C., Stampfli, A. E., & von Arx, W. (2020). Upselling by default: The effect of default options on travelers' board and lodging choices. Journal of Travel Research, 59(7), 1253-1267. https://doi.org/10.1177/0047287519877250
  68. Stein, G., Chen, B., Wu, A. S., & Hua, K. A. (2005). Decision tree classifier for network intrusion detection with GA-based feature selection. Proceedings of the 43rd Annual Southeast Regional Conference - Volume 2, 136-141.
  69. Steinberg, D. (2009). CART: Classification and regression trees. In The top ten algorithms in data mining (pp. 193-216). Boca Raton, FL: Chapman and Hall/CRC.
  70. Svangren, M. K., Skov, M. B., & Kjeldskov, J. (2018). Passenger trip planning using ride-sharing services. Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1-12.
  71. Tan, S., Caruana, R., Hooker, G., & Lou, Y. (2018). Distill-and-compare: Auditing black-box models using transparent model distillation. Proceedings of the 2018 AAAI/ACM Conference on A.I., Ethics, and Society, 303-310.
  72. Van Buuren, S. (2018). Flexible imputation of missing data. Boca Raton, FL: CRC Press.
  73. Wiesman, D. W. (2006). The effects of performance feedback and social reinforcement on up-selling at fast-food restaurants. Journal of Organizational Behavior Management, 26(4), 1-18. https://doi.org/10.1300/J075v26n04_01
  74. Yang, S., Wang, H., Zhang, C., & Gao, Y. (2020). Contextual bandits with hidden features to online recommendation via sparse interactions. IEEE Intelligent Systems, 35(5), 62-72. https://doi.org/10.1109/mis.2020.3010298
  75. Zhu, G., Wu, Z., Wang, Y., Cao, S., & Cao, J. (2019). Online purchase decisions for tourism e-commerce. Electronic Commerce Research and Applications, 38, 100887. https://doi.org/10.1016/j.elerap.2019.100887
  76. Zins, A. H. (2001). Relative attitudes and commitment in customer loyalty models: Some experiences in the commercial airline industry. International Journal of Service Industry Management, 12(3), 269-294. https://doi.org/10.1108/EUM0000000005521

Cited by

  1. Artificial intelligence (AI) and robotics in travel, hospitality and leisure vol.31, pp.3, 2021, https://doi.org/10.1007/s12525-021-00494-z