DOI QR코드

DOI QR Code

Prediction of Food Franchise Success and Failure Based on Machine Learning

머신러닝 기반 외식업 프랜차이즈 가맹점 성패 예측

  • 안예린 (서울여자대학교 소프트웨어융합학과) ;
  • 유성민 (서울여자대학교 데이터과학전공) ;
  • 이현희 (서울여자대학교 데이터과학전공) ;
  • 박민서 (서울여자대학교 데이터사이언스학과)
  • Received : 2022.05.24
  • Accepted : 2022.07.02
  • Published : 2022.07.31

Abstract

In the restaurant industry, start-ups are active due to high demand from consumers and low entry barriers. However, the restaurant industry has a high closure rate, and in the case of franchises, there is a large deviation in sales within the same brand. Thus, research is needed to prevent the closure of food franchises. Therefore, this study examines the factors affecting franchise sales and uses machine learning techniques to predict the success and failure of franchises. Various factors that affect franchise sales are extracted by using Point of Sale (PoS) data of food franchise and public data in Gangnam-gu, Seoul. And for more valid variable selection, multicollinearity is removed by using Variance Inflation Factor (VIF). Finally, classification models are used to predict the success and failure of food franchise stores. Through this method, we propose success and failure prediction model for food franchise stores with the accuracy of 0.92.

외식업은 소비자의 수요가 많고 진입장벽이 낮아 창업이 활발하게 일어난다. 하지만 외식업은 폐업률이 높고, 프랜차이즈의 경우 동일 브랜드 내에서도 매출 편차가 크게 나타난다. 따라서 외식업 프랜차이즈의 폐업을 방지하기 위한 연구가 필요하다. 이를 위해, 본 연구에서는 프랜차이즈 가맹점 매출에 영향을 미치는 요인들을 살펴보고, 도출된 요인들에 머신러닝 기법을 활용하여 프랜차이즈의 성패를 예측하고자 한다. 강남구 프랜차이즈 매장의 PoS(Point of Sale) 데이터와 공공데이터를 활용하여 가맹점 매출에 영향을 미치는 여러 요인들을 추출하고, VIF(Variance Inflation Factor)를 활용하여 다중공산성을 제거하여 타당성 있는 변수 선택을 진행한 뒤, 머신러닝 기법 중 분류모델을 활용하여 프랜차이즈 매장의 성패 예측을 진행한다. 이를 통해 최고 정확도 0.92를 가진 프랜차이즈 성패 예측 모델을 제안한다.

Keywords

References

  1. J.-H. Kim, K.-J. Kang, and K.-M. Ryoo, "A study on the effects of catering consultant capability on consulting satisfaction and business performance of small business owners", International Journal of Tourism and Hospitality Research, vol. 34, no. 11. Korea Tourism Research Association, pp. 165-177, 2020. https://doi.org/10.21298/IJTHR.2020.11.34.11.165
  2. Kim, Sun Jung, "Effects of the characteristics of TV and Internet Food-related Programs on Dietary Self-efficacy of Regular Viewers - focused on single household", The Journal of the Convergence on Culture Technology (JCCT), Vol. 5, No. 4, pp 307-313, 2019, DOI: https://doi.org/10.17703/JCCT.2019.5.4.307
  3. Korea Institute of Startup and Entrepreneurship Development, 「2020 Startup Business Status Survey (as of 2018)」, (2021).
  4. aT, 「2019 Domestic and International Restaurant Trend Survey」, (2020).
  5. Korea Fair Trade Commission, 「Franchise Business Status Report」, (2022).
  6. Im Dong Lee, Chan Ho Lee and Sang Mog Kang. "The Study of Site Factors Affecting the Sales of Convenience Stores", Journal of KREAA, vol. 16, pp.53-77, 2010. DOI: http://dx.doi.org/10.19172/KR EAA.23.4.6
  7. Hwang Kyu Sung, Jang Hyeong Jin. "A Study on the Effects of Location Factors on Performance of Convenience Stores -Comparative Analysis on Ulsan and Gyeong Nam Regions-", Korea Real Estate Academy Review, vol. 64, pp.199-211, 2016.
  8. Young Man Lee. "A Review of the Hedonic Price Model". Journal of KREAA, vol. 14, pp.81-87, 2008.
  9. Woo Jin Shin, So Youn Moon. "A Study on the Effects of Locational Characteristics on the Sales of a Coffee Shop Franchise", Journal of KREAA, vol. 17, pp.111-123, 2011.
  10. Aitken, A. C. "IV.-On Least Squares and Linear Combination of Observations", Proceedings of the Royal Society of Edinburgh, pp. 42-48, 1936. DOI: https://doi.org/10.1017/S0370164600014346
  11. Seong Eun Yeong, Choi Chang Gyu. "The Effects of Disposable Income and Built Environment Characteristics on Retail Business Sales", Journal of KREAA, vol. 23, pp.77-91, December 2017. DOI: http://dx.doi.org/10.19172/KREAA.23.4.6
  12. Soowook Lee, Manyong Han, "Utilization and Analysis of Big-data.", International Journal of Advanced Culture Technology(IJACT), 7(4), pp. 255-259, 2019. DOI: https://doi.org/10.17703/IJACT.2019.7.4.255
  13. Cox, D.R., Snell, E.J., Analysis of Binary Data, 2nd ed, New York, U.S.A: Routledge, 1970.
  14. SooYoung Kim. "Prediction of Hotel Bankruptcy Using Multivariate Discriminant Analysis, Logistic Regression and Artificial Neural Network", Journal of Tourism Sciences, vol. 30, no. 2, pp.53-75, 2006.
  15. Breiman, L., et al, Classification and Regression Trees, Belmont, California: Wadsworth, 1984.
  16. Jonghoo Choi, Doosung Seo. "Decision Trees and Its Applications", Journal of The Korean Official Statistics, vo1. 4, no. 1, pp.61-83, March 1999.
  17. Breiman, Leo., "Random forests", Machine Learning, Vol. 45, No. 1, pp. 5-32, 2001, DOI: https://doi.org/10.1023/A:1010933404324
  18. Younhak Oh, Han Kim, Jaesub Yun, Jong-Seok Lee."Using Data Mining Techniques to Predict Win-Loss in Korean Professional Baseball Games", Journal of the Korean Institute of Industrial Engineers, vol. 40, no. 1, pp.8-17, 2014. DOI: http://dx.doi.org/10.7232/JKIIE.2014.40.1.008
  19. D. E. Farrar and R. R. Glauber, "Multi collinearity in regression analysis: The problem revisited", Review of Economics and Statistics, Vol.49, No.1 pp.92-107, 1967. https://doi.org/10.2307/1937887
  20. A. Swets, "Measuring the Accuracy of Diagnostic Systems", Science, vol. 240, no. 4857. American Association for the Advancement of Science (AAAS), pp. 1285-1293, 1988. https://doi.org/10.1126/science.3287615