Applications of Machine Learning Models on Yelp Data

Ruchi Singh; Jongwook Woo

doi:10.14329/apjis.2019.29.1.35

Asia pacific journal of information systems

Volume 29 Issue 1 / Pages 35-49 / 2019 / 2288-5404 (pISSN) / 2288-6818 (eISSN)

The Korea Society of Management Information Systems (한국경영정보학회)

Ruchi Singh (CISCO) ;
Jongwook Woo (College of Business and Economics, California State University Los Angeles)

Received : 2018.07.17
Accepted : 2019.01.14
Published : 2019.03.31

Abstract

The paper documents the application of relevant Machine Learning (ML) models to the Yelp dataset (Yelp is a crowd-sourced local business review and social networking site) to analyze, predict, and recommend businesses. Two cloud platforms are used strategically to minimize the effort and time required for the project. Seven machine learning algorithms are implemented in Azure ML, of which four are also implemented in Databricks Spark ML. The analyzed Yelp business dataset contains 70 business attributes for more than 350,000 registered businesses. Additionally, review tips and likes from 500,000 users were processed for the project. A recommendation model is built to provide Yelp users with recommendations for business categories based on their previous business ratings, as well as the business ratings of other users. A classification model is implemented to predict the popularity of a business, where a popular business is defined as one with more than 3 stars and an unpopular business as one with fewer than 3 stars. A text analysis model is developed by comparing two algorithms, uni-gram feature extraction and n-feature extraction, in Azure ML Studio, along with a logistic regression model in Spark. Comparative conclusions are drawn about the efficiency of Spark ML and Azure ML for these models.
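The popularity definition in the abstract can be illustrated with a minimal Python sketch. This is not the authors' Azure ML or Spark ML pipeline. The function name `popularity_label`, the field names, and the sample records are hypothetical, and because the abstract does not say how a rating of exactly 3 stars is handled, the sketch assumes such businesses are left unlabeled.

```python
from typing import Optional


def popularity_label(stars: float) -> Optional[int]:
    """Return 1 for popular (> 3 stars) and 0 for unpopular (< 3 stars).

    Assumption: the abstract does not cover a rating of exactly 3,
    so this sketch returns None for it.
    """
    if stars > 3:
        return 1
    if stars < 3:
        return 0
    return None


# Hypothetical sample records, not taken from the Yelp dataset.
sample = [
    {"business_id": "b1", "stars": 4.5},
    {"business_id": "b2", "stars": 2.0},
    {"business_id": "b3", "stars": 3.0},
]

# Attach the binary label to each record.
labeled = [{**row, "popular": popularity_label(row["stars"])} for row in sample]

# Keep only the rows that received a label (drops the 3-star business).
training_rows = [row for row in labeled if row["popular"] is not None]
```

A binary label of this kind is what a two-class classifier would be trained to predict from the business attributes.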

