Browse > Article
http://dx.doi.org/10.13088/jiis.2019.25.1.219

How to improve the accuracy of recommendation systems: Combining ratings and review texts sentiment scores  

Hyun, Jiyeon (Business Informatics, School of Business, Hanyang University)
Ryu, Sangyi (School of Business, Hanyang University)
Lee, Sang-Yong Tom (School of Business, Hanyang University)
Publication Information
Journal of Intelligence and Information Systems / v.25, no.1, 2019 , pp. 219-239 More about this Journal
Abstract
As the importance of providing customized services to individuals becomes important, researches on personalized recommendation systems are constantly being carried out. Collaborative filtering is one of the most popular systems in academia and industry. However, there exists limitation in a sense that recommendations were mostly based on quantitative information such as users' ratings, which made the accuracy be lowered. To solve these problems, many studies have been actively attempted to improve the performance of the recommendation system by using other information besides the quantitative information. Good examples are the usages of the sentiment analysis on customer review text data. Nevertheless, the existing research has not directly combined the results of the sentiment analysis and quantitative rating scores in the recommendation system. Therefore, this study aims to reflect the sentiments shown in the reviews into the rating scores. In other words, we propose a new algorithm that can directly convert the user 's own review into the empirically quantitative information and reflect it directly to the recommendation system. To do this, we needed to quantify users' reviews, which were originally qualitative information. In this study, sentiment score was calculated through sentiment analysis technique of text mining. The data was targeted for movie review. Based on the data, a domain specific sentiment dictionary is constructed for the movie reviews. Regression analysis was used as a method to construct sentiment dictionary. Each positive / negative dictionary was constructed using Lasso regression, Ridge regression, and ElasticNet methods. Based on this constructed sentiment dictionary, the accuracy was verified through confusion matrix. The accuracy of the Lasso based dictionary was 70%, the accuracy of the Ridge based dictionary was 79%, and that of the ElasticNet (${\alpha}=0.3$) was 83%. Therefore, in this study, the sentiment score of the review is calculated based on the dictionary of the ElasticNet method. It was combined with a rating to create a new rating. In this paper, we show that the collaborative filtering that reflects sentiment scores of user review is superior to the traditional method that only considers the existing rating. In order to show that the proposed algorithm is based on memory-based user collaboration filtering, item-based collaborative filtering and model based matrix factorization SVD, and SVD ++. Based on the above algorithm, the mean absolute error (MAE) and the root mean square error (RMSE) are calculated to evaluate the recommendation system with a score that combines sentiment scores with a system that only considers scores. When the evaluation index was MAE, it was improved by 0.059 for UBCF, 0.0862 for IBCF, 0.1012 for SVD and 0.188 for SVD ++. When the evaluation index is RMSE, UBCF is 0.0431, IBCF is 0.0882, SVD is 0.1103, and SVD ++ is 0.1756. As a result, it can be seen that the prediction performance of the evaluation point reflecting the sentiment score proposed in this paper is superior to that of the conventional evaluation method. In other words, in this paper, it is confirmed that the collaborative filtering that reflects the sentiment score of the user review shows superior accuracy as compared with the conventional type of collaborative filtering that only considers the quantitative score. We then attempted paired t-test validation to ensure that the proposed model was a better approach and concluded that the proposed model is better. In this study, to overcome limitations of previous researches that judge user's sentiment only by quantitative rating score, the review was numerically calculated and a user's opinion was more refined and considered into the recommendation system to improve the accuracy. The findings of this study have managerial implications to recommendation system developers who need to consider both quantitative information and qualitative information it is expect. The way of constructing the combined system in this paper might be directly used by the developers.
Keywords
collaborative filtering; recommendation system; sentiment analysis; sentiment dictionary; text mining;
Citations & Related Records
Times Cited By KSCI : 4  (Citation Analysis)
연도 인용수 순위
1 Ahn. J. K. and H. W. Kim , "Building a Korean Sentiment Dictionary and Applications of Natural Language Processing", J Intell Inform Syst, (2014), 177-182.
2 Bhojne, N. G., Deore, S., Jagtap, R., Jain, G. and Kalal, C., "Collaborative Approach based Restaurant Recommender System using Naive Bayes", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 6, No.4(2017).
3 Breese, J. S., Heckerman, D., and Kadie, C., "Empirical analysis of predictive algorithms for collaborative filtering", In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, (1998), 43-52.
4 Choi. D. J., H. S. Choi and C. Y. Park., "Classification of ratings in online reviews", Journal of the Korean Data & Information Science Society, (2016), 845-854.   DOI
5 Deng, D., Jing, L., Yu, J., Sun, S. and Zhou, H., "Neural gaussian mixture model for review-based rating prediction", RecSys '18 Proceedings of the 12th ACM Conference on Recommender Systems, (2018).
6 Garcis-Cumbreras, M. A., A. Montejo-Raez, and M. C. Diaz-Galiano, "Pessimists and optimists: Improving collaborative filtering through sentiment analysis", Expert Systems with Applications, Vol. 40, No.17(2013), 6758-6765.   DOI
7 Goldberg, D., Nichols, D., Oki, B. M., and Terry, D., "Using collaborative filtering to weave an information tapestry", Communications of the ACM, (1992), 61-70.
8 Hoerl, E. and Kennard, R.W., "Ridge Regression: Biased Estimation for Nonorthogonal Problems", Technometrics, Vol.12, No.1 (1970), 55-67.   DOI
9 Jeon, B. K., H. C. Ahn, "A Collaborative Filtering System Combined with Users Review Mining Application to the Recommendation of Smartphone-Apps", J Intell Inform Syst, Vol.21, No.2(2015), 1-18.   DOI
10 Jakob, "Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations", TSA '09 Proceedings of the 1st international CIKM workshop on Topic-sentiment analysis for mass opinion, (2009), 57-64.
11 Jung, C. H., J. H. Kim, Y. J. Jeon, H. J. Jung, "Korean Sentiment Dictionary based on the Reliability of Review data", Korean Institute of Information Scientists and Engineers, 2017 Korea Software Congress, (2017), 1965-1967.
12 Kim. D. H. and S. H. Choi, "A Domain Adaptive Sentiment Dictionary Construction Method for Domain Sentiment Analysis", The Korean Society of Computer and Information, Proceedings of the Korean Society of Computer Information Conference, (2015), 15-18.
13 Kim. S. B., S. J. Kwon. and J. T. Kim, "Building Sentiment Dictionary and Polarity Classification of Blog Review by Using Elastic Net", Korean Information Science Society, (2015), 639-641.
14 Park, J. Y. and B. S. Jeon, "A structural Analysis of the Movie Reviews", The Journal of the Korea Contents Association, Vol. 14, No. 5(2014).
15 Kim. Y. S. and S. R. Jeong, "Intelligent VOC Analyzing System Using Opinion Mining", Korea Intelligent Information Systems Society, Journal of Intelligence and Information Systems 19(3), (2013), 113-125.   DOI
16 Lee, S. H., J. Choi and J. W. Kim, "Analysis of movie review emotion through customized emotional dictionary construction by domain", J Intell Inform Syst, Vol.22, No.2 (2016), 97-113.   DOI
17 Leung, C. W., Chan, S. C. and Chung, F., "Integrating Collaborative Filtering and Sentiment Analysis: A Rating Inference Approach", ECAI 2006 Workshop on Recommender Systems, (2006), 62-68.
18 Liu, S. M., Chan, J. H., "A multi-label classification based approach for sentiment classification", Expert Systems with Application, Vol. 42, (2005), 1083-1093.   DOI
19 Oh. Y. J. and S. H. Choi, "Movie Rating Inference by Construction of Movie Sentiment Sentence using Movie comments and ratings", Journal of Korean Society for Internet Information, Vol.16, No.2(2015), 41-48.
20 Ricci, F., Rokach, L., Shapira, B. and Kantor, P., Recommender systems handbook, 2011.
21 Sarwar, B., Karypis, G., Konstan, J., and Riedl, J., "Item-based collaborative filtering recommendation algorithms", In Proceedings of the 10th international conference on World Wide Web, (2001), 285-295.
22 Son. J. E., S. B. H., H. J. Kim and S. J. Cho., "Review and Analysis of Recommender Systems, Journal of the Korean Institute of Industrial Engineers, Vol. 41, No. 2(2015), 185-208.   DOI
23 Song. J. S., J. B. Back and S. W. Lee, "Automatic Construction of Positive/Negative Dictionary to Improve Performance of Product Review Classification", Korean Institute of Information Scientists and Engineers, 2010 Korea Computer Congress, Vol. 37, No.1 (2010), 136-137.
24 Yu, E., Y. Kim, N. Kim, and S. R. Jeong, "Predicting the direction of the stock index by using a domain-specific sentiment dictionary," KIIS Journal of Intelligence and Information Systems, Vol. 19, No. 1(2013), 95-110.   DOI
25 Tibshirani, R., "Regression Shrinkage and Selection via the Lasso", Journal of the Royal Statistical Society, Vol. 58, No. 1(1996), 267-288.   DOI
26 Wang, Y., Liu, Y. and Yu, X., "Collaborative Filtering with Aspect-Based Opinion Mining: A Tensor Factorization Approach", Proceedings of 2012 IEEE 12th International Conference on Data Mining (ICDM), (2012), 1152-1157.