Browse > Article
http://dx.doi.org/10.5351/KJAS.2020.33.1.075

Identification of sentiment keywords association-based hotel network of hotel review using mapper method in topological data analysis  

Jeon, Ye-Seul (Department of Applied Statistics, Yonsei University)
Kim, Jeong-Jae (Artificial Intelligence Lab, Daumsoft Inc.)
Publication Information
The Korean Journal of Applied Statistics / v.33, no.1, 2020 , pp. 75-86 More about this Journal
Abstract
Hotel review data can extract various information that includes purchasing factors that lead to consumption, advantages, and disadvantages for hotels. In particular, the sentiment keyword of the review data helps consumers understand the pros and cons of hotels. However, it is not efficient for consumers to read a large number of reviews. Therefore, it is necessary to offer a summary review to customers. In this study, we suggest providing summary information on sentiment keywords association as well as a network of hotels based on sentiment keywords. Based on a sentiment keyword dictionary, the extracted sentiment keywords associations construct the hotel network through topological data analysis based mapper. This hotel network allows a consumer to find some hotels associated with specific sentiment keywords as well as recommends the same related hotels. This summary information provides users with a summarized emotional assessment of hotels and helps hotel marketing teams understand consumers' perceptions of their hotel.
Keywords
topological data analysis; mapper; sentiment analysis; sentiment keywords association-based hotel network;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Guo, W. and Banerjee, A. G. (2017). Identification of key features using topological data analysis for accurate prediction of manufacturing system outputs, Journal of Manufacturing Systems, 43, 225-234.   DOI
2 Ha, H. J., Kim, G. N., and Lee, K. W. (2013). A study on analysis of affective words in movie reviews and the situation of watching movies, Design Convergence Study, 12, 17-32.
3 Jung, M. J., Shin, S. M., and Choi, Y. S. (2019). Creation and clustering of proximity data for text data analysis, The Korean Journal of Applied Statistics, 32, 451-462.   DOI
4 Kruskal, J. (1964). Nonmetric multidimensional scaling: a numerical method, Psychometrika, 29, 115-129.   DOI
5 Maaten, L. V. D. and Hinton, G. (2008). Visualizing data using t-SNE, Journal of Machine Learning Research, 9, 2579-2605.
6 Singh, G., Memoli, F., and Carlsson, G. E. (2007). Topological methods for the analysis of high dimensional data sets and 3D object recognition, SPBG.
7 Zomorodian, A. and Carlsson, G. (2005). Computing persistent homology, Discrete and Computational Geometry, 33, 249-274.   DOI