Browse > Article
http://dx.doi.org/10.13088/jiis.2016.22.4.217

Reliability Analysis of VOC Data for Opinion Mining  

Kim, Dongwon (Shipping Management, Korea Maritime & Ocean University)
Yu, Song Jin (Shipping Management, Korea Maritime & Ocean University)
Publication Information
Journal of Intelligence and Information Systems / v.22, no.4, 2016 , pp. 217-245 More about this Journal
Abstract
The purpose of this study is to verify how 7 sentiment domains extracted through sentiment analysis from social media have an influence on business performance. It consists of three phases. In phase I, we constructed the sentiment lexicon after crawling 45,447 pieces of VOC (Voice of the Customer) on 26 auto companies from the car community and extracting the POS information and built a seven-sensitive domains. In phase II, in order to retain the reliability of experimental data, we examined auto-correlation analysis and PCA. In phase III, we investigated how 7 domains impact on the market share of three major (GM, FCA, and VOLKSWAGEN) auto companies by using linear regression analysis. The findings from the auto-correlation analysis proved auto-correlation and the sequence of the sentiments, and the results from PCA reported the 7 sentiments connected with positivity, negativity and neutrality. As a result of linear regression analysis on model 1, we indentified that the sentimental factors have a significant influence on the actual market share. In particular, not only posotive and negative sentiment domains, but neutral sentiment had significantly impacted on auto market share. As we apply the availability of data to the market, and take advantage of auto-correlation of the market-related information and the sentiment, the findings will be a huge contribution to other researches on sentiment analysis as well as actual business performances in various ways.
Keywords
Auto-correlation; PCA; Sentiment Analysis; Linear Regression Analysis;
Citations & Related Records
Times Cited By KSCI : 6  (Citation Analysis)
연도 인용수 순위
1 T. Loughran, B. McDonald. "When is a Liability not a Liability? Textual Analysis Dictionaries, and 10-Ks," Journal of Finance, Vol.661, No.1(2011), 35-65.
2 Abdi. H., & Williams, L.J, "Principal Component Analysis". Wiley Interdisciplinary Reviews: Computational Statistics, Vol.2, No.4(2010), 433-459.   DOI
3 A. Esuli and F. Sebastiani, "Sentiwordnet: A Publicly Available Lexical Resource for Opinion Mining," LREC (2006), 417-422.
4 An J. K. and H. W. Kim, "Building a Korean Sentiment Lexicon Using Collective Intelligence," Journal of Intelligent Information Systems, Vol.21, No.2(2015), 49-67.   DOI
5 A. Woolridge, "Social media provides huge opportunities, but will bring huge problems," Economist, (2011), 50.
6 B.J. Finch, "Internet Discussions as a Source for Consumer Product Customer Involvement and Quality Information: an Exploratory Study," Journal of Operations Management, Vol.17, No.5(1999), 535-556.   DOI
7 B. Kessler, G. Numberg, and H. Schutze, "Automatic Detection of Text Genre," Meeting of the Association for Computational Linguistics (1997), 32-38.
8 B. Kujawski, J. Holyst, and G. J. Rodgers, "Growing Trees in Internet News Groups and Forums," Physical Review, Vol.76 (2007), 103.
9 B. Pang, L. Lee, and S. Vaithyanathan, "Thumbs up?: Sentiment Classification Using Machine Learning Techniques," The ACL-02 conference on Empirical Methods in Natural Language Processing, Vol.10 (2002), 79-86.
10 B. Tronvoll, "Negative Emotions and Their Effect on Customer Complaint Behavior," Journal of Service Management, Vol.22 (2011), 111-134.   DOI
11 Choi, Y.-J. and H. Choi, "A Study on the Customer Satisfaction Strategies of the Online Company Using VOC," Journal of Korean Industrial Economics and Business, Vol.3, No.1(2011), 73-93.
12 C. Whitelaw, N. Garg, and S. Argamon, "Using Appraisal Groups for Sentiment Analysis," The 14th ACM International Conference on Information and Knowledge Management, (2005), 625-631.
13 David A. Freedman. "Statistical Models: Theory and Practice". Cambridge University Press, 2009, 26.
14 D.E. O'Leary, "Blog Mining-Review and Extensions: from Each according to His Opinion," Decision Support Systems, Vol.51, No.4(2011), 821-830.   DOI
15 D. Ward, P.H. Jesty, R.S. Rivett. "Decomposition Scheme in Automotive Hazard Analysis," SAE International Journal of Passenger Cars- Mechanical Systems, Vol.2, No.1(2009), 803-813.   DOI
16 E. Diener, H. Smith, and F. Fujita, "The Personality Structure of Affect," Journal of Personality and Social Psychology, Vol.69(1995), 130.   DOI
17 E. Spertus, "Smokey: Automatic Recognition of Hostile Messages," The National Conference on Artificial Intelligence, (1997), 1058-1065.
18 G. M. Ljung; G. E. P. Box, "On a Measure of a Lack of Fit in Time Series Models," Biometrika, Vol.65, No.2(1978), 297-303.   DOI
19 G. A. Miller, "WordNet: a Lexical Database for English," Communications of the ACM, Vol.38 (1995).
20 Gerald M. Katz, "One Right Way to Gather the Voice of the Customer," PDMA Visions Magazine, (2001).
21 Hanjun Lee, JinYoung Han, Yongmoo Suh, "Gift or Threat? An Examination of Voice of the Customer: The Case of MyStarbucksIdea.com," Electronic Commerce Research and Applications, Vol.13 (2014), 205-219.   DOI
22 Hilary L. Seal."The Historical Development of the Gauss Linear Model", Biometrika, Vol.54, No.1/2(1967), 1-24.   DOI
23 Hyun Won Jung, Ken Nah, A Study on the Meaning of Sensibility and Vocabulary System for Sensibility Evaluation, Journal of the Ergonomics Society of Korea, Vol.26, No.3(2007), 17-25.   DOI
24 J. Bollen, H. Mao, and X. Zeng, "Twitter Mood Predicts the Stock Market," Journal of Computational Science, Vol.2 (2011), 1-8.   DOI
25 J. S. Lerner and D. Keltner, "Beyond valence: Toward a Model of Emotion-specific Influences on Judgment and Choice," Cognition & Emotion, Vol.14 (2000), 473-493.   DOI
26 Jo H. J., J. H. Seo and J. T. Choi, "OAR Algorithm Technology Based on Opinion Mining Utilizing Stock News Contents," Journal of Korean Institute of Information Technology, Vol.13, No.2(2015), 111-119.
27 Jung, "The Influence of Negative Emotions on Customer Contribution to Organizational Innovation in an Online Brand Community," Journal of Korean Society for Internet Information, Vol.14, No.4(2013), 91-100
28 Liu, Bing, "Sentiment Analysis and Subjectivity," Handbook of Natural Language Processing 2, (2010), 627-666.
29 K. Coussement, D. Van den Poel, "Improving Customer Complaint Management by Automatic Email Classification using Linguistic style Features as Predictors," Decision Support Systems, Vol.44, No.4 (2008), 870-882.   DOI
30 Kim, Y., N. Kim, and S. R. Jeong, "Stock-index Invest Model Using News Big Data Opinion Mining," Journal of Intelligence and Information Systems, Vol.18, No.2(2012), 143-156.   DOI
31 L.. Venkata Subramaniam, Tanveer A. Faruquie, Shajith Ikbal, Shantanu Godbole, Mukesh K. Mohania, "Business Intelligence from Voice of Customer," IEEE International Conference on Data Engineering (2009).
32 L. Zhuang, F. Jing, X. Y. Zhu, and L. Zhang, "Movie Review Mining and Summarization," Conference on Information and Knowledge Management: Proceedings of the 15 th ACM International Conference on Information and Knowledge Management, (2006), 43-50.
33 M. Thelwall, K. Buckley, and G. Paltoglou, "Sentiment in Twitter Events," Journal of the American Society for Information Science and Technology, Vol.62 (2011), 406-418.   DOI
34 N.C. Romano, C. Donovan, H. Chen, J. Nunamaker, "A Methodology for Analyzing Web-based Qualitative Data," Journal of Management Information Systems, Vol.19(4) (2003), 213-246.   DOI
35 N. Li and D. D. Wu, "Using Text Mining and Sentiment Analysis for Online Forums Hotspot Detection and Forecast," Decision Support Systems, Vol.48 (2010), 354-368.   DOI
36 P.C. Tetlock, M. Saar-Tsechansky, S. Macskassy, "More than Words: Quantifying Language to Measure Firms' Fundamentals," Journal of Finance, Vol.63, No.3(2008), 1437-1467   DOI
37 R.P. Schumaker, H. Chen, "Textual Analysis of Stockmarket Prediction Using Breaking Financial News: the AZFin Text System," ACM Transactions on Information Systems, Vol.27, No.2(2009).
38 P.H. Jesty, K.M. Hobley, R. Evans, I. Kendall, Safety Analysis of Vehicle-based Systems, in: F. Redmill, T. Anderson (Eds.). "Lessons in System Safety, Proceedings of the 8th Safety-Critical Systems Symposium (SCSS)," Springer, London, 2000.
39 Pearson, K, "Onlines and Planes of Closest Fit to Systems of Points in Space," Philosophical Magazine, Vol.2, No.11(1901), 559-572.   DOI
40 R.G. Vedder, M.T. Vanecek, C.S. Guynes, J.J. Cappel. "CEO and CIO Perspectives on Competitive Intelligence," Communications of the ACM, Vol.42, No.8(1999), 108-116.   DOI
41 S. Argamon, M. Koppel, and G. Avneri, "Routing Documents according to Style," First International Workshop on Innovative Information Systems, (1998), 85-92.
42 Song J. S., and S. W. Lee, "Automatic Construction of Positive/Negative FeaturePredicate Dictionary for Polarity Classification of Product Reviews," Journal of KIISE: Software and Applications, Vol.38, No.3(2013), 157-168.
43 S. Spangler, J. Kreulen, "Mining the Talk: Unlocking the Business Value in Unstructured Information," IBM Press, 2008.
44 Takeuchi, H., L. V. Subramaniam., T. Nasukawa, S. Roy, "Getting Insights from the Voices of Customers : Conversation Mining at a Contact Center," Information Science, Vol.179, No.11(2009), 1584-1591.   DOI
45 Turney P. D. and M.L. Littman, "Unsupervised Learning of Semantic Orientation from a Hundred-Billion-word Corpus," National Research Council, Institute for Information Technology, Technical Report (2002), ERB-1094.
46 Yu E. J., Y. S. Kim, N. Y. Kim and S. R. Jeong, "Predicting the Direction of the Stock Index by Using a Domain-specific Sentiment Dictionary," Journal of Intelligent Information Systems, Vol.19, No.1(2013), 95-10   DOI
47 T. Nasukawa and J. Yi, "Sentiment Analysis: Capturing Favorability Using Natural Language Processing," The 2nd International Conference on Knowledge Capture, (2003), 70-77.
48 T. Wilson, J. Wiebe, and P. Hoffmann, "Recognizing Contextual Polarity: An Exploration of Features for Phrase-level Sentiment Analysis," Computational Linguistics, Vol.35 (2009), 399-433.   DOI
49 W. Duan, B. Gu, A.B. Whinston, "Do online reviews matter? - An Epirical Investigation of Panel Data," Decision Support Systems, Vol.45, No.4(2008), 1007-1016.   DOI
50 Yune, H., H.-J. Kim, J.-Y. Chang, "An Efficient Search Method of Product Review Using Opinion Mining Techniques," Journal of KIISE : Computing Practices and Letters, Vol.16, No.2(2010), 222-226.
51 Zhuang, L., F. Jing, and X. Y. Zhu, "Movie Review Mining and Summarization," Proceedings of the 15th ACM International Conference on Information and Knowledge Management, (2006), 43-50.