DOI QR코드

DOI QR Code

Developing a Sentiment Analysing and Tagging System

감성 분석 및 감성 정보 부착 시스템 구현

  • 이현규 (한국교통대학교 컴퓨터정보공학과) ;
  • 이성욱 (한국교통대학교 컴퓨터정보공학과)
  • Received : 2016.03.21
  • Accepted : 2016.04.28
  • Published : 2016.08.31

Abstract

Our goal is to build the system which collects tweets from Twitter, analyzes the sentiment of each tweet, and helps users build a sentiment tagged corpus semi-automatically. After collecting tweets with the Twitter API, we analyzes the sentiments of them with a sentiment dictionary. With the proposed system, users can verify the results of the system and can insert new sentimental words or dependency relations where sentiment information exist. Sentiment information is tagged with the JSON structure which is useful for building or accessing the corpus. With a test set, the system shows about 76% on the accuracy in analysing the sentiments of sentences as positive, neutral, or negative.

본 연구의 목적은 트위터에서 수집된 트윗들의 감성을 분석하고 각 문장의 감성 정보를 반자동으로 부착하여 감성 말뭉치를 구축할 수 있는 시스템의 구현이다. 트위터 API를 이용해 트윗을 수집한 후 각 트윗이 어떤 감성을 갖는지 감성사전을 이용해 분석한다. 사용자는 감성 분석 결과를 확인하고 누락된 감성 정보를 추가하거나 의존구조 사이에 존재하는 감성 정보를 추가할 수 있다. 감성 정보는 JSON 구조로 부착함으로써 감성 말뭉치 구축 및 활용에 용이하게 하였다. 제안 시스템은 긍정, 부정, 중립 문장에 대한 감성 분석 결과 약 76%의 성능을 보였다.

Keywords

References

  1. Gyoung-Ho Lee and Kong Joo Lee, "Design of a Reputation System for Twitter," Proceedings of the 24th Annual Conference on Human and Cognitive Language Technology, pp.62-66, 2012.
  2. Johan Bollen, Huina Mao, and Xiao-Jun Zeng, "Twitter mood predicts the stock market," Journal of Computational Science, Vol.2, No.1, pp.1-8, 2011. https://doi.org/10.1016/j.jocs.2010.12.007
  3. Sitaram Asur and Bernardo A. Huberman, "Predicting the Future With Social Media," Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, Vol.1, pp.492-499, 2010.
  4. Gyoung-Ho Lee and Kong Joo Lee, "Twitter Sentiment Analysis for the Recent Trend Extracted from the Newspaper Article," The KIPS Transactions Part B, Vol.2B No.10, pp.731-738, 2013
  5. Richard Socher, Alex Perelygin, Jean Y Wu, Jason Chuang, Christopher D Manning, Andrew Y Ng, and Christopher Potts Potts, "Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank," Proceedings of Conference on Empirical Methods on Natural Language Processing, 2013.
  6. R. Feldman, "Techniques and applications for sentiment analysis," Communications of the ACM, Vol.56, No.4, pp.82-89, 2013. https://doi.org/10.1145/2436256.2436274
  7. Kong Joo Lee, "Compositional rules of Korean auxiliary predicates for sentiment analysis," Journal of the Korean Society of Marine Engineering, Vol.37, No.3, pp.291-299, 2013. https://doi.org/10.5916/jkosme.2013.37.3.291
  8. Bing Liu, Minqing Hu, and Junsheng Cheng, "Opinion Observer: Analyzing and Comparing Opinions on the Web," Proceedings of the 14th International World Wide Web Conference, pp.342-451, 2005.
  9. Ana-Maria Popescu and Oren Etzioni, "Extracting Product Features and Opinions from Reviews," Proceedings of Conference on Empirical Methods on Natural Language Processing, pp.339-346, 2005.
  10. Brendan O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith, "From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series," Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media, pp.122-129, 2010.
  11. Andrea Esuli and Fabrizio Sebastiani, "SENTIWORDNET: A publicly available lexical resource for opinion mining," in Proceedings of the 5th Conference on Language Resources and Evaluation (LREC '06), pp.417-422, 2006.
  12. Woo Chul Lee, Hyun Ah Lee, and Kong Joo Lee, "Product Evaluation Summarization Through Linguistic Analysis of Product Reviews," The KIPS Transactions Part B, Vol.17B No.1, pp.93-98, 2010. https://doi.org/10.3745/KIPSTB.2010.17B.1.093
  13. P. D. Turney, "Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews," Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL '02), pp.417-424, 2002.
  14. Johan Bollen, Huina Mao, and Xiao-Jun Zeng, "Twitter mood predicts the stock market," Journal of Computational Science, Vol.2, No.1, pp.1-8, 2011. https://doi.org/10.1016/j.jocs.2010.12.007
  15. Alexander Pak and Patrick Paroubek, "Twitter as a Corpus for Sentiment Analysis and Opinion Mining," Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC '10), pp.1320-1326, 2010.
  16. Alec Go, Richa Bhayani, and Lei Huang, "Twitter Sentiment Classification using Distant Supervision," Technical Report CS224N, Stanford University, 2009.
  17. Songwook Lee, "Sentiment Analysis System Using Stanford Sentiment Treebank," Journal of the Korean Society of Marine Engineering, Vol.39, No.3, pp.274-279, 2015. https://doi.org/10.5916/jkosme.2015.39.3.274
  18. Kong Joo Lee and Songwook Lee, "Error-driven Noun-Connection Rule Extraction for Morphological Analysis," Journal of the Korean Society of Marine Engineering, Vol.36, No.8, pp.1123-1128, 2012. https://doi.org/10.5916/jkosme.2012.36.8.1123