Browse > Article
http://dx.doi.org/10.4218/etrij.13.0113.0093

Extracting Multiword Sentiment Expressions by Using a Domain-Specific Corpus and a Seed Lexicon  

Lee, Kong-Joo (Department of Information & Communication Engineering, Chungnam National University)
Kim, Jee-Eun (Department of English Linguistics, Hankuk University of Foreign Studies)
Yun, Bo-Hyun (Department of Computer Education, Mokwon University)
Publication Information
ETRI Journal / v.35, no.5, 2013 , pp. 838-848 More about this Journal
Abstract
This paper presents a novel approach to automatically generate Korean multiword sentiment expressions by using a seed sentiment lexicon and a large-scale domain-specific corpus. A multiword sentiment expression consists of a seed sentiment word and its contextual words occurring adjacent to the seed word. The multiword sentiment expressions that are the focus of our study have a different polarity from that of the seed sentiment word. The automatically extracted multiword sentiment expressions show that 1) the contextual words should be defined as a part of a multiword sentiment expression in addition to their corresponding seed sentiment word, 2) the identified multiword sentiment expressions contain various indicators for polarity shift that have rarely been recognized before, and 3) the newly recognized shifters contribute to assigning a more accurate polarity value. The empirical result shows that the proposed approach achieves improved performance of the sentiment analysis system that uses an automatically generated lexicon.
Keywords
Sentiment analysis; multiword sentiment expression; seed lexicon; domain-specific corpus; polarity shift; contextual words;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Choi and C. Cardie, "Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis," Proc. Conf. Empirical Methods Natural Language Process., 2008, pp. 793-801.
2 S.-J. Chang, Korean, Philadelphia, PA: John Benjamins Publishing Company, 1996.
3 Y. Cho and K. Lee, "Automatic Affect Recognition Using Natural Language Processing Techniques and Manually Built Affect Lexicon," IEICE Trans. Inf. Syst., vol. E89D, no. 12, 2006, pp. 2964-2971.
4 M. Dong and R. Kothari, "Classifiability Based Pruning of Decision Trees," Proc. Neural Netw., 2001, pp. 1739-1743.
5 H. Kanayama and T. Nasukawa, "Fully Automatic Lexicon Expansion for Domain-Oriented Sentiment Analysis," Proc. Conf. Empirical Methods Natural Language Process., 2006, pp. 355-363.
6 S. Lee and J. Seo, "Grammatical Relations Identification of Korean Parsed Texts Using Support Vector Machines," LNCS, vol. 3206, 2004, pp. 121-128.
7 Y. Lu et al., "Automatic Construction of a Context-Aware Sentiment Lexicon: An Optimization Approach," Proc. 20th Int. Conf. World Wide Web, 2011, pp. 347-356.
8 I. Milevskiy and J.-Y. Ha, "A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors," J. Comput. Sci. Eng., vol. 5, no. 3, Sept. 2011, pp. 161-166.   DOI   ScienceOn
9 R. Quirk et al., A Comprehensive Grammar of the English Language, New York: Longman, 1985.
10 M. Taboada et al., "Lexicon-Based Methods for Sentiment Analysis," Comput. Linguistics, June 2011, vol. 37, no. 2, pp. 267-307.   DOI