A Korean Document Sentiment Classification System based on Semantic Properties of Sentiment Words  

Hwang, Jae-Won (Department of Computer Engineering, Dong-A University)
Ko, Young-Joong (Department of Computer Engineering, Dong-A University)
Abstract
This paper proposes a method for improving the performance of a Korean document sentiment classification system by using the semantic properties of sentiment words. A sentiment word is a word that carries sentiment, and sentiment features are defined as a set of sentiment words, which serve as an important lexical resource for sentiment classification. A sentiment feature can have a different sentiment intensity in the general field than in a specific domain. In the general field, the sentiment intensity can be estimated from snippets returned by a search engine, while in a specific domain it can be estimated from training data. The estimated sentiment intensity of a sentiment feature is called its semantic orientation, and it is used to estimate the sentiment intensity of the sentences in a text document. The estimated sentence sentiment intensity is then applied to the weights of the sentiment features. We evaluate our system with a support vector machine in three cases: general, domain-specific, and combined general/domain-specific semantic orientation. The experimental results show improved performance in all three cases; in particular, with the combined general/domain-specific semantic orientation, the proposed method performs 3.1% better than a baseline system indexed by content words only.
Keywords
Sentiment Word; Sentiment Feature; Sentiment Classification; Semantic Orientation
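
The abstract does not give the formulas used, so the following is only a minimal sketch of the domain-specific part of the pipeline under stated assumptions. It assumes documents are already split into tokenized sentences, scores each sentiment word with a smoothed log-ratio of its frequency in positive versus negative training documents (an illustrative stand-in for the paper's semantic orientation), sums the absolute scores to get a sentence intensity, and boosts the weight of every term by the intensity of the sentence it occurs in. All function names, the smoothing constant, and the `1 + intensity` boost are hypothetical choices; the search-engine-snippet estimate for the general field and the SVM classifier itself are not shown.

```python
import math
from collections import Counter


def semantic_orientation(docs, labels, sentiment_words, smoothing=1.0):
    """Score each sentiment word from labeled training documents.

    docs: list of documents, each a list of sentences, each a list of tokens.
    labels: "pos" or "neg" per document.
    Returns a dict word -> score; positive scores lean positive.
    The smoothed log-ratio used here is an assumption, not the paper's formula.
    """
    pos_counts, neg_counts = Counter(), Counter()
    for sentences, label in zip(docs, labels):
        target = pos_counts if label == "pos" else neg_counts
        for sentence in sentences:
            for token in sentence:
                if token in sentiment_words:
                    target[token] += 1
    pos_total = sum(pos_counts.values())
    neg_total = sum(neg_counts.values())
    vocab = len(sentiment_words)
    scores = {}
    for word in sentiment_words:
        p_pos = (pos_counts[word] + smoothing) / (pos_total + smoothing * vocab)
        p_neg = (neg_counts[word] + smoothing) / (neg_total + smoothing * vocab)
        scores[word] = math.log2(p_pos / p_neg)
    return scores


def sentence_intensity(sentence, scores):
    """Sum of absolute orientation scores of the sentiment words in a sentence."""
    return sum(abs(scores[token]) for token in sentence if token in scores)


def weighted_features(sentences, scores):
    """Weight each term occurrence by 1 + the intensity of its sentence."""
    weights = Counter()
    for sentence in sentences:
        boost = 1.0 + sentence_intensity(sentence, scores)
        for token in sentence:
            weights[token] += boost
    return dict(weights)


# Toy training data (invented for illustration only).
train_docs = [
    [["great", "acting"], ["plot", "good"]],
    [["boring", "plot"], ["bad", "acting"]],
]
train_labels = ["pos", "neg"]
lexicon = {"great", "good", "boring", "bad"}

scores = semantic_orientation(train_docs, train_labels, lexicon)
features = weighted_features([["great", "acting"], ["plot"]], scores)
```

In this sketch, terms that share a sentence with strongly oriented sentiment words receive larger weights than terms in neutral sentences. The resulting weight dictionaries could then be vectorized and passed to any SVM implementation, which corresponds to the classification step described in the abstract.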