Browse > Article

A Document Sentiment Classification System Based on the Feature Weighting Method Improved by Measuring Sentence Sentiment Intensity  

Hwang, Jae-Won (동아대학교 컴퓨터공학과)
Ko, Young-Joong (동아대학교 컴퓨터공학과)
Abstract
This paper proposes a new feature weighting method for document sentiment classification. The proposed method considers the difference of sentiment intensities among sentences in a document. Sentiment features consist of sentiment vocabulary words and the sentiment intensity scores of them are estimated by the chi-square statistics. Sentiment intensity of each sentence can be measured by using the obtained chi-square statistics value of each sentiment feature. The calculated intensity values of each sentence are finally applied to the TF-IDF weighting method for whole features in the document. In this paper, we evaluate the proposed method using support vector machine. Our experimental results show that the proposed method performs about 2.0% better than the baseline which doesn't consider the sentiment intensity of a sentence.
Keywords
Sentiment Classification; Feature Weighting; Sentiment Feature; Sentiment Intensity;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 M. Hu and B. Liu, 'Mining and Summarizing Customer Reviews,' In Proceedings of the KDD, pp. 168-177, 2004
2 P.D. Turney and M.L. Littman, 'Measuring Praise and Criticism: Inference of Semantic Orientation from Association,' In Proceedings of the ACM Transactions on Information Systems, pp. 315-346, 2003   DOI   ScienceOn
3 P. Turney, 'Thumbs up or thumbs down? Sentiment orientation applied to unsupervised classification of reviews,' In Proceedings of the ACL, pp. 417-424, 2002   DOI
4 K. Dave, S. Lawrence, D.M. Pennock, 'Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews,' In Proceedings of the 12th WWW, pp. 519-528, 2003
5 A. Esuli and F. Sebastiani, 'PageRanking WordNet Synsets: An Application to Opinoin Mining,' In Proceedings of the ACL, pp. 424-431, 2007
6 B. Pang, L. Lee and S. Vaithyanathan, 'Thumbs up? Sentiment Classification Using Machine Learning Techniques,' In Proceedings of the EMNLP, pp. 79-86, 2002
7 J. Yi, T. Nasukawa, R. Bunescu, and W. Niblack, 'Sentimental Analyzer : Extracting Sentiments about a Given Topic using Natural Language Processing Techniques,' In Proceedings of International Conference on Data Mining, pp. 427-434, 2003
8 A. Esuli and F. Sebastiani, 'Determining the Semantic Orientation of Terms through Gloss Classification,' In Proceedings of the CIKM, pp. 617-624, 2005   DOI
9 M. Murata, Q. Ma, K. Uchimoto, H. Ozaku, H. Isahara, and M. Utiyama, 'Information Retrieval Using Location and Category Information,' Journal of the Association for Natural Language Processing, Vol.7, No.2, 2000
10 N. Hiroshima, S. Yamada, O. Furuse and R. Kataoka, 'Searching for Sentences Expressing Opinions by Using Declaratively Subjective Clues,' In Proceedings of the Workshop on Sentiment and Subjectivity in Text, pp. 39-46, 2006
11 Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan, 'Identifying sources of opinions with conditional random fields and extraction patterns,' In Proceedings of the HLT/EMNLP, pp. 355-362, 2005   DOI
12 Y. Mao and G. Lebanon, 'Isotonic Conditional Random Fields and Local Sentiment Flow,' In Proceedings of the NIPS, 2007
13 S.M. Kim and E. Hovy, 'Determining the Sentiment of Opinions,' In Proceedings of the COLING conference, pp. 1367-1373, 2004
14 M. Thomas, B. Pang, and L. Lee, 'Get out the vote: Determining support or opposition from congressional floor-debate transcripts,' In Proceedings of the EMNLP, pp. 327-335, 2006
15 E. Riloff and J. Wiebe, 'Learning extraction patterns for subjective expressions,' In Proceedings of the EMNLP, pp. 105-112, 2003   DOI
16 L.W. Ku, L.Y. Lee, T.H. Wu, and H.H. Chen, 'Major Topic Detection and Its Application to Opinion Summarization,' In Proceedings of the ACM SIGIR, pp. 627-628, 2005
17 B. Pang and L. Lee, 'A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts,' In Proceedings of the ACL, pp. 271-278, 2004   DOI
18 황재원, 고영중, '감정 분류를 위한 한국어 감정 자질 추출 기법과 감정 자질의 유용성 평가', 한국정보과학회논문지: 컴퓨팅의 실제 및 레터, 제14권 제3호, pp. 336-340, 2008   과학기술학회마을
19 Y. Ko, J. Park, and J, Seo, 'Automatic Text Categorization using the Importance of Sentences,' In Proceedings of the 19th International Conference on COLING, pp. 474-480, 2002   DOI
20 T. Joachims, 'Text Categorization with Support Vector Machines: Learning with Many relevant Features,' In Proceedings of the ECML, pp. 137- 142, 1998
21 M. Rimon, 'Sentiment Classification: Linguistic and Non-Linguistic Issues,' Hebrew University
22 C. Cortes and V. Vapnik 'Support-Vector Networks,' Machine Learning, Vol.20, pp. 273-297, 1995