Browse > Article
http://dx.doi.org/10.5392/JKCA.2012.12.11.030

Constructing an Evaluation Set for Korean Sentiment Analysis Systems Incorporating the Category and the Strength of Sentiment  

Kim, Do-Yeon (전남대학교 전자컴퓨터공학과)
Wu, Yong (전남대학교 전자컴퓨터공학과)
Park, Hyuk-Ro (전남대학교 전자컴퓨터공학과)
Publication Information
Abstract
Sentiment analysis is concerned with extracting and analyzing different kinds of user sentiment expressed in a variety of social media such as blog and twitter. Although sentiment analysis techniques are actively studied for these days, evaluation sets are not developed yet for Korean sentiment analysis. In this paper, we constructed an evaluation set for Korean sentiment analysis. To evaluate sentiment analysis systems more throughly, each sentence in our evaluation set is tagged with the polarity of the sentiment as well as the category and the strength of the sentiment. We divide kinds of sentiment into 7 positive categories and 15 negative categories. Each category is given the strength of the sentiment from 1 to 3. Our evaluation set consists of 3,270 sentences extracted from various social media. For each sentence, 5 human taggers assigned the category and the strength of the sentiment expressed in the sentence. The ratio of inter-taggers agreement was 93% in the polarity, 70% in the category, 58% in the strength of sentiment. The ratio of inter-taggers agreement our evaluation set is a bit higher than other evaluation sets developed for German and Spanish. This result shows our evaluation set can be used as a reliable resource for the evaluation of sentiment analysis systems.
Keywords
Sentiment Analysis; Sentiment Strength; Evaluation Set;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 B. Pang, and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval: Vol.2, No.1-2, pp.1-135, 2008.   DOI
2 김정호, 김명규, 차명훈, 인주호, 채수환, "한국어 특성을 고려한 감성 분류", 한국감성과학회지 제13권, 제3호, pp.449-458, 2010.   과학기술학회마을
3 H. Tang, S. Tan, and X. Cheng, " A survey on sentiment detection of reviews," Expert Systems with Applications, Vol.36, pp.10760-10773, 2009.   DOI   ScienceOn
4 김은영, 국어 감정 동사 연구, 전남대학교 대학원, 박사학위논문, 2004.
5 P. Harland, "HOW THE BRAIN FEELS," Emotion and Cognition in Neuro-Linguistic Psychotherapy, Rapport, Journal of the Association for NLP (UK), Issue 57, 2002.
6 R. Plutchik and H. Kellerman, Emotion: Theory, research, and experience: Vol.1, Theories of emotion.1, New York: Academic, 1980.
7 http://www.cs.cornell.edu/people/pabo/movie-review-data
8 http://condensr.com
9 http://www.wjh.harvard.edu/-inquirer/homecat.htm
10 김기홍, "감정언어와 그의 문법성 고찰", 동서문화 11, pp.161-181, 1979.
11 C. E. Osgood,, "Cross-Cultural comparability in Attitude Measurement via Muttilingual Semantic Differentials," in Social Psychology, pp.95-106, 1965.
12 손춘섭, "정도부사의 의미와 기능에 대한 고찰," 한국어의미학회, 한국어의미학, 제9권, pp.97-130, 2001.
13 J. M. Schulz, C. Womser-Hacker, and T. Mandl, "Multilingual corpus development for opinion mining," In Proc. of LREC'10, pp.3409-3412, 2010.
14 김재원, 곽훈성, 장재우, "감성어의 비중처리와 퍼지추론에 의한 평가 방법," 한국콘텐츠학회논문지, 제9권, 제1호, pp.30-35, 2011.
15 http://www.wordnet.co.kr/