Browse > Article
http://dx.doi.org/10.7236/JIWIT.2012.12.5.243

An Experimental Evaluation of Short Opinion Document Classification Using A Word Pattern Frequency  

Chang, Jae-Young (Dept. of Computer Engineering, Hansung University)
Kim, Ilmin (Dept. of Computer Engineering, Hansung University)
Publication Information
The Journal of the Institute of Internet, Broadcasting and Communication / v.12, no.5, 2012 , pp. 243-253 More about this Journal
Abstract
An opinion mining technique which was developed from document classification in area of data mining now becomes a common interest in domestic as well as international industries. The core of opinion mining is to decide precisely whether an opinion document is a positive or negative one. Although many related approaches have been previously proposed, a classification accuracy was not satisfiable enough to applying them in practical applications. A opinion documents written in Korean are not easy to determine a polarity automatically because they often include various and ungrammatical words in expressing subjective opinions. Proposed in this paper is a new approach of classification of opinion documents, which considers only a frequency of word patterns and excludes the grammatical factors as much as possible. In proposed method, we express a document into a bag of words and then apply a learning algorithm using a frequency of word patterns, and finally decide the polarity of the document using a score function. Additionally, we also present the experiment results for evaluating the accuracy of the proposed method.
Keywords
Data Mining; Opinion Mining; Classification; Sentiment Analysis;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 J. Y. Chang, "A Sentiment Analysis Algorithm for Automatic Product Reviews Classification in On-Line Shopping Mall", Journal of Korea Society for E-Business Studies, Vol. 14, No. 4, 2009.   과학기술학회마을
2 J. Y. Chang, J. M. Kim, S, Y, Lee, "Automatic Classification of Korean Movie Reviews Using a Word Pattern Frequency", Proc. of 2012 Korea Computer Congress, 2012.
3 S. S. Kang, Korean Morpheme Analysis and Information Retrieval, HongRung Publishing Company, 2003.
4 C. Park, D. Seong, K. Lee, "Automatic IPC Classification for Patent Documents using Machine Learning", Journal of Korean Institute of Information Technology, Vol. 10, No. 4, 2011.
5 J. Shim, H. C. Lee, "The Development of Automatic Ontology Generation System Using Extended Search Keywords" Journal of the Korea Academia-Industrial cooperation Society, Vol. 11, no. 6, 2009.
6 B. Liu , M. Hu , and J. Cheng, "Opinion observer: analyzing and comparing opinions on the Web", Proceedings of the 14th international conference on WWW, pp. 10-14, 2005.
7 C. Scaffidi, K. Bierhoff, E. Chang, M. Felker, H. Ng, and C. Jin, "Red Opal: Product-Feature Scoring from Reviews", Proceedings of the 8th ACM conference on Electronic commerce, pp. 11-15, 2007.
8 Xiaowen Ding, and Bing Lui, "The Utility of Linguistic Rules in Opinion Mining", SIGIR 2007, pp. 811-812, 2007.
9 E. Courses, and T. Surveys, "Using SentiWordNet for multilingual sentiment analysis", Data Engineering Workshop ICDEW 2008, 2008.
10 Q. Miao, Q. Li, and R. Dai, "A sentiment mining and retrieval system", Expert Systems with Applications, Vol.36, pp. 7192-7198, 2009.   DOI
11 J. O. Kim, S. S. Lee, W, S, Yong, "Automatic Opinion Classification Of Korean Text", Journal of KIISE: Database, Vol. 38, No. 6, Dec., 2011.
12 J. S. Myoung, D. J. Lee, S. G. Lee, "A Korean Product Review Analysis System Using a Semi-Automatically Constructed Semantic Dictionary", Journal of KIISE, Vol. 35, No. 6, 2008.   과학기술학회마을
13 H. H. Kang, S. J. Yoo, S. I, Han, "Automatic Extraction of Opinion Words from Korean Product Reviews Using the k-Structure", Journal of KIISE, Vol. 37, No. 6, 2010.   과학기술학회마을