Browse > Article
http://dx.doi.org/10.9717/kmms.2014.17.6.671

An Extraction Method of Sentiment Infromation from Unstructed Big Data on SNS  

Back, Bong-Hyun (Dept. of Computer Engineering, Yeungnam University)
Ha, Ilkyu (Dept. of Computer Engineering, Yeungnam University)
Ahn, ByoungChul (Dept. of Computer Engineering, Yeungnam University)
Publication Information
Abstract
Recently, with the remarkable increase of social network services, it is necessary to extract interesting information from lots of data about various individual opinions and preferences on SNS(Social Network Service). The sentiment information can be applied to various fields of society such as politics, public opinions, economics, personal services and entertainments. To extract sentiment information, it is necessary to use processing techniques that store a large amount of SNS data, extract meaningful data from them, and search the sentiment information. This paper proposes an efficient method to extract sentiment information from various unstructured big data on social networks using HDFS(Hadoop Distributed File System) platform and MapReduce functions. In experiments, the proposed method collects and stacks data steadily as the number of data is increased. When the proposed functions are applied to sentiment analysis, the system keeps load balancing and the analysis results are very close to the results of manual work.
Keywords
Big Data; Sentiment Analysis; SNS; Unstructed data analysis;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 S. Gilbert and N. Lynch, "Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-tolerant Web Services," ACM SIGACT, Vol. 33, No. 2, pp. 51-59, 2002.   DOI
2 J. Dean and S. Ghemawat, "MapReduce; Simplified Data Processing on Large Clusters," Communications of the ACM, Vol. 51, No. 1, pp. 107-113, 2008.
3 N. Godbole and S. Skiena, "Large-Scale Sentiment Analysis for News and Blogs," Proceeding of the ICWSM'2007, pp.1-4, 2007.
4 A. Pak and P. Paroubek, "Twitter as a Corpus for Sentiment Analysis and Opinion Mining," Proceeding of the LREC'2010, pp.1320-1326, 2010.
5 H. Tang, S. Tan, and X. Cheng, "A Survey on Sentiment Detection of Reviews," Expert Systems with Applications, Vol. 36, pp. 10760-10773, 2009.   DOI   ScienceOn
6 S. Mukherjee and P. Bhattacharyya, "Sentiment Analysis in Twitter with Lightweight Discourse Analysis," Proceeding of COLING 2012, pp. 1847-1864, 2012.
7 L. George, HBase: The Definitive Guide, O'REILLY, Sebastopol, Calif., 2011.
8 K. Chodorow, MongoDB: The Definitive Guide 2nd Edition, O'REILLY, 2013.
9 B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, Vol. 2, No. 1-2, pp. 1-135, 2008.   DOI
10 Hadoop ECO system, http://www.revelytix.com/?q=content/hadoop-ecosystem (accessed Jan., 27, 2014)
11 J. Han and K. Du, "Survey on NoSQL Database," Proceeding of 6th International Conference on Pervasive Computing and Applications, pp. 363-366, 2011.
12 M. Song and S. Kim, "A Study of Improving on Prediction Model by Analyzing Method Big data," The Journal of Digital Policy & Management, Vol. 11, No. 6, pp. 103-112, 2013.   과학기술학회마을
13 B. Lee, J. Lim, and J. Yoo, "Utilization of Social Media Analysis using Big Data," Journal of the Korea Contents Association, Vol. 13, No. 2, pp. 211-219, 2013.   과학기술학회마을   DOI   ScienceOn
14 C.S. Lee and M.H. Wang, "Automated Ontology Construction for Unstructured Text Documents," Data & Knowledge Engineering, Vol. 60, Issue 3, pp. 547-566, 2007.   DOI   ScienceOn
15 F. Chang and R.E. Gruber, "Bigtable: A Distributed Storage System for Structured Data," ACM Transactions on Computer System, Vol. 26, Issue 2, pp.1-26, 2008.
16 S. Sivasubramanian, "Amazon dynamoDB: A Seamlessly Scalable Non-relational Database Service," Proceeding of the 2012 ACM SIGMOD'12, pp. 729-730, 2012.
17 Big Data: The Next Frontier for Innovation, Competition, and Productivity(2011), http://www. mckinsey.com/ (accessed Jan., 27, 2014)
18 S. Kim and B. Hwang, "Propensity Analysis of Political Attitude of Twitter Users by Extracting Sentiment from Timeline," Journal of Korea Multimedia Society, Vol. 17, No. 1, pp. 43-51, 2014.   과학기술학회마을   DOI   ScienceOn