Browse > Article
http://dx.doi.org/10.9716/KITS.2014.13.3.221

A Comparative Study between Stock Price Prediction Models Using Sentiment Analysis and Machine Learning Based on SNS and News Articles  

Kim, Dongyoung (숭실대학교 SW특성화대학원)
Park, Jeawon (숭실대학교 SW특성화대학원)
Choi, Jaehyun (숭실대학교 SW특성화대학원)
Publication Information
Journal of Information Technology Services / v.13, no.3, 2014 , pp. 221-233 More about this Journal
Abstract
Because people's interest of the stock market has been increased with the development of economy, a lot of studies have been going to predict fluctuation of stock prices. Latterly many studies have been made using scientific and technological method among the various forecasting method, and also data using for study are becoming diverse. So, in this paper we propose stock prices prediction models using sentiment analysis and machine learning based on news articles and SNS data to improve the accuracy of prediction of stock prices. Stock prices prediction models that we propose are generated through the four-step process that contain data collection, sentiment dictionary construction, sentiment analysis, and machine learning. The data have been collected to target newspapers related to economy in the case of news article and to target twitter in the case of SNS data. Sentiment dictionary was built using news articles among the collected data, and we utilize it to process sentiment analysis. In machine learning phase, we generate prediction models using various techniques of classification and the data that was made through sentiment analysis. After generating prediction models, we conducted 10-fold cross-validation to measure the performance of they. The experimental result showed that accuracy is over 80% in a number of ways and F1 score is closer to 0.8. The result can be seen as significantly enhanced result compared with conventional researches utilizing opinion mining or data mining techniques.
Keywords
Data Mining; Stock Price Prediction; Sentiment Analysis; SNS; Big Data; Machine Learning;
Citations & Related Records
Times Cited By KSCI : 7  (Citation Analysis)
연도 인용수 순위
1 Ahn, S., "Stock Prediction Using News Text Mining and Time Series Analysis", M.S. thesis, The Graduate School of Engineering, Yonsei Univ., Seoul, Korea, 2010.
2 Bollen, J., H. Mao, and X., Zeng, "Twitter mood predicts the stock market", Journal of Computational Science, Vol.2, No.1, 2011, 1-8.   DOI   ScienceOn
3 Ian, H., F. Eibe, and A. Mark, Data Mining, Morgan Kaufmann, Burlington, 2011.
4 Kim, K., G. Lee, and S., Lee, "A Comparative Analysis of Artificial Intelligence System and Ohlson model for IPO firm's Stock Price Evaluation", The Journal of Digital Policy and Management, Vol.11, No.5, 2013, 145-158.
5 Kim, S., D. Nam, H. Jo, and S. Kim, "A Study on the Relation of Web News and Stock Price", Korea Society of IT Services, Vol. 11, No.3, 2012, 191-203.   과학기술학회마을   DOI   ScienceOn
6 Kim, T., W. Jung, and S. Lee, "The Analysis on the Relationship between Firms' Exposures to SNS and Stock Prices in Korea", Asia Pacific Journal of Information Systems, Vol. 24, No.2, 2014, 233-253.   DOI   ScienceOn
7 Kim, Y., "News Big Data Opinion Mining Model for Predicting KOSPI Movement", Ph.D. thesis, Graduate School of Business IT, Kookmin Univ., Seoul, Korea, 2013.
8 Kim, Y., N. Kim, and S. Jung, "Stock-Index Invest Model Using News Big Data Opinion Mining", Journal of Intelligence and Information Systems, Vol.18, No.2, 2012, 143-156.   과학기술학회마을
9 Park, K. and H. Shin, "Stock Price Prediction Based on Time Series Network", Journal of the Korean Operations Research and Management Science Society, Vol.28, No.1, 2011, 53-60.   과학기술학회마을
10 Shim, K. and J. Yang, "High Speed Korean Morphological Analysis based on Adjacency Condition Check", Korean Institute of Information Scientists and Engineers, Vol.31, No.1, 2004, 89-99.   과학기술학회마을
11 Song, C., "News and Financial Prices", International Economic Journal, Vol.8, No.3, 2002, 1-34.
12 Song, J. and S. Lee, "Automatic Construction of Positive/Negative Feature-Predicate Dictionary for Polarity Classification of Product Reviews", Korean Institute of Information Scientists and Engineers, Vol.38, No.3, 2011, 157-168.   과학기술학회마을
13 Yu, E., Y. Kim, N. Kim, and S., Jeong, "Predicting the Direction of the Stock Index by Using a Domain-Specific Sentiment Dictionary", Journal of Intelligence and Information Systems, Vol.19, No.1, 2013, 95-110.   과학기술학회마을   DOI
14 Sagong, J., "A Study on Predicting Stock Price Based on Data Mining Techniques", M.S. thesis, Dept. Data Science, Inje Univ., Gimhae, Korea, 2012.
15 Chun, S., "뉴스 콘텐츠의 오피니언 마이닝을 통한 매체별 주가상승 예측정확도 비교 연구", M. S. thesis, Graduate School of Business IT, Kookmin Univ., Seoul, Korea, 2013.