DOI QR코드

DOI QR Code

An Accurate Stock Price Forecasting with Ensemble Learning Based on Sentiment of News

뉴스 감성 앙상블 학습을 통한 주가 예측기의 성능 향상

  • Received : 2021.10.11
  • Accepted : 2021.12.06
  • Published : 2022.02.28

Abstract

Various studies have been conducted from the past to the present because stock price forecasts provide stability in the national economy and huge profits to investors. Recently, there have been many studies that suggest stock price prediction models using various input data such as macroeconomic indicators and emotional analysis. However, since each study was conducted individually, it is difficult to objectively compare each method, and studies on their impact on stock price prediction are still insufficient. In this paper, the effect of input data currently mainly used on the stock price is evaluated through the predicted value of the deep learning model and the error rate of the actual stock price. In addition, unlike most papers in emotional analysis, emotional analysis using the news body was conducted, and a method of supplementing the results of each emotional analysis is proposed through three emotional analysis models. Through experiments predicting Microsoft's revised closing price, the results of emotional analysis were found to be the most important factor in stock price prediction. Especially, when all of input data is used, error rate of ensembled sentiment analysis model is reduced by 58% compared to the baseline.

Keywords

Acknowledgement

이 연구는 2021년도 정부 (과학기술정보통신부)의 재원으로 한국연구재단의 지원을 받아 수행된 연구임 (NRF-2020R1A2C1014768 중견연구자지원사업).

References

  1. E. F. FAMA, "Efficient Capital Markets a Review of Theory and Empirical Work," The Fama Portfolio, pp.76-121, 2021.
  2. M. D. Godfrey, C. W. J. Granger, O. Morgenstern, "The Random Walk Hypothesis of Stock Market Behavior," Kyklos, Vol. 17, No. 1, pp. 1-30, 1964. https://doi.org/10.1111/j.1467-6435.1964.tb02458.x
  3. T. Gao, Y. Chai, Y. Liu, "Applying Long Short Term Momory Neural Networks for Predicting Stock Closing Price," 2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS). pp. 575-578, 2017.
  4. J. G. De Gooijer, R. J. Hyndman, "25 years of IIF time Series Forecasting: a Selective Review," International Journal of Forecasting Vol. 22, No. 3, pp. 443-473, 2006. https://doi.org/10.1016/j.ijforecast.2006.01.001
  5. S. Hochreiter, J. Schmidhuber, "Long Short-Term Memory," in Neural Computation, Vol. 9, No. 8, pp. 1735-1780, 1997. https://doi.org/10.1162/neco.1997.9.8.1735
  6. K. Cho, B. Van Merrienboer, D. Bahdanau, "On the Properties of Neural Machine Translation: Encoder-decoder Approaches," arXiv preprint arXiv:1409.1259., 2014.
  7. Y. S. Kim, J. S. Cha, "Analysis of the Relationship Between Stock Prices and Macroeconomic Variables," Bank Of Korea, pp. 99-12 ,1999 (in Korean).
  8. P. S. Cho, "The Stock Forecasting Model using Bidirectional LSTM and Reinforcement Learning," PhD Thesis. Hanyang University. 2021 (in Korean).
  9. D. H. Shin, K. H. Choi, C. B. Kim, "Deep Learning Model for Prediction Rate Improvement of Stock Price Using RNN and LSTM," The Journal of Korean Institute of Information Technology, Vol. 15, No. 10, pp. 9-16, 2017 (in Korean).
  10. S. H. Hong, "A Research on Stock Price Prediction Based on Deep Learning and Economic Indicators," Journal of Digital Convergence Vol. 18. No. 11, pp. 267-272, 2020 (in Korean). https://doi.org/10.14400/JDC.2020.18.11.267
  11. W. Antweiler, M. Z. Frank, "Is all that talk just noise?," The Information Content of Internet Stock Message Board," J Finance Vol. 59, No. 3, pp. 1259- 1294, 2004. https://doi.org/10.1111/j.1540-6261.2004.00662.x
  12. M. Baker, J. Wurgler, "Investor Sentiment and the Cross Section of Stock Returns," The Journal of Finance, Vol. 61, No. 4, pp. 1645-1680, 2006. https://doi.org/10.1111/j.1540-6261.2006.00885.x
  13. M. J. Kim, J. H. Ryu, D. H. Cha, M. K. Sim, "Stock Price Prediction Using Sentiment Analysis: from "Stock Discussion Room" in Naver," The Jounal of Society for e-Business Studies ,Vol. 25, No. 4, pp. 61-75, 2020 (in Korean).
  14. J. G. Koh, G. Y. Lee, I. J. Son, Y. R. Gwon, "Stock Price Index Prediction Program Using Deep Learning Techniques," Proceedings of the Korean Society of Computer Information Conference, pp. 525-526, 2021 (in Korean).
  15. S. H. Hong, "A Study on Stock Price Prediction System Based on Text Mining Method Using LSTM and Stock Market news," Journal of Digital Convergence, Vol. 18, No. 7, pp. 223-228, 2020 (in Korean). https://doi.org/10.14400/JDC.2020.18.7.223
  16. E. Alickovic, A.. Subasi, "Normalized Neural Networks for Breast Cancer Classification," In: International Conference on Medical and Biological Engineering. Springer, Cham, pp. 519-524, 2019.
  17. C. Hutto, E. Gilbert, "Vader: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text," Proceedings of the International AAAI Conference on Web and Social Media. Vol. 8. No. 1, pp. 216-225, Jan. 2015.
  18. J. Devlin, M. W. Chang, K. Lee, K. Toutanova, "Bert: Pre-training of Deep Bidirectional Transformers for Language Understanding," arXiv preprint arXiv:1810.04805, 2018.
  19. D. Araci, "Finbert: Financial Sentiment Analysis with Pre-trained Language Models," arXiv preprint arXiv:1908.10063, 2019.
  20. T. Mikolov, K. Chen, G. Corrado, J. Dean, "Efficient Estimation of Word Representations in Vector Space," arXiv preprint arXiv:1301.3781., 2013.
  21. I. S. Kang, "A Comparative Study on Using SentiWordNet for English Twitter Sentiment Analysis," Journal of Korean Institute of Intelligent Systems, Vol. 23, No. 4, pp. 317-324, 2013 (in Korean). https://doi.org/10.5391/JKIIS.2013.23.4.317
  22. T. Mikolov, K. Chen, G. Corrado, J. Dean, "Efficient Estimation of Word Representations in Vector Space," arXiv preprint arXiv:1301.3781, 2013 (in Korean).
  23. T, Fischer, C. Krauss, "Deep Learning with Long Short-term Memory Networks for Financial Market Predictions," European Journal of Operational Research, Vol. 270, No. 2, pp. 654-669, 2018. https://doi.org/10.1016/j.ejor.2017.11.054
  24. X. Shi, Z. Chen, H. Wang, D. Y. Yeung, "Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting," In: Advances in Neural Information Processing Systems, pp. 802-810, 2015.