DOI QR코드

DOI QR Code

코로나 확진자 수 예측을 위한 BI-LSTM과 GRU 알고리즘의 성능 비교 분석

Comparative analysis of performance of BI-LSTM and GRU algorithm for predicting the number of Covid-19 confirmed cases

  • Kim, Jae-Ho (Department of Computer Science, The University of Suwon) ;
  • Kim, Jang-Young (Department of Computer Science, The University of Suwon)
  • 투고 : 2021.11.04
  • 심사 : 2021.11.18
  • 발행 : 2022.02.28

초록

위드 코로나의 예정 발표일이 결정되었고, 위드 코로나에 가장 중요한 조건인 백신 접종을 아직 부작용 걱정 때문에 완료하지 않은 사람들이 있다. 또한 위드 코로나로 경제는 회복될 수 있지만 감염자 수는 급증할 수 있다. 본 논문은 위드 코로나에 앞서 코로나19에 대한 경각심을 깨우고자, 코로나19를 비선형 확률과정으로 예측한다. 여기서 딥러닝의 RNN중 양방향 LSTM인 BI-LSTM와 LSTM보다 gate수를 줄인 GRU를 사용하고 이것을 train set, test set, 손실함수, 잔차분석, 정규분포, 자기 상관을 통해서 비교 분석하여 어떠한 성능이 더 좋은지 비교하고 예측한다.

Even the announcing date for the staring date of "With Corona" has been decided, still many people have not completed vaccination, the most important condition for starting the With Corona, because of concerns for its side effects. In addition, although the economy may can be recovered by the With Corona, but the number of infected people may can be surged. In this paper, in order to awaken the people for the awareness of Corona 19 in advance of the With Corona, the Corona 19 is predicted through a non-linear probability process. Here, among the deep learning RNN, BI-LSTM, which is a bidirectional LSTM, and GRU, gates decreased than LSTM have been used. And this has been compared and analyzed through train set, test set, loss function, residual analysis, normal distribution, and autocorrelation, and compared and predicted for which has a better performance.

키워드

참고문헌

  1. Centers for Disease Control and Prevention. Ensuring COVID-19 Vaccines Work [Internet]. Available: https://korean.cdc.gov/coronavirus/2019-ncov/vaccines/effectiveness.html.
  2. Centers for Disease Control and Prevention. Effectiveness of Pfizer-BioNTech and Moderna Vaccines Against COVID-19 [Internet]. Available: https://www.cdc.gov/mmwr/volumes/70/wr/mm7018e1.htm?s_cid=mm7018e1_w.
  3. J. H. Kim and J. Y. Kim, "Covid19 trends predictions using time series data," Journal of the Korea Institute of Information and Communication Engineering, vol. 25, no. 7, pp.884-889, Dec. 2021. https://doi.org/10.6109/JKIICE.2021.25.7.884
  4. R. Jozefowicz, W. Zaremba, and I. Sutskever, "An Empirical Exploration of Recurrent Network Architectures," in Proceedings of the 32nd International Conference on International Conference on Machine Learning, vol. 37, pp. 2342-2350, Jul. 2015.
  5. F. A. Gers, J. Schmidhuber, and F. Cummins, "Learning to Forget: Continual Prediction with LSTM," Neural Computation, vol. 12, no. 10, pp. 2451-2471, Oct. 2000. https://doi.org/10.1162/089976600300015015
  6. K. Clark, M. T. Luong, C. D. Manning, and Q. V. Le "Semi-Supervised Sequence Modeling with Cross-View Training," in Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp. 1914-1925, Oct. 2018.
  7. S. J. Ko, H. Y. Yun, and D. M. Shin, "Electronic Demand Data Prediction using Bidirectional Long Short Term Memory Networks," Journal of Korea Software Appraisal Association, vol. 14, no. 1, pp. 33-40, 2018.
  8. J. Chung, C. Gulcehre, K. H. Cho, and Y. Bengio, "Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling," in NIPS 2014 Deep Learning and Representation Learning Workshop, Dec. 2014.
  9. D. Britz, A. Goldie, M. T. Luong, and Q. Le, "Massive Exploration of Neural Machine Translation Architectures," in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 1442-1451, Sep. 2017.
  10. Kaggle data set. covid19-data-from-john-hopkins-university [Internet]. Available: https://www.kaggle.com/antgoldbloom/covid19-data-from-john-hopkins-university.