DOI QR코드

DOI QR Code

A Machine Learning Univariate Time series Model for Forecasting COVID-19 Confirmed Cases: A Pilot Study in Botswana

  • Received : 2021.12.05
  • Published : 2022.01.30

Abstract

The recent outbreak of corona virus (COVID-19) infectious disease had made its forecasting critical cornerstones in most scientific studies. This study adopts a machine learning based time series model - Auto Regressive Integrated Moving Average (ARIMA) model to forecast COVID-19 confirmed cases in Botswana over 60 days period. Findings of the study show that COVID-19 confirmed cases in Botswana are steadily rising in a steep upward trend with random fluctuations. This trend can also be described effectively using an additive model when scrutinized in Seasonal Trend Decomposition method by Loess. In selecting the best fit ARIMA model, a Grid Search Algorithm was developed with python language and was used to optimize an Akaike Information Criterion (AIC) metric. The best fit ARIMA model was determined at ARIMA (5, 1, 1), which depicted the least AIC score of 3885.091. Results of the study proved that ARIMA model can be useful in generating reliable and volatile forecasts that can used to guide on understanding of the future spread of infectious diseases or pandemics. Most significantly, findings of the study are expected to raise social awareness to disease monitoring institutions and government regulatory bodies where it can be used to support strategic health decisions and initiate policy improvement for better management of the COVID-19 pandemic.

Keywords

References

  1. R. Singla, A. Mishra, and R. et al. Joshi, "Human animal interface of SARS-CoV-2 (COVID-19) transmission: a critical appraisal of scientific evidence," Springer Nature Switzerland, pp. 119-130 (2020).DOI:10.1007/s11259-020-09781-0, 2020.
  2. BBC News. (2020, December) Covid-19 pandemic: Tracking the global coronavirus outbreak. [Online]. https://www.bbc.com/news/world-51235105
  3. Our World in Data. (2020, January) Coronavirus Pandemic (COVID-19).[Online]. https://ourworldindata.org/coronavirus#citation
  4. V. Vara. (2020, April) Latest Analysis. [Online]. https://www.pharmaceuticaltechnology.com/features/coronavirus-outbreak-thecountries-affected/
  5. Worldometer. (2021, Aug.) Worldometer. [Online]. https://www.worldometers.info/coronavirus/
  6. WHO. (2021, January) WHO Coronavirus Disease (COVID-19) Dashboard. [Online]. https://covid19.who.int/
  7. IWK Health Center. (2021, July) Library Services. [Online]. https://library.nshealth.ca/COVID19Research/Publications
  8. S. H. Hodgson, K. Mansatta, and Mallet, G. et al, "What defines an efficacious COVID-19 vaccine? A review of the challenges assessing the clinical efficacy of vaccines against SARS-CoV-2," The Lancert,. DOI:10.1016/S1473-3099(20)30773-8, 2020.
  9. Z. Ceylan, "Estimation of COVID-19 prevalence in Italy, Spain, and France," PMC US National Library of Medicine National Institute of Health, DOI: 10.1016/j.scitotenv.2020.138817, 2020.
  10. L. Jia, K. Li, Y. Jiang, X. Guo, and T. zhao, "Prediction and analysis of Coronavirus Disease 2019," NASA Astrophysics Data System, [Online].https://arxiv.org/ftp/arxiv/papers/2003/2003.05447.pdf, 2020.
  11. A. Steptoe and L. Poole, Infectious Diseases: Psychosocial Aspects.: International Encyclopedia of the Social & Behavioral Sciences (Second Edition), 2015.
  12. World Health Organization. (2021, January) WHO Health Topic Page: Zoonoses. [Online]. https://www.who.int/topics/zoonoses/en/
  13. V. Chaurasia and S. Pal, "Application of machine learning time series analysis for prediction COVID-19 pandemic," Springer. DOI:10.1007/s42600-020-00105-4, 2020.
  14. K. A. Abuhasel, M. Khadr, and M. M. Alquraish, "Analyzing and forecasting COVID-19 pandemic in the Kingdom of Saudi Arabia using ARIMA SIR models," Wiley - Computational Intelligence, DOI: 10.1111/coin.12407, 2020.
  15. S. Chae, S. Kwon, and D. Lee, "Predicting Infectious Disease Using Deep Learning and Big Data," International Journal of Environmental Research and Public Health. DOI:10.3390/ijerph15081596, 2018.
  16. M Wang, H. Wang, J. Wang, and Lui et. al, "A novel model for malaria prediction based on ensemble algorithms," PLoS ONE, 14(12). DOI:10.1371/journal.pone.0226910, 2019.
  17. Y. Wang, C. Xu, S. Zhang, and Yang et. al, "Development and evaluation of a deep learning approach formodeling seasonality and trends in hand-foot-mouth disease incidence in mainland China," Springer Nature, 2019.
  18. Johns Hopkins University of Medicine. (2021, January) COVID-19 Dashboards by the Centerfor System Science and Engineering (CSSE) at Johns Hopkisn University. [Online]. https://coronavirus.jhu.edu/map.html
  19. Github. (2021, January) Github. [Online]. https://github.com/CSSEGISandData/COVID-19
  20. AccuWeather. (2021, January) Botswana Weather. [Online]. https://www.accuweather.com/en/bw/national/covid-19
  21. B. Fanoodi, B. Malmir, and F. F. Jahantigh, "Reducing demand uncertainty in platelet supply chain through artificial neural networks and ARIMA models," Elsevier Computers in Biology and Medicine.DOI: 10.1016/j.compbiomed.2019.103415, 2019.
  22. G.E.P. Box and G.M. Jenkins, Time Series Analysis: Forecasting and Control, Revised Edition. San Francisco: Holden Day, 1976.
  23. J. Fattah, L. Ezzine, and Z. Aman, "Forecasting of demand using ARIMA model ," Internationa Journal of Business Management, DOI:10.1177/1847979018808673, 2018.
  24. Z. Malkia, E Atlamb, E. Ewisc, and G etal Dagnewe, "ARIMA Models for Predicting the End of COVID-19 Pandemic and the Risk of a Second Rebound," Research square, DOI:10.21203/rs.3.rs-34702/v1, 2020.
  25. L. H. Koopmans, Multivariate Spectral Models and Their Applications. Science Direct - Elsevier, 1995.