TL;DR
This study evaluates various machine learning models, including ensemble methods, for short-term COVID-19 case forecasting in Brazil, finding SVR and stacking ensembles most accurate for 1-6 day predictions.
Contribution
It introduces a comprehensive comparison of multiple forecasting models, including a stacking ensemble, for COVID-19 case prediction in Brazilian states.
Findings
SVR and stacking ensemble outperform other models in accuracy.
Forecasting errors range from 0.87% to 6.90% depending on the horizon.
Models can effectively support public health decision-making.
Abstract
The new Coronavirus (COVID-19) is an emerging disease responsible for infecting millions of people since the first notification until nowadays. Developing efficient short-term forecasting models allow knowing the number of future cases. In this context, it is possible to develop strategic planning in the public health system to avoid deaths. In this paper, autoregressive integrated moving average (ARIMA), cubist (CUBIST), random forest (RF), ridge regression (RIDGE), support vector regression (SVR), and stacking-ensemble learning are evaluated in the task of time series forecasting with one, three, and six-days ahead the COVID-19 cumulative confirmed cases in ten Brazilian states with a high daily incidence. In the stacking learning approach, the cubist, RF, RIDGE, and SVR models are adopted as base-learners and Gaussian process (GP) as meta-learner. The models' effectiveness is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGaussian Process
