Markovian And Non-Markovian Processes with Active Decision Making Strategies For Addressing The COVID-19 Pandemic
Hamid Eftekhari, Debarghya Mukherjee, Moulinath Banerjee, Ya'acov, Ritov

TL;DR
This paper models Covid-19 spread in six US states using Markov decision processes, predicting epidemic evolution and optimal intervention policies, and discusses non-Markovian extensions for future research.
Contribution
It introduces a compartment-based Markov decision process framework for Covid-19 policy optimization and explores non-Markovian modeling as a future direction.
Findings
No lockdowns needed in the test period under optimal policies
Active interventions like mask use and social distancing are crucial
Model predictions align with observed epidemic trends
Abstract
We study and predict the evolution of Covid-19 in six US states from the period May 1 through August 31 using a discrete compartment-based model and prescribe active intervention policies, like lockdowns, on the basis of minimizing a loss function, within the broad framework of partially observed Markov decision processes. For each state, Covid-19 data for 40 days (starting from May 1 for two northern states and June 1 for four southern states) are analyzed to estimate the transition probabilities between compartments and other parameters associated with the evolution of the epidemic. These quantities are then used to predict the course of the epidemic in the given state for the next 50 days (test period) under various policy allocations, leading to different values of the loss function over the training horizon. The optimal policy allocation is the one corresponding to the smallest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Decision Making
