Reward Engineering for Spatial Epidemic Simulations: A Reinforcement Learning Platform for Individual Behavioral Learning

Radman Rakhshandehroo; Daniel Coombs

arXiv:2511.18000·cs.LG·March 26, 2026

Open Access

TL;DR

ContagionRL is a reinforcement learning platform for systematic reward engineering in spatial epidemic simulations, enabling analysis of how reward design influences agent behavior and survival strategies under diverse conditions.

Contribution

We introduce ContagionRL, a modular platform that allows rigorous evaluation of reward functions in spatial epidemic models, highlighting the impact of reward design on learned behaviors.

Findings

01. Potential field rewards outperform other designs in agent survival.

02. Directional guidance and adherence incentives are key for robust policy learning.

03. Reward choice significantly affects agent behavior and epidemic outcomes.

Abstract

We present ContagionRL, a Gymnasium-compatible reinforcement learning platform specifically designed for systematic reward engineering in spatial epidemic simulations. Unlike traditional agent-based models that rely on fixed behavioral rules, our platform enables rigorous evaluation of how reward function design affects learned survival strategies across diverse epidemic scenarios. ContagionRL integrates a spatial SIRS+D epidemiological model with configurable environmental parameters, allowing researchers to stress-test reward functions under varying conditions including limited observability, different movement patterns, and heterogeneous population dynamics. We evaluate five distinct reward designs, ranging from sparse survival bonuses to a novel potential field approach, across multiple RL algorithms (PPO, SAC, A2C). Through systematic ablation studies, we identify that directional…
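To make the idea of a potential field reward concrete, the following is a minimal sketch and not ContagionRL's actual implementation. It assumes a 2D continuous space, a repulsive potential that grows as infected individuals get closer, and a shaping term computed as the discounted change in potential between consecutive positions. The function names `repulsive_potential` and `shaping_reward`, the inverse-distance form, and the parameters `gamma` and `eps` are all hypothetical choices made for illustration.

```python
import numpy as np


def repulsive_potential(agent_pos, infected_pos, eps=1e-6):
    """Hypothetical potential: more negative the closer infected individuals are."""
    agent = np.asarray(agent_pos, dtype=float)
    infected = np.asarray(infected_pos, dtype=float)
    if infected.size == 0:
        # No infected individuals in view: flat potential.
        return 0.0
    infected = infected.reshape(-1, 2)
    # Euclidean distance from the agent to each infected individual.
    dists = np.linalg.norm(infected - agent, axis=1)
    # Inverse-distance repulsion (an assumed form, not taken from the paper).
    return -float(np.sum(1.0 / (dists + eps)))


def shaping_reward(prev_pos, next_pos, infected_pos, gamma=0.99):
    """Discounted change in potential: positive when the move lowers exposure."""
    phi_prev = repulsive_potential(prev_pos, infected_pos)
    phi_next = repulsive_potential(next_pos, infected_pos)
    return gamma * phi_next - phi_prev


if __name__ == "__main__":
    infected = [[1.0, 0.0]]
    # Stepping away from the infected individual versus stepping toward it.
    print(shaping_reward([0.0, 0.0], [-1.0, 0.0], infected))
    print(shaping_reward([-1.0, 0.0], [0.0, 0.0], infected))
```

Under these assumptions, the sign of the reward gives the agent a directional signal at every step, in contrast to a sparse survival bonus that only arrives at the end of an episode. In a Gymnasium-style environment, a term like this would typically be added to the value returned as the reward from `step`.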

Taxonomy

Topics: COVID-19 epidemiological studies · Reinforcement Learning in Robotics · Digital Mental Health Interventions