Domain-driven Metrics for Reinforcement Learning: A Case Study on Epidemic Control using Agent-based Simulation

Rishabh Gaur; Gaurav Deshkar; Jayanta Kshirsagar; Harshal Hayatnagarkar; Janani Venugopalan

arXiv:2508.05154·cs.LG·August 11, 2025

Domain-driven Metrics for Reinforcement Learning: A Case Study on Epidemic Control using Agent-based Simulation

Rishabh Gaur, Gaurav Deshkar, Jayanta Kshirsagar, Harshal Hayatnagarkar, Janani Venugopalan

PDF

Open Access

TL;DR

This paper introduces domain-driven metrics for reinforcement learning applied to agent-based epidemic models, demonstrating their effectiveness in evaluating policies like masking, vaccination, and lockdown in pandemic simulations.

Contribution

It develops and applies domain-specific metrics for RL in complex, stochastic agent-based models, enhancing performance assessment in epidemic control scenarios.

Findings

01

Domain-driven rewards improve policy evaluation.

02

Metrics effectively differentiate simulation scenarios.

03

RL policies show varied effectiveness based on domain metrics.

Abstract

For the development and optimization of agent-based models (ABMs) and rational agent-based models (RABMs), optimization algorithms such as reinforcement learning are extensively used. However, assessing the performance of RL-based ABMs and RABMS models is challenging due to the complexity and stochasticity of the modeled systems, and the lack of well-standardized metrics for comparing RL algorithms. In this study, we are developing domain-driven metrics for RL, while building on state-of-the-art metrics. We demonstrate our ``Domain-driven-RL-metrics'' using policy optimization on a rational ABM disease modeling case study to model masking behavior, vaccination, and lockdown in a pandemic. Our results show the use of domain-driven rewards in conjunction with traditional and state-of-the-art metrics for a few different simulation scenarios such as the differential availability of masks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics · Complex Systems and Decision Making · Innovation Diffusion and Forecasting