Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning

Xueqiao Peng; Andrew Perrault

arXiv:2603.19397·cs.LG·May 4, 2026

Optimizing Resource-Constrained Non-Pharmaceutical Interventions for Multi-Cluster Outbreak Control Using Hierarchical Reinforcement Learning

Xueqiao Peng, Andrew Perrault

PDF

TL;DR

This paper introduces a hierarchical reinforcement learning approach to optimize resource allocation for non-pharmaceutical interventions across multiple outbreak clusters, improving control effectiveness and scalability.

Contribution

It formulates the resource allocation problem as a constrained restless multi-armed bandit and develops a hierarchical RL framework that outperforms existing baselines in realistic simulations.

Findings

01

Outperforms RMAB-inspired and heuristic baselines by 20%-30% in outbreak control.

02

Scalable to up to 40 clusters with faster decision-making.

03

Effective across various system scales and testing budgets.

Abstract

Non-pharmaceutical interventions (NPIs), such as diagnostic testing and quarantine, are crucial for controlling infectious disease outbreaks but are often constrained by limited resources, particularly in early outbreak stages. In real-world public health settings, resources must be allocated across multiple outbreak clusters that emerge asynchronously, vary in size and risk, and compete for a shared resource budget. Here, a cluster corresponds to a group of close contacts generated by a single infected index case. Thus, decisions must be made under uncertainty and heterogeneous demands, while respecting operational constraints. We formulate this problem as a constrained restless multi-armed bandit and propose a hierarchical reinforcement learning framework. A global controller learns a continuous action cost multiplier that adjusts global resource demand, while a generalized local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.