Dynamic Noises of Multi-Agent Environments Can Improve Generalization: Agent-based Models meets Reinforcement Learning

Mohamed Akrout; Amal Feriani; Bob McLeod

arXiv:2204.14076·cs.MA·May 2, 2022

Open Access

TL;DR

This paper shows empirically that the inherent stochasticity of agent-based models (ABMs) used as reinforcement learning (RL) environments can improve how well RL agents generalize, using control of an SIR epidemic as the test case.

Contribution

It introduces the idea that the non-deterministic dynamics of ABMs can be beneficial for RL generalization, supported by empirical evidence from epidemic control simulations.

Findings

01

ABM-based SIR environments yield a higher average reward for the RL agent than environments based on differential equations.

02

The intrinsic noise of ABM dynamics lets the RL agent generalize over a wider range of epidemic parameters.

03

ABMs offer microfoundational insights despite higher computational costs.

Abstract

We study the benefits of reinforcement learning (RL) environments based on agent-based models (ABMs). While ABMs are known to offer microfoundational simulations at the cost of computational complexity, we empirically show in this work that their non-deterministic dynamics can improve the generalization of RL agents. To this end, we examine the control of epidemic SIR environments based on either differential equations or ABMs. Numerical simulations demonstrate that the intrinsic noise in the ABM-based dynamics of the SIR model not only improves the average reward but also allows the RL agent to generalize over a wider range of epidemic parameters.
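
To make the contrast in the abstract concrete, the following is a minimal sketch of the two kinds of SIR dynamics it mentions: a deterministic differential-equation update and a stochastic agent-based update. It is not the authors' implementation. The forward-Euler discretization, the well-mixed contact assumption, the function names (sir_ode_step, sir_abm_step, simulate_ode, simulate_abm), and the parameter values (beta = 0.3, gamma = 0.1, 1000 agents) are all assumptions chosen for illustration.

```python
import random


def sir_ode_step(s, i, r, beta, gamma, dt=1.0):
    """One forward-Euler step of the deterministic SIR equations.

    s, i, r are population fractions that sum to 1 (illustrative discretization).
    """
    new_inf = beta * s * i * dt   # fraction moving from S to I
    new_rec = gamma * i * dt      # fraction moving from I to R
    return s - new_inf, i + new_inf - new_rec, r + new_rec


def sir_abm_step(states, beta, gamma, rng):
    """One step of a well-mixed agent-based SIR model (an assumed, simplified ABM).

    states is a list of 'S', 'I', or 'R', one entry per agent.
    """
    frac_i = states.count("I") / len(states)
    p_inf = beta * frac_i  # per-step infection probability of a susceptible agent
    nxt = []
    for st in states:
        if st == "S" and rng.random() < p_inf:
            nxt.append("I")
        elif st == "I" and rng.random() < gamma:
            nxt.append("R")
        else:
            nxt.append(st)
    return nxt


def simulate_ode(beta, gamma, i0=0.01, steps=100):
    """Return the peak infected fraction of the deterministic model."""
    s, i, r = 1.0 - i0, i0, 0.0
    peak = i
    for _ in range(steps):
        s, i, r = sir_ode_step(s, i, r, beta, gamma)
        peak = max(peak, i)
    return peak


def simulate_abm(beta, gamma, n=1000, n_infected=10, steps=100, seed=0):
    """Return the peak infected fraction of one stochastic ABM run."""
    rng = random.Random(seed)
    states = ["I"] * n_infected + ["S"] * (n - n_infected)
    peak = n_infected / n
    for _ in range(steps):
        states = sir_abm_step(states, beta, gamma, rng)
        peak = max(peak, states.count("I") / n)
    return peak


if __name__ == "__main__":
    # The ODE gives one trajectory per (beta, gamma); the ABM gives a
    # different trajectory for each random seed.
    print("ODE peak:", round(simulate_ode(0.3, 0.1), 3))
    print("ABM peaks:", [round(simulate_abm(0.3, 0.1, seed=k), 3) for k in range(5)])
```

In this sketch the differential-equation model returns the same trajectory every time for fixed parameters, while the agent-based model returns a seed-dependent one. An RL agent trained on the latter would see a spread of epidemic curves for the same nominal parameters, which is the kind of intrinsic noise the abstract credits with better generalization.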

Taxonomy

Topics: COVID-19 epidemiological studies · Mathematical and Theoretical Epidemiology and Ecology Models · Complex Systems and Time Series Analysis