TL;DR
This paper presents a novel method to approximate multi-agent reinforcement learning dynamics in finite-state Markov games by rescaling learning rates and update frequencies, leading to a deterministic ODE approximation.
Contribution
It introduces a new rescaling approach for analyzing multi-agent learning in Markov games, proving convergence to an ODE under mild assumptions.
Findings
Rescaled learning process converges to an ODE.
Provides a deterministic approximation of complex learning dynamics.
Framework implementation available online.
Abstract
This paper introduces a new approach for approximating the learning dynamics of multiple reinforcement learning (RL) agents interacting in a finite-state Markov game. The idea is to rescale the learning process by simultaneously reducing the learning rate and increasing the update frequency, effectively treating the agent's parameters as a slow-evolving variable influenced by the fast-mixing game state. Under mild assumptions-ergodicity of the state process and continuity of the updates-we prove the convergence of this rescaled process to an ordinary differential equation (ODE). This ODE provides a tractable, deterministic approximation of the agent's learning dynamics. An implementation of the framework is available at\,: https://github.com/yannKerzreho/MarkovGameApproximation
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
