Generalization in Mean Field Games by Learning Master Policies
Sarah Perrin, Mathieu Lauri\`ere, Julien P\'erolat, Romuald, \'Elie, Matthieu Geist, Olivier Pietquin

TL;DR
This paper introduces the concept of Master policies in Mean Field Games, which can generalize across different initial distributions, enabling scalable multi-agent systems with a single policy that achieves Nash equilibrium.
Contribution
It proposes a novel method to learn Master policies using neural networks and reinforcement learning, demonstrating their effectiveness and generalization in large population MFGs.
Findings
Master policies provide Nash equilibrium for any initial distribution.
The proposed learning method effectively captures and generalizes policies.
Numerical examples show strong performance and generalization capabilities.
Abstract
Mean Field Games (MFGs) can potentially scale multi-agent systems to extremely large populations of agents. Yet, most of the literature assumes a single initial distribution for the agents, which limits the practical applications of MFGs. Machine Learning has the potential to solve a wider diversity of MFG problems thanks to generalizations capacities. We study how to leverage these generalization properties to learn policies enabling a typical agent to behave optimally against any population distribution. In reference to the Master equation in MFGs, we coin the term ``Master policies'' to describe them and we prove that a single Master policy provides a Nash equilibrium, whatever the initial distribution. We propose a method to learn such Master policies. Our approach relies on three ingredients: adding the current population distribution as part of the observation, approximating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research
