Population-size-Aware Policy Optimization for Mean-Field Games
Pengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An

TL;DR
This paper introduces PAPO, a novel method that efficiently generates policies for mean-field games across different population sizes, bridging finite-agent and infinite-agent game theories.
Contribution
The paper proposes PAPO, a population-size-aware policy optimization method that unifies augmentation and hypernetworks, enabling efficient multi-population policy training in mean-field games.
Findings
PAPO outperforms baseline methods in various environments.
The method effectively captures policy evolution with changing population sizes.
Extensive experiments validate the superiority and robustness of PAPO.
Abstract
In this work, we attempt to bridge the two fields of finite-agent and infinite-agent games, by studying how the optimal policies of agents evolve with the number of agents (population size) in mean-field games, an agent-centric perspective in contrast to the existing works focusing typically on the convergence of the empirical distribution of the population. To this end, the premise is to obtain the optimal policies of a set of finite-agent games with different population sizes. However, either deriving the closed-form solution for each game is theoretically intractable, training a distinct policy for each game is computationally intensive, or directly applying the policy trained in a game to other games is sub-optimal. We address these challenges through the Population-size-Aware Policy Optimization (PAPO). Our contributions are three-fold. First, to efficiently generate efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsHyperNetwork
