Population-size-Aware Policy Optimization for Mean-Field Games

Pengdeng Li; Xinrun Wang; Shuxin Li; Hau Chan; Bo An

arXiv:2302.03364·cs.LG·February 8, 2023

Population-size-Aware Policy Optimization for Mean-Field Games

Pengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An

PDF

Open Access 1 Video

TL;DR

This paper introduces PAPO, a novel method that efficiently generates policies for mean-field games across different population sizes, bridging finite-agent and infinite-agent game theories.

Contribution

The paper proposes PAPO, a population-size-aware policy optimization method that unifies augmentation and hypernetworks, enabling efficient multi-population policy training in mean-field games.

Findings

01

PAPO outperforms baseline methods in various environments.

02

The method effectively captures policy evolution with changing population sizes.

03

Extensive experiments validate the superiority and robustness of PAPO.

Abstract

In this work, we attempt to bridge the two fields of finite-agent and infinite-agent games, by studying how the optimal policies of agents evolve with the number of agents (population size) in mean-field games, an agent-centric perspective in contrast to the existing works focusing typically on the convergence of the empirical distribution of the population. To this end, the premise is to obtain the optimal policies of a set of finite-agent games with different population sizes. However, either deriving the closed-form solution for each game is theoretically intractable, training a distinct policy for each game is computationally intensive, or directly applying the policy trained in a game to other games is sub-optimal. We address these challenges through the Population-size-Aware Policy Optimization (PAPO). Our contributions are three-fold. First, to efficiently generate efficient…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Population-size-Aware Policy Optimization for Mean-Field Games· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsHyperNetwork