Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis
Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh

TL;DR
This paper introduces a novel single-loop algorithm, ASAC-MFG, for solving a broad class of mean field games with multiple equilibria, providing finite-time convergence guarantees under minimal assumptions.
Contribution
The work expands the class of solvable mean field games to include those with multiple equilibria and introduces a provably convergent single-loop policy optimization algorithm.
Findings
ASAC-MFG converges to a mean field equilibrium within finite time.
The algorithm is applicable to average-reward MDPs without contraction assumptions.
Theoretical convergence rates match state-of-the-art for MDPs.
Abstract
We consider discrete-time stationary mean field games (MFG) with unknown dynamics and design algorithms for finding the equilibrium with finite-time complexity guarantees. Prior solutions to the problem assume either the contraction of a mean field optimality-consistency operator or strict weak monotonicity, which may be overly restrictive. In this work, we introduce a new class of solvable MFGs, named the "fully herding class", which expands the known solvable class of MFGs and for the first time includes problems with multiple equilibria. We propose a direct policy optimization method, Accelerated Single-loop Actor Critic Algorithm for Mean Field Games (ASAC-MFG), that provably finds a global equilibrium for MFGs within this class, under suitable access to a single trajectory of Markovian samples. Different from the prior methods, ASAC-MFG is single-loop and single-sample-path. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Stochastic processes and financial applications · Risk and Portfolio Optimization
