Learning in Herding Mean Field Games: Single-Loop Algorithm with   Finite-Time Convergence Analysis

Sihan Zeng; Sujay Bhatt; Alec Koppel; Sumitra Ganesh

arXiv:2408.04780·math.OC·February 13, 2025·AISTATS

Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis

Sihan Zeng, Sujay Bhatt, Alec Koppel, Sumitra Ganesh

PDF

Open Access

TL;DR

This paper introduces a novel single-loop algorithm, ASAC-MFG, for solving a broad class of mean field games with multiple equilibria, providing finite-time convergence guarantees under minimal assumptions.

Contribution

The work expands the class of solvable mean field games to include those with multiple equilibria and introduces a provably convergent single-loop policy optimization algorithm.

Findings

01

ASAC-MFG converges to a mean field equilibrium within finite time.

02

The algorithm is applicable to average-reward MDPs without contraction assumptions.

03

Theoretical convergence rates match state-of-the-art for MDPs.

Abstract

We consider discrete-time stationary mean field games (MFG) with unknown dynamics and design algorithms for finding the equilibrium with finite-time complexity guarantees. Prior solutions to the problem assume either the contraction of a mean field optimality-consistency operator or strict weak monotonicity, which may be overly restrictive. In this work, we introduce a new class of solvable MFGs, named the "fully herding class", which expands the known solvable class of MFGs and for the first time includes problems with multiple equilibria. We propose a direct policy optimization method, Accelerated Single-loop Actor Critic Algorithm for Mean Field Games (ASAC-MFG), that provably finds a global equilibrium for MFGs within this class, under suitable access to a single trajectory of Markovian samples. Different from the prior methods, ASAC-MFG is single-loop and single-sample-path. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Stochastic processes and financial applications · Risk and Portfolio Optimization