Convergence of Actor-Critic Learning for Mean Field Games and Mean Field Control in Continuous Spaces

Jean-Pierre Fouque; Mathieu Lauri\`ere; Mengrui Zhang

arXiv:2511.06812·math.OC·November 11, 2025

Convergence of Actor-Critic Learning for Mean Field Games and Mean Field Control in Continuous Spaces

Jean-Pierre Fouque, Mathieu Lauri\`ere, Mengrui Zhang

PDF

Open Access

TL;DR

This paper proves the convergence of a deep actor-critic reinforcement learning algorithm for continuous-space mean field games and control problems, extending theoretical guarantees and demonstrating results on linear-quadratic examples.

Contribution

It establishes convergence of the actor-critic algorithm in continuous spaces for MFG and MFC, including discretization techniques and extensions to control games.

Findings

01

Convergence proven for continuous state and action spaces.

02

Numerical experiments on linear-quadratic problems confirm theoretical results.

03

Extension of convergence analysis to mean field control games.

Abstract

We establish the convergence of the deep actor-critic reinforcement learning algorithm presented in [Angiuli et al., 2023a] in the setting of continuous state and action spaces with an infinite discrete-time horizon. This algorithm provides solutions to Mean Field Game (MFG) or Mean Field Control (MFC) problems depending on the ratio between two learning rates: one for the value function and the other for the mean field term. In the MFC case, to rigorously identify the limit, we introduce a discretization of the state and action spaces, following the approach used in the finite-space case in [Angiuli et al., 2023b]. The convergence proofs rely on a generalization of the two-timescale framework introduced in [Borkar, 1997]. We further extend our convergence results to Mean Field Control Games, which involve locally cooperative and globally competitive populations. Finally, we present…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Reinforcement Learning in Robotics · Extremum Seeking Control Systems