Recurrent Structural Policy Gradient for Partially Observable Mean Field Games

Clarisse Wibault; Johannes Forkel; Sebastian Towers; Tiphaine Wibault; Juan Duque; George Whittle; Andreas Schaab; Yucheng Yang; Chiyuan Wang; Michael Osborne; Benjamin Moll; Jakob Foerster

arXiv:2602.20141·cs.AI·February 24, 2026

Recurrent Structural Policy Gradient for Partially Observable Mean Field Games

Clarisse Wibault, Johannes Forkel, Sebastian Towers, Tiphaine Wibault, Juan Duque, George Whittle, Andreas Schaab, Yucheng Yang, Chiyuan Wang, Michael Osborne, Benjamin Moll, Jakob Foerster

PDF

Open Access

TL;DR

This paper introduces RSPG, a novel history-aware hybrid structural method for partially observable mean field games, achieving faster convergence and enabling complex macroeconomic modeling with heterogeneous agents and common noise.

Contribution

The paper presents RSPG, the first history-aware hybrid structural method for partially observable MFGs, and MFAX, a framework that leverages known dynamics for improved performance.

Findings

01

RSPG achieves state-of-the-art performance.

02

RSPG converges an order of magnitude faster.

03

Successfully models macroeconomics MFG with heterogeneity and common noise.

Abstract

Mean Field Games (MFGs) provide a principled framework for modeling interactions in large population models: at scale, population dynamics become deterministic, with uncertainty entering only through aggregate shocks, or common noise. However, algorithmic progress has been limited since model-free methods are too high variance and exact methods scale poorly. Recent Hybrid Structural Methods (HSMs) use Monte Carlo rollouts for the common noise in combination with exact estimation of the expected return, conditioned on those samples. However, HSMs have not been scaled to Partially Observable settings. We propose Recurrent Structural Policy Gradient (RSPG), the first history-aware HSM for settings involving public information. We also introduce MFAX, our JAX-based framework for MFGs. By leveraging known transition dynamics, RSPG achieves state-of-the-art performance as well as an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Time Series Analysis · Opinion Dynamics and Social Influence · Advanced Causal Inference Techniques