Deep Policy Iteration for High-Dimensional Mean-Field Games with Regenerative Reformulation

Shuixin Fang; Shupeng Wang; Zhen Wu; Hui Zhang; Tao Zhou

arXiv:2604.26782·math.NA·May 18, 2026

Deep Policy Iteration for High-Dimensional Mean-Field Games with Regenerative Reformulation

Shuixin Fang, Shupeng Wang, Zhen Wu, Hui Zhang, Tao Zhou

PDF

TL;DR

This paper introduces a scalable deep policy iteration approach for high-dimensional mean-field games, reformulating the problem as a regenerative process to improve efficiency and handle dimensions up to 10,000.

Contribution

The paper proposes a novel regenerative reformulation and deep policy iteration method that significantly enhances scalability for high-dimensional mean-field games.

Findings

01

Effective handling of dimensions up to 10,000

02

Avoids solving coupled HJB and Fokker-Planck equations

03

Uses particle system and adversarial training for efficiency

Abstract

This paper develops a deep policy iteration method for high-dimensional finite-horizon mean-field games (MFG). We reformulate the game as a regenerative problem with deterministic cycles, which allows policy evaluation (PE), policy improvement (PI), and population measure estimation to be carried out cycle by cycle. Within this formulation, we approximate the population measure by a particle system and update it using a one-step random mapping induced by the Euler-Maruyama discretization of the state dynamics. This update transports a mini-batch of particles from one cycle to the next, avoiding sequential trajectory simulation over the entire time horizon at each iteration. The PE and PI subproblems are formulated through the relation between consecutive cycles, with adversarial training used for evaluation and averaged optimization used for improvement. The resulting method is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.