Learning optimal policies in potential Mean Field Games: Smoothed Policy   Iteration algorithms

Qing Tang; Jiahao Song

arXiv:2212.04791·math.OC·April 18, 2023·SIAM J. Control. Optim.·1 cites

Learning optimal policies in potential Mean Field Games: Smoothed Policy Iteration algorithms

Qing Tang, Jiahao Song

PDF

Open Access

TL;DR

This paper introduces two Smoothed Policy Iteration algorithms for learning Nash equilibria in second order potential Mean Field Games, with proven convergence properties and numerical validation.

Contribution

The paper proposes novel Smoothed Policy Iteration algorithms for potential MFGs, establishing convergence under certain conditions and linking to Fictitious Play.

Findings

01

Global convergence under monotonicity condition

02

Local convergence for multiple solutions

03

Numerical simulations validate theoretical results

Abstract

We introduce two Smoothed Policy Iteration algorithms (\textbf{SPI}s) as rules for learning policies and methods for computing Nash equilibria in second order potential Mean Field Games (MFGs). Global convergence is proved if the coupling term in the MFG system satisfy the Lasry Lions monotonicity condition. Local convergence to a stable solution is proved for system which may have multiple solutions. The convergence analysis shows close connections between \textbf{SPI}s and the Fictitious Play algorithm, which has been widely studied in the MFG literature. Numerical simulation results based on finite difference schemes are presented to supplement the theoretical analysis.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFrequency Control in Power Systems · Economic Policies and Impacts · Magnetic and transport properties of perovskites and related materials