Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic   Mean-Field Games

Zuyue Fu; Zhuoran Yang; Yongxin Chen; Zhaoran Wang

arXiv:1910.07498·math.OC·October 17, 2019·19 cites

Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games

Zuyue Fu, Zhuoran Yang, Yongxin Chen, Zhaoran Wang

PDF

Open Access

TL;DR

This paper introduces a model-free reinforcement learning algorithm that provably finds Nash equilibria in linear-quadratic mean-field games with infinite agents, using an actor-critic approach with linear function approximation.

Contribution

It provides the first provably convergent, model-free RL method for discrete-time mean-field Markov games with linear-quadratic structure.

Findings

01

Algorithm converges to Nash equilibrium at a linear rate

02

First application of model-free RL with convergence guarantees in this setting

03

Proposes a practical actor-critic method without needing the model of dynamics

Abstract

We study discrete-time mean-field Markov games with infinite numbers of agents where each agent aims to minimize its ergodic cost. We consider the setting where the agents have identical linear state transitions and quadratic cost functions, while the aggregated effect of the agents is captured by the population mean of their states, namely, the mean-field state. For such a game, based on the Nash certainty equivalence principle, we provide sufficient conditions for the existence and uniqueness of its Nash equilibrium. Moreover, to find the Nash equilibrium, we propose a mean-field actor-critic algorithm with linear function approximation, which does not require knowing the model of dynamics. Specifically, at each iteration of our algorithm, we use the single-agent actor-critic algorithm to approximately obtain the optimal policy of the each agent given the current mean-field state, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research