Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework
Runyu Zhang, Gioele Zardini, Asuman Ozdaglar, Jeff Shamma, Na Li

TL;DR
This paper introduces a unified actor-critic framework for multi-agent reinforcement learning that guides the selection of equilibria with desirable properties like high social welfare, extending equilibrium selection principles from normal-form games to stochastic games.
Contribution
It proposes a novel actor-critic method that incorporates equilibrium selection rules, ensuring convergence to equilibria with optimal social welfare in stochastic games.
Findings
The framework achieves potential-maximizing policies in Markov potential games.
It finds Pareto-optimal equilibria in general-sum stochastic games.
Sample-based implementation demonstrates practical applicability.
Abstract
While multi-agent reinforcement learning (MARL) has produced numerous algorithms that converge to Nash or related equilibria, such equilibria are often non-unique and can exhibit widely varying efficiency. This raises a fundamental question: how can one design learning dynamics that not only converge to equilibrium but also select equilibria with desirable performance, such as high social welfare? In contrast to the MARL literature, equilibrium selection has been extensively studied in normal-form games, where decentralized dynamics are known to converge to potential-maximizing or Pareto-optimal Nash equilibria (NEs). Motivated by these results, we study equilibrium selection in finite-horizon stochastic games. We propose a unified actor-critic framework in which a critic learns state-action value functions, and an actor applies a classical equilibrium-selection rule state-wise,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Auction Theory and Applications · Economic theories and models
