Equilibrium Selection for Multi-agent Reinforcement Learning: A Unified Framework

Runyu Zhang; Gioele Zardini; Asuman Ozdaglar; Jeff Shamma; Na Li

arXiv:2406.08844 · cs.GT · January 29, 2026

Open Access

TL;DR

This paper introduces a unified actor-critic framework for multi-agent reinforcement learning that steers learning toward equilibria with desirable properties, such as high social welfare, by extending equilibrium-selection principles from normal-form games to finite-horizon stochastic games.

Contribution

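It proposes an actor-critic method in which a critic learns state-action value functions and an actor applies a classical equilibrium-selection rule state-wise, so that learning converges to equilibria with desirable performance, such as high social welfare, in finite-horizon stochastic games.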

Findings

01 The framework achieves potential-maximizing policies in Markov potential games.

02 It finds Pareto-optimal equilibria in general-sum stochastic games.

03 Sample-based implementation demonstrates practical applicability.

Abstract

While multi-agent reinforcement learning (MARL) has produced numerous algorithms that converge to Nash or related equilibria, such equilibria are often non-unique and can exhibit widely varying efficiency. This raises a fundamental question: how can one design learning dynamics that not only converge to equilibrium but also select equilibria with desirable performance, such as high social welfare? In contrast to the MARL literature, equilibrium selection has been extensively studied in normal-form games, where decentralized dynamics are known to converge to potential-maximizing or Pareto-optimal Nash equilibria (NEs). Motivated by these results, we study equilibrium selection in finite-horizon stochastic games. We propose a unified actor-critic framework in which a critic learns state-action value functions, and an actor applies a classical equilibrium-selection rule state-wise,…
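As a rough illustration of what an equilibrium-selection rule from the normal-form literature can look like, the sketch below runs log-linear learning on a single-state, two-player identical-interest game. Everything here is an assumption made for illustration: the abstract as shown is truncated and does not name a specific rule, the payoff table `PAYOFF` is hypothetical, and `q_value` simply returns exact payoffs in place of a learned critic. It is not the authors' algorithm, only a minimal instance of the "actor applies a selection rule to Q-values" idea in the one-state case.

```python
import math
import random

# Hypothetical 2x2 identical-interest game: both players receive PAYOFF[a1][a2].
# (0, 0) and (1, 1) are both pure Nash equilibria; (1, 1) has the higher payoff,
# so it is the potential-maximizing (and welfare-maximizing) equilibrium.
PAYOFF = [[1.0, 0.0],
          [0.0, 2.0]]


def q_value(player, own_action, other_action):
    """Stand-in for a learned critic: the player's payoff for a joint action."""
    if player == 0:
        joint = (own_action, other_action)
    else:
        joint = (other_action, own_action)
    return PAYOFF[joint[0]][joint[1]]


def log_linear_step(joint, beta, rng):
    """One log-linear learning update (assumed example rule).

    A uniformly random player resamples its action from a softmax over its
    Q-values, holding the other player's current action fixed.
    """
    player = rng.randrange(2)
    other_action = joint[1 - player]
    weights = [math.exp(beta * q_value(player, a, other_action)) for a in (0, 1)]
    new_action = rng.choices((0, 1), weights=weights)[0]
    updated = list(joint)
    updated[player] = new_action
    return tuple(updated)


def empirical_frequencies(steps, beta, seed=0):
    """Run the dynamics from the inferior equilibrium (0, 0) and count visits."""
    rng = random.Random(seed)
    joint = (0, 0)
    counts = {}
    for _ in range(steps):
        joint = log_linear_step(joint, beta, rng)
        counts[joint] = counts.get(joint, 0) + 1
    return {action: count / steps for action, count in counts.items()}


if __name__ == "__main__":
    freqs = empirical_frequencies(steps=50_000, beta=3.0)
    for action, freq in sorted(freqs.items()):
        print(action, round(freq, 3))
```

With a moderately large `beta`, the dynamics spend most of their time at the higher-payoff equilibrium (1, 1) even though they start at (0, 0), which is the kind of selection behavior the abstract attributes to decentralized dynamics in normal-form games. In a stochastic game, the same step would be applied per state using the critic's Q-values for that state.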

Taxonomy

Topics: Supply Chain and Inventory Management · Auction Theory and Applications · Economic theories and models