Learning Density-Based Correlated Equilibria for Markov Games
Libo Zhang, Yang Chen, Toru Takisaka, Bakh Khoussainov, Michael, Witbrock, and Jiamou Liu

TL;DR
This paper introduces Density-Based Correlated Equilibria (DBCE), a novel approach that explicitly incorporates state density requirements into the equilibrium concept for Markov games, improving coordination with safety and fairness considerations.
Contribution
It proposes a new equilibrium concept, DBCE, and a corresponding policy iteration algorithm to explicitly meet state density constraints in multi-agent systems.
Findings
DBCE effectively incorporates state density requirements.
The proposed algorithm outperforms existing methods in relevant scenarios.
Experiments demonstrate improved safety and fairness in multi-agent coordination.
Abstract
Correlated Equilibrium (CE) is a well-established solution concept that captures coordination among agents and enjoys good algorithmic properties. In real-world multi-agent systems, in addition to being in an equilibrium, agents' policies are often expected to meet requirements with respect to safety, and fairness. Such additional requirements can often be expressed in terms of the state density which measures the state-visitation frequencies during the course of a game. However, existing CE notions or CE-finding approaches cannot explicitly specify a CE with particular properties concerning state density; they do so implicitly by either modifying reward functions or using value functions as the selection criteria. The resulting CE may thus not fully fulfil the state-density requirements. In this paper, we propose Density-Based Correlated Equilibria (DBCE), a new notion of CE that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Game Theory and Applications
