Calibration of Shared Equilibria in General Sum Partially Observable Markov Games
Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

TL;DR
This paper introduces the concept of Shared equilibrium in multi-agent systems with shared policies, proves convergence properties, and develops a dual-Reinforcement Learning method to calibrate these equilibria to real-world data, demonstrated on a market example.
Contribution
It formally defines Shared equilibrium, proves convergence for certain games, and presents a novel calibration method using dual-Reinforcement Learning for multi-agent systems.
Findings
Shared equilibrium is a symmetric pure Nash equilibrium in a Functional Form Game.
The proposed calibration method effectively matches emergent behaviors to external targets.
Application to a market example demonstrates practical scalability and behavior differentiation.
Abstract
Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems. We consider a general sum partially observable Markov game where agents of different types share a single policy network, conditioned on agent-specific information. This paper aims at i) formally understanding equilibria reached by such agents, and ii) matching emergent phenomena of such equilibria to real-world targets. Parameter sharing with decentralized execution has been introduced as an efficient way to train multiple agents using a single policy network. However, the nature of resulting equilibria reached by such agents has not been yet studied: we introduce the novel concept of Shared equilibrium as a symmetric pure Nash equilibrium of a certain Functional Form Game (FFG) and prove convergence to the latter for a certain class of games using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGame Theory and Applications · Reinforcement Learning in Robotics · Optimization and Search Problems
