Calibration of Shared Equilibria in General Sum Partially Observable   Markov Games

Nelson Vadori; Sumitra Ganesh; Prashant Reddy; Manuela Veloso

arXiv:2006.13085·cs.MA·October 26, 2020·6 cites

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

PDF

Open Access 1 Video

TL;DR

This paper introduces the concept of Shared equilibrium in multi-agent systems with shared policies, proves convergence properties, and develops a dual-Reinforcement Learning method to calibrate these equilibria to real-world data, demonstrated on a market example.

Contribution

It formally defines Shared equilibrium, proves convergence for certain games, and presents a novel calibration method using dual-Reinforcement Learning for multi-agent systems.

Findings

01

Shared equilibrium is a symmetric pure Nash equilibrium in a Functional Form Game.

02

The proposed calibration method effectively matches emergent behaviors to external targets.

03

Application to a market example demonstrates practical scalability and behavior differentiation.

Abstract

Training multi-agent systems (MAS) to achieve realistic equilibria gives us a useful tool to understand and model real-world systems. We consider a general sum partially observable Markov game where agents of different types share a single policy network, conditioned on agent-specific information. This paper aims at i) formally understanding equilibria reached by such agents, and ii) matching emergent phenomena of such equilibria to real-world targets. Parameter sharing with decentralized execution has been introduced as an efficient way to train multiple agents using a single policy network. However, the nature of resulting equilibria reached by such agents has not been yet studied: we introduce the novel concept of Shared equilibrium as a symmetric pure Nash equilibrium of a certain Functional Form Game (FFG) and prove convergence to the latter for a certain class of games using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games· slideslive

Taxonomy

TopicsGame Theory and Applications · Reinforcement Learning in Robotics · Optimization and Search Problems