Resolving Implicit Coordination in Multi-Agent Deep Reinforcement   Learning with Deep Q-Networks & Game Theory

Griffin Adams; Sarguna Janani Padmanabhan; Shivang Shekhar

arXiv:2012.09136·cs.MA·December 17, 2020

Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory

Griffin Adams, Sarguna Janani Padmanabhan, Shivang Shekhar

PDF

Open Access 1 Repo

TL;DR

This paper proposes a novel multi-agent deep reinforcement learning approach combining Deep Q-Networks with game theory to improve implicit coordination, convergence, and stability in complex multi-agent environments.

Contribution

It introduces the Friend-or-Foe algorithm, leveraging Nash equilibrium concepts and residual networks to enhance coordination and convergence in multi-agent deep reinforcement learning.

Findings

01

Faster convergence with the Friend-or-Foe algorithm compared to Nash-Q.

02

Successful coordination in complex environments like Predator Prey and Warehouse.

03

Insights into game theoretic variables affecting multi-agent learning.

Abstract

We address two major challenges of implicit coordination in multi-agent deep reinforcement learning: non-stationarity and exponential growth of state-action space, by combining Deep-Q Networks for policy learning with Nash equilibrium for action selection. Q-values proxy as payoffs in Nash settings, and mutual best responses define joint action selection. Coordination is implicit because multiple/no Nash equilibria are resolved deterministically. We demonstrate that knowledge of game type leads to an assumption of mirrored best responses and faster convergence than Nash-Q. Specifically, the Friend-or-Foe algorithm demonstrates signs of convergence to a Set Controller which jointly chooses actions for two agents. This encouraging given the highly unstable nature of decentralized coordination over joint actions. Inspired by the dueling network architecture, which decouples the Q-function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

psjanani/RLAProject
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsDense Connections · Convolution · Double Q-learning · Dueling Network