Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory
Griffin Adams, Sarguna Janani Padmanabhan, Shivang Shekhar

TL;DR
This paper proposes a novel multi-agent deep reinforcement learning approach combining Deep Q-Networks with game theory to improve implicit coordination, convergence, and stability in complex multi-agent environments.
Contribution
It introduces the Friend-or-Foe algorithm, leveraging Nash equilibrium concepts and residual networks to enhance coordination and convergence in multi-agent deep reinforcement learning.
Findings
Faster convergence with the Friend-or-Foe algorithm compared to Nash-Q.
Successful coordination in complex environments like Predator Prey and Warehouse.
Insights into game theoretic variables affecting multi-agent learning.
Abstract
We address two major challenges of implicit coordination in multi-agent deep reinforcement learning: non-stationarity and exponential growth of state-action space, by combining Deep-Q Networks for policy learning with Nash equilibrium for action selection. Q-values proxy as payoffs in Nash settings, and mutual best responses define joint action selection. Coordination is implicit because multiple/no Nash equilibria are resolved deterministically. We demonstrate that knowledge of game type leads to an assumption of mirrored best responses and faster convergence than Nash-Q. Specifically, the Friend-or-Foe algorithm demonstrates signs of convergence to a Set Controller which jointly chooses actions for two agents. This encouraging given the highly unstable nature of decentralized coordination over joint actions. Inspired by the dueling network architecture, which decouples the Q-function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsDense Connections · Convolution · Double Q-learning · Dueling Network
