Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents

Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, and Tamer Başar

arXiv:1802.08757 · cs.LG · February 28, 2018 · 250 cites

Open Access 5 Repos

TL;DR

This paper introduces fully decentralized multi-agent reinforcement learning algorithms for networked agents with provable convergence, enabling large-scale, online, and distributed decision-making without centralized control.

Contribution

It presents the first fully decentralized MARL algorithms with convergence guarantees, applicable to large-scale problems with function approximation.

Findings

01. Algorithms are fully incremental and online.

02. Convergence is proven for linear function approximation.

03. Extensive simulations validate effectiveness.

Abstract

We consider the problem of fully decentralized multi-agent reinforcement learning (MARL), where the agents are located at the nodes of a time-varying communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. Within this setting, the collective goal of the agents is to maximize the globally averaged return over the network through exchanging information with their neighbors. To this end, we propose two decentralized actor-critic algorithms with function approximation, which are applicable to large-scale MARL problems where both the number of states and the number of agents are massively large. Under the…
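The abstract's central idea, that agents with private rewards can pursue a globally averaged objective by exchanging information only with neighbors, can be illustrated with a plain consensus-averaging sketch. This is not the paper's actor-critic algorithm (its update rules are not given in this excerpt). It assumes a fixed ring network rather than a time-varying one, and the names `ring_weights`, `consensus`, and `local_rewards` are hypothetical, chosen only for illustration.

```python
import numpy as np


def ring_weights(n: int) -> np.ndarray:
    """Doubly stochastic weight matrix for a ring of n >= 3 agents.

    Assumption for illustration: each agent weights itself and its two
    ring neighbors equally. The paper's network is time-varying; this
    sketch keeps it fixed.
    """
    W = np.zeros((n, n))
    for i in range(n):
        W[i, i] = 1.0 / 3.0
        W[i, (i - 1) % n] = 1.0 / 3.0
        W[i, (i + 1) % n] = 1.0 / 3.0
    return W


def consensus(values: np.ndarray, W: np.ndarray, steps: int) -> np.ndarray:
    """Repeatedly replace each agent's value with a weighted neighbor average."""
    x = np.asarray(values, dtype=float).copy()
    for _ in range(steps):
        x = W @ x  # each agent only combines values from its neighbors
    return x


# Hypothetical private rewards, one per agent; no agent sees the others' rewards.
local_rewards = np.array([1.0, 0.0, 4.0, 2.0, 3.0])
W = ring_weights(len(local_rewards))
estimates = consensus(local_rewards, W, steps=200)

# After enough rounds, every agent's estimate is close to the network-wide mean.
print(estimates, local_rewards.mean())
```

Because the weight matrix is doubly stochastic and the ring is connected, the local estimates contract toward the network-wide average without any agent collecting all rewards centrally. A decentralized actor-critic method would interleave a step of this kind with local learning updates, but that combination is beyond what this sketch shows.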

Taxonomy

Topics: Distributed Control Multi-Agent Systems · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics