Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
Kaiqing Zhang, Zhuoran Yang, Han Liu, Tong Zhang, and Tamer Ba\c{s}ar

TL;DR
This paper introduces fully decentralized multi-agent reinforcement learning algorithms for networked agents with provable convergence, enabling large-scale, online, and distributed decision-making without centralized control.
Contribution
It presents the first fully decentralized MARL algorithms with convergence guarantees, applicable to large-scale problems with function approximation.
Findings
Algorithms are fully incremental and online.
Convergence is proven for linear function approximation.
Extensive simulations validate effectiveness.
Abstract
We consider the problem of \emph{fully decentralized} multi-agent reinforcement learning (MARL), where the agents are located at the nodes of a time-varying communication network. Specifically, we assume that the reward functions of the agents might correspond to different tasks, and are only known to the corresponding agent. Moreover, each agent makes individual decisions based on both the information observed locally and the messages received from its neighbors over the network. Within this setting, the collective goal of the agents is to maximize the globally averaged return over the network through exchanging information with their neighbors. To this end, we propose two decentralized actor-critic algorithms with function approximation, which are applicable to large-scale MARL problems where both the number of states and the number of agents are massively large. Under the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Adaptive Dynamic Programming Control · Reinforcement Learning in Robotics
