Coding for Distributed Multi-Agent Reinforcement Learning

Baoqian Wang; Junfei Xie; Nikolay Atanasov

arXiv:2101.02308·cs.LG·January 8, 2021·1 cites

Coding for Distributed Multi-Agent Reinforcement Learning

Baoqian Wang, Junfei Xie, Nikolay Atanasov

PDF

Open Access

TL;DR

This paper introduces a coded distributed learning framework to mitigate straggler effects in multi-agent reinforcement learning, improving training speed while maintaining accuracy through various coding schemes.

Contribution

It proposes a novel coded distributed learning framework for MARL that effectively handles stragglers, with a specific implementation for MADDPG and evaluation of multiple coding schemes.

Findings

01

Coded framework speeds up MARL training in presence of stragglers

02

Different coding schemes are effective in distributed MARL

03

Simulations show promising performance in multi-robot tasks

Abstract

This paper aims to mitigate straggler effects in synchronous distributed learning for multi-agent reinforcement learning (MARL) problems. Stragglers arise frequently in a distributed learning system, due to the existence of various system disturbances such as slow-downs or failures of compute nodes and communication bottlenecks. To resolve this issue, we propose a coded distributed learning framework, which speeds up the training of MARL algorithms in the presence of stragglers, while maintaining the same accuracy as the centralized approach. As an illustration, a coded distributed version of the multi-agent deep deterministic policy gradient(MADDPG) algorithm is developed and evaluated. Different coding schemes, including maximum distance separable (MDS)code, random sparse code, replication-based code, and regular low density parity check (LDPC) code are also investigated. Simulations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Advanced Bandit Algorithms Research