Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning
Mahmoud Assran, Joshua Romoff, Nicolas Ballas, Joelle Pineau, Michael, Rabbat

TL;DR
GALA introduces a gossip-based peer-to-peer communication framework for distributed deep reinforcement learning, enhancing scalability, efficiency, and robustness over traditional synchronous methods like A2C.
Contribution
The paper proposes GALA, a novel gossip-based architecture for actor-learner agents that improves scalability and efficiency in distributed deep reinforcement learning.
Findings
GALA maintains agents within an epsilon-ball during training.
GALA outperforms A2C in robustness and sample efficiency.
GALA achieves higher hardware utilization and frame-rates on GPUs.
Abstract
Multi-simulator training has contributed to the recent success of Deep Reinforcement Learning by stabilizing learning and allowing for higher training throughputs. We propose Gossip-based Actor-Learner Architectures (GALA) where several actor-learners (such as A2C agents) are organized in a peer-to-peer communication topology, and exchange information through asynchronous gossip in order to take advantage of a large number of distributed simulators. We prove that GALA agents remain within an epsilon-ball of one-another during training when using loosely coupled asynchronous communication. By reducing the amount of synchronization between agents, GALA is more computationally efficient and scalable compared to A2C, its fully-synchronous counterpart. GALA also outperforms A2C, being more robust and sample efficient. We show that we can run several loosely coupled GALA agents in parallel on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Distributed Control Multi-Agent Systems
MethodsGlobal-and-Local attention · A2C
