Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning

Mahmoud Assran; Joshua Romoff; Nicolas Ballas; Joelle Pineau; Michael; Rabbat

arXiv:1906.04585·cs.LG·April 23, 2020·19 cites

Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning

Mahmoud Assran, Joshua Romoff, Nicolas Ballas, Joelle Pineau, Michael, Rabbat

PDF

Open Access 1 Repo

TL;DR

GALA introduces a gossip-based peer-to-peer communication framework for distributed deep reinforcement learning, enhancing scalability, efficiency, and robustness over traditional synchronous methods like A2C.

Contribution

The paper proposes GALA, a novel gossip-based architecture for actor-learner agents that improves scalability and efficiency in distributed deep reinforcement learning.

Findings

01

GALA maintains agents within an epsilon-ball during training.

02

GALA outperforms A2C in robustness and sample efficiency.

03

GALA achieves higher hardware utilization and frame-rates on GPUs.

Abstract

Multi-simulator training has contributed to the recent success of Deep Reinforcement Learning by stabilizing learning and allowing for higher training throughputs. We propose Gossip-based Actor-Learner Architectures (GALA) where several actor-learners (such as A2C agents) are organized in a peer-to-peer communication topology, and exchange information through asynchronous gossip in order to take advantage of a large number of distributed simulators. We prove that GALA agents remain within an epsilon-ball of one-another during training when using loosely coupled asynchronous communication. By reducing the amount of synchronization between agents, GALA is more computationally efficient and scalable compared to A2C, its fully-synchronous counterpart. GALA also outperforms A2C, being more robust and sample efficient. We show that we can run several loosely coupled GALA agents in parallel on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/gala
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Distributed Control Multi-Agent Systems

MethodsGlobal-and-Local attention · A2C