TorchBeast: A PyTorch Platform for Distributed RL
Heinrich K\"uttler, Nantas Nardelli, Thibaut Lavril, Marco Selvatici,, Viswanath Sivakumar, Tim Rockt\"aschel, Edward Grefenstette

TL;DR
TorchBeast is an open-source PyTorch platform that simplifies scalable reinforcement learning research by implementing IMPALA with both single-machine and multi-machine versions, maintaining high performance and ease of use.
Contribution
It introduces a flexible, easy-to-use RL platform in PyTorch with both pure Python and high-performance multi-machine implementations, facilitating scalable RL research.
Findings
Performs on par with IMPALA on Atari benchmarks
Provides both Python-only and multi-machine high-performance versions
Enables scalable RL research with minimal programming complexity
Abstract
TorchBeast is a platform for reinforcement learning (RL) research in PyTorch. It implements a version of the popular IMPALA algorithm for fast, asynchronous, parallel training of RL agents. Additionally, TorchBeast has simplicity as an explicit design goal: We provide both a pure-Python implementation ("MonoBeast") as well as a multi-machine high-performance version ("PolyBeast"). In the latter, parts of the implementation are written in C++, but all parts pertaining to machine learning are kept in simple Python using PyTorch, with the environments provided using the OpenAI Gym interface. This enables researchers to conduct scalable RL research using TorchBeast without any programming knowledge beyond Python and PyTorch. In this paper, we describe the TorchBeast design principles and implementation and demonstrate that it performs on-par with IMPALA on Atari. TorchBeast is released as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel-Driven Software Engineering Techniques · Modular Robots and Swarm Intelligence · Embedded Systems Design Techniques
MethodsTorchBeast · Sigmoid Activation · Tanh Activation · V-trace · Experience Replay · Entropy Regularization · Residual Connection · Gradient Clipping · RMSProp · *Communicated@Fast*How Do I Communicate to Expedia?
