TorchBeast: A PyTorch Platform for Distributed RL

Heinrich K\"uttler; Nantas Nardelli; Thibaut Lavril; Marco Selvatici,; Viswanath Sivakumar; Tim Rockt\"aschel; Edward Grefenstette

arXiv:1910.03552·cs.LG·October 9, 2019·29 cites

TorchBeast: A PyTorch Platform for Distributed RL

Heinrich K\"uttler, Nantas Nardelli, Thibaut Lavril, Marco Selvatici,, Viswanath Sivakumar, Tim Rockt\"aschel, Edward Grefenstette

PDF

Open Access 3 Repos

TL;DR

TorchBeast is an open-source PyTorch platform that simplifies scalable reinforcement learning research by implementing IMPALA with both single-machine and multi-machine versions, maintaining high performance and ease of use.

Contribution

It introduces a flexible, easy-to-use RL platform in PyTorch with both pure Python and high-performance multi-machine implementations, facilitating scalable RL research.

Findings

01

Performs on par with IMPALA on Atari benchmarks

02

Provides both Python-only and multi-machine high-performance versions

03

Enables scalable RL research with minimal programming complexity

Abstract

TorchBeast is a platform for reinforcement learning (RL) research in PyTorch. It implements a version of the popular IMPALA algorithm for fast, asynchronous, parallel training of RL agents. Additionally, TorchBeast has simplicity as an explicit design goal: We provide both a pure-Python implementation ("MonoBeast") as well as a multi-machine high-performance version ("PolyBeast"). In the latter, parts of the implementation are written in C++, but all parts pertaining to machine learning are kept in simple Python using PyTorch, with the environments provided using the OpenAI Gym interface. This enables researchers to conduct scalable RL research using TorchBeast without any programming knowledge beyond Python and PyTorch. In this paper, we describe the TorchBeast design principles and implementation and demonstrate that it performs on-par with IMPALA on Atari. TorchBeast is released as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel-Driven Software Engineering Techniques · Modular Robots and Swarm Intelligence · Embedded Systems Design Techniques

MethodsTorchBeast · Sigmoid Activation · Tanh Activation · V-trace · Experience Replay · Entropy Regularization · Residual Connection · Gradient Clipping · RMSProp · *Communicated@Fast*How Do I Communicate to Expedia?