SURREAL-System: Fully-Integrated Stack for Distributed Deep   Reinforcement Learning

Linxi Fan; Yuke Zhu; Jiren Zhu; Zihua Liu; Orien Zeng; Anchit Gupta,; Joan Creus-Costa; Silvio Savarese; Li Fei-Fei

arXiv:1909.12989·cs.LG·October 14, 2019·1 cites

SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning

Linxi Fan, Yuke Zhu, Jiren Zhu, Zihua Liu, Orien Zeng, Anchit Gupta,, Joan Creus-Costa, Silvio Savarese, Li Fei-Fei

PDF

Open Access

TL;DR

SURREAL-System is a comprehensive, scalable framework for distributed reinforcement learning that simplifies deployment across diverse hardware and achieves state-of-the-art performance on standard benchmarks.

Contribution

It introduces a fully-integrated stack for distributed RL, combining hardware abstraction, high-level scheduling, optimized communication, and scalable algorithms.

Findings

01

Achieves state-of-the-art results on OpenAI Gym tasks.

02

Supports scaling to thousands of CPU cores and hundreds of GPUs.

03

Provides a flexible framework adaptable to various hardware environments.

Abstract

We present an overview of SURREAL-System, a reproducible, flexible, and scalable framework for distributed reinforcement learning (RL). The framework consists of a stack of four layers: Provisioner, Orchestrator, Protocol, and Algorithms. The Provisioner abstracts away the machine hardware and node pools across different cloud providers. The Orchestrator provides a unified interface for scheduling and deploying distributed algorithms by high-level description, which is capable of deploying to a wide range of hardware from a personal laptop to full-fledged cloud clusters. The Protocol provides network communication primitives optimized for RL. Finally, the SURREAL algorithms, such as Proximal Policy Optimization (PPO) and Evolution Strategies (ES), can easily scale to 1000s of CPU cores and 100s of GPUs. The learning performances of our distributed algorithms establish new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · IoT and Edge/Fog Computing