SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning
Linxi Fan, Yuke Zhu, Jiren Zhu, Zihua Liu, Orien Zeng, Anchit Gupta,, Joan Creus-Costa, Silvio Savarese, Li Fei-Fei

TL;DR
SURREAL-System is a comprehensive, scalable framework for distributed reinforcement learning that simplifies deployment across diverse hardware and achieves state-of-the-art performance on standard benchmarks.
Contribution
It introduces a fully-integrated stack for distributed RL, combining hardware abstraction, high-level scheduling, optimized communication, and scalable algorithms.
Findings
Achieves state-of-the-art results on OpenAI Gym tasks.
Supports scaling to thousands of CPU cores and hundreds of GPUs.
Provides a flexible framework adaptable to various hardware environments.
Abstract
We present an overview of SURREAL-System, a reproducible, flexible, and scalable framework for distributed reinforcement learning (RL). The framework consists of a stack of four layers: Provisioner, Orchestrator, Protocol, and Algorithms. The Provisioner abstracts away the machine hardware and node pools across different cloud providers. The Orchestrator provides a unified interface for scheduling and deploying distributed algorithms by high-level description, which is capable of deploying to a wide range of hardware from a personal laptop to full-fledged cloud clusters. The Protocol provides network communication primitives optimized for RL. Finally, the SURREAL algorithms, such as Proximal Policy Optimization (PPO) and Evolution Strategies (ES), can easily scale to 1000s of CPU cores and 100s of GPUs. The learning performances of our distributed algorithms establish new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · IoT and Edge/Fog Computing
