WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Tian Lan; Sunil Srinivasa; Huan Wang; Stephan Zheng

arXiv:2108.13976 · cs.LG · October 12, 2021 · 1 citation

TL;DR

WarpDrive is an open-source GPU-based framework that significantly accelerates multi-agent deep reinforcement learning by enabling thousands of concurrent simulations with high throughput, reducing training time for complex environments.

Contribution

The paper introduces WarpDrive, a novel GPU-accelerated framework for multi-agent RL that eliminates CPU-GPU data transfer bottlenecks and enables scalable, high-throughput training.

Findings

01. Achieves 2.9 million environment steps per second in benchmarks

02. Scales almost linearly with the number of agents and environments

03. Outperforms CPU implementations by at least 100x in throughput

Abstract

Deep reinforcement learning (RL) is a powerful framework to train decision-making models in complex environments. However, RL can be slow as it requires repeated interaction with a simulation of the environment. In particular, there are key system engineering bottlenecks when using RL in complex environments that feature multiple agents with high-dimensional state, observation, or action spaces. We present WarpDrive, a flexible, lightweight, and easy-to-use open-source RL framework that implements end-to-end deep multi-agent RL on a single GPU (Graphics Processing Unit), built on PyCUDA and PyTorch. Using the extreme parallelization capability of GPUs, WarpDrive enables orders-of-magnitude faster RL compared to common implementations that blend CPU simulations and GPU models. Our design runs simulations and the agents in each simulation in parallel. It eliminates data copying between…
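The idea of running every simulation, and every agent within each simulation, in parallel on state that never leaves the device can be illustrated with a small sketch. This is not WarpDrive's API: the names `flat_index` and `step_all`, the toy 1-D environment, and the row-major buffer layout are all hypothetical, and plain Python loops stand in for GPU threads. The assumption is that each (environment, agent) pair owns one slot in a single shared buffer and is updated independently in place.

```python
from typing import List


def flat_index(env_id: int, agent_id: int, num_agents: int) -> int:
    """Row-major slot of one agent in one environment (hypothetical layout)."""
    return env_id * num_agents + agent_id


def step_all(positions: List[int], actions: List[int],
             num_envs: int, num_agents: int,
             lo: int = 0, hi: int = 9) -> None:
    """Advance every agent in every environment by one step, in place.

    The two loops are a serial stand-in: on a GPU, each (env_id, agent_id)
    pair could be handled by its own thread (an assumption of this sketch),
    so no per-step copy of the state buffer is needed.
    """
    for env_id in range(num_envs):
        for agent_id in range(num_agents):
            i = flat_index(env_id, agent_id, num_agents)
            # Toy dynamics: move by the action, clipped to [lo, hi].
            positions[i] = min(hi, max(lo, positions[i] + actions[i]))


num_envs, num_agents = 3, 2
positions = [5] * (num_envs * num_agents)   # one shared state buffer
actions = [1, -1, 0, 1, -1, 0]              # one action per (env, agent) slot
step_all(positions, actions, num_envs, num_agents)
print(positions)  # [6, 4, 5, 6, 4, 5]
```

Because each slot is updated independently, the work grows with the product of environments and agents while the buffer stays in one place, which is the property the abstract attributes to the single-GPU design.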

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Robotic Path Planning Algorithms