WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
Tian Lan, Sunil Srinivasa, Huan Wang, Stephan Zheng

TL;DR
WarpDrive is an open-source GPU-based framework that significantly accelerates multi-agent deep reinforcement learning by enabling thousands of concurrent simulations with high throughput, reducing training time for complex environments.
Contribution
The paper introduces WarpDrive, a novel GPU-accelerated framework for multi-agent RL that eliminates CPU-GPU data transfer bottlenecks and enables scalable, high-throughput training.
Findings
Achieves 2.9 million environment steps per second in benchmarks
Scales almost linearly with the number of agents and environments
Outperforms CPU implementations by at least 100x in throughput
Abstract
Deep reinforcement learning (RL) is a powerful framework to train decision-making models in complex environments. However, RL can be slow as it requires repeated interaction with a simulation of the environment. In particular, there are key system engineering bottlenecks when using RL in complex environments that feature multiple agents with high-dimensional state, observation, or action spaces. We present WarpDrive, a flexible, lightweight, and easy-to-use open-source RL framework that implements end-to-end deep multi-agent RL on a single GPU (Graphics Processing Unit), built on PyCUDA and PyTorch. Using the extreme parallelization capability of GPUs, WarpDrive enables orders-of-magnitude faster RL compared to common implementations that blend CPU simulations and GPU models. Our design runs simulations and the agents in each simulation in parallel. It eliminates data copying between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Robotic Path Planning Algorithms
