Optimizing Data Collection in Deep Reinforcement Learning

James Gleeson; Daniel Snider; Yvonne Yang; Moshe Gabel; Eyal de Lara,; Gennady Pekhimenko

arXiv:2207.07736·cs.LG·July 19, 2022

Optimizing Data Collection in Deep Reinforcement Learning

James Gleeson, Daniel Snider, Yvonne Yang, Moshe Gabel, Eyal de Lara,, Gennady Pekhimenko

PDF

Open Access

TL;DR

This paper introduces GPU-based optimizations, including vectorization and kernel fusion, to significantly accelerate data collection in deep reinforcement learning, reducing training time and improving efficiency.

Contribution

It presents novel GPU-centric techniques for simulation acceleration in RL, achieving up to 1024x speedup and demonstrating their effectiveness over traditional CPU-based methods.

Findings

01

GPU vectorization achieves up to 1024x speedup.

02

ML compiler implementations outperform DNN frameworks by 13.4x.

03

Kernel fusion provides up to 1024x speedup, especially with complex simulators.

Abstract

Reinforcement learning (RL) workloads take a notoriously long time to train due to the large number of samples collected at run-time from simulators. Unfortunately, cluster scale-up approaches remain expensive, and commonly used CPU implementations of simulators induce high overhead when switching back and forth between GPU computations. We explore two optimizations that increase RL data collection efficiency by increasing GPU utilization: (1) GPU vectorization: parallelizing simulation on the GPU for increased hardware parallelism, and (2) simulator kernel fusion: fusing multiple simulation steps to run in a single GPU kernel launch to reduce global memory bandwidth requirements. We find that GPU vectorization can achieve up to $1024 \times$ speedup over commonly used CPU simulators. We profile the performance of different implementations and show that for a simple simulator, ML…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Modular Robots and Swarm Intelligence