Accelerated Methods for Deep Reinforcement Learning

Adam Stooke; Pieter Abbeel

arXiv:1803.02811 · cs.LG · January 14, 2019 · 98 cites

TL;DR

This paper presents a unified framework for accelerating deep reinforcement learning by adapting existing algorithms to modern CPU-GPU machines, shortening both experiment turnaround and training time.

Contribution

The paper introduces a parallelization framework, covering both policy gradient and Q-value learning algorithms, that speeds up deep RL training on combined CPU-GPU hardware to the point that successful Atari strategies are learned in minutes on a DGX-1.

Findings

01. Parallelizing across many simulator instances speeds up training without loss of final performance.

02. Batch sizes considerably larger than standard do not harm sample complexity.

03. Training on an entire DGX-1 learns successful Atari strategies in minutes.

Abstract

Deep reinforcement learning (RL) has achieved many recent successes, yet experiment turn-around time remains a key bottleneck in research and in practice. We investigate how to optimize existing deep RL algorithms for modern computers, specifically for a combination of CPUs and GPUs. We confirm that both policy gradient and Q-value learning algorithms can be adapted to learn using many parallel simulator instances. We further find it possible to train using batch sizes considerably larger than are standard, without negatively affecting sample complexity or final performance. We leverage these facts to build a unified framework for parallelization that dramatically hastens experiments in both classes of algorithm. All neural network computations use GPUs, accelerating both data collection and training. Our results include using an entire DGX-1 to learn successful strategies in Atari…
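The idea of learning from many parallel simulator instances, with one network call serving all of them, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `ToyEnv`, `batched_policy`, and `collect` are hypothetical names, the environment is a random stand-in rather than an Atari emulator, the "network" is a fixed linear scorer, and everything runs in plain Python where the paper uses GPUs.

```python
import random


class ToyEnv:
    """Hypothetical stand-in for one simulator instance (not an Atari emulator)."""

    def __init__(self, seed, obs_dim=4, horizon=10):
        self.rng = random.Random(seed)
        self.obs_dim = obs_dim
        self.horizon = horizon
        self.t = 0

    def _observe(self):
        return [self.rng.gauss(0.0, 1.0) for _ in range(self.obs_dim)]

    def reset(self):
        self.t = 0
        return self._observe()

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 0 else 0.0  # arbitrary toy reward
        done = self.t >= self.horizon
        # Auto-reset so every simulator always has a valid next observation.
        obs = self.reset() if done else self._observe()
        return obs, reward, done


def batched_policy(weights, obs_batch):
    """Pick a greedy action for every simulator in a single call.

    `weights` holds one row of obs_dim floats per action (a linear scorer
    standing in for a neural network forward pass on the whole batch).
    """
    actions = []
    for obs in obs_batch:
        scores = [sum(w * o for w, o in zip(row, obs)) for row in weights]
        actions.append(scores.index(max(scores)))
    return actions


def collect(envs, weights, n_steps):
    """Step all simulators in lockstep, calling the policy once per step."""
    obs_batch = [env.reset() for env in envs]
    trajectory = []
    for _ in range(n_steps):
        actions = batched_policy(weights, obs_batch)
        results = [env.step(a) for env, a in zip(envs, actions)]
        rewards = [r[1] for r in results]
        trajectory.append((obs_batch, actions, rewards))
        obs_batch = [r[0] for r in results]
    return trajectory


envs = [ToyEnv(seed=i) for i in range(8)]
weights = [[0.5, -0.2, 0.1, 0.0], [-0.3, 0.4, 0.0, 0.2], [0.1, 0.1, -0.5, 0.3]]
trajectory = collect(envs, weights, n_steps=5)
print(len(trajectory), len(trajectory[0][1]))  # 5 steps, 8 actions per step
```

In this sketch, adding simulators grows the batch handed to the policy without adding policy calls per step, which is the property that makes batched inference on an accelerator attractive. The resulting `n_steps * len(envs)` samples could then feed one large training batch.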

Taxonomy

Topics: Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Modular Robots and Swarm Intelligence