Efficient Parallel Methods for Deep Reinforcement Learning

Alfredo V. Clemente; Humberto N. Castejón; Arjun Chandra

arXiv:1705.04862 · cs.LG · May 17, 2017 · 80 cites

TL;DR

This paper introduces a GPU-friendly, parallel framework for deep reinforcement learning that accelerates training times and is compatible with various algorithms, demonstrated by achieving state-of-the-art results on Atari games.

Contribution

The authors present a novel, algorithm-agnostic parallelization framework for deep reinforcement learning that significantly reduces training time on a single machine.

Findings

01 Achieved state-of-the-art Atari performance within hours

02 Framework is compatible with multiple RL algorithms

03 Open-source implementation available for rapid experimentation

Abstract

We propose a novel framework for efficient parallelization of deep reinforcement learning algorithms, enabling these algorithms to learn from multiple actors on a single machine. The framework is algorithm agnostic and can be applied to on-policy, off-policy, value based and policy gradient based algorithms. Given its inherent parallelism, the framework can be efficiently implemented on a GPU, allowing the usage of powerful models while significantly reducing training time. We demonstrate the effectiveness of our framework by implementing an advantage actor-critic algorithm on a GPU, using on-policy experiences and employing synchronous updates. Our algorithm achieves state-of-the-art performance on the Atari domain after only a few hours of training. Our framework thus opens the door for much faster experimentation on demanding problem domains. Our implementation is open-source and is…
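The synchronous, multi-actor data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The toy environment, the linear policy and value heads, the n-step return estimator, and every name and constant below (`N_ENVS`, `T_MAX`, `toy_env_step`, `n_step_returns`, and so on) are assumptions made for illustration. The sketch only shows several actors stepped in lockstep on one machine, a single batched forward pass per step, and on-policy advantage estimates computed from the collected batch.

```python
import numpy as np

# All sizes and the discount factor are illustrative assumptions.
N_ENVS, T_MAX, OBS_DIM, N_ACTIONS, GAMMA = 8, 5, 4, 3, 0.99
rng = np.random.default_rng(0)

# Hypothetical linear policy/value heads standing in for a deep network.
W_pi = rng.normal(scale=0.1, size=(OBS_DIM, N_ACTIONS))
w_v = rng.normal(scale=0.1, size=OBS_DIM)


def policy_and_value(obs):
    """One batched forward pass for all actors; obs is (N_ENVS, OBS_DIM)."""
    logits = obs @ W_pi
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)
    return probs, obs @ w_v


def toy_env_step(obs, actions):
    """Toy stand-in for N_ENVS environment instances stepped in lockstep."""
    next_obs = rng.normal(size=obs.shape)
    rewards = (actions == 0).astype(np.float64)  # reward 1 for action 0
    dones = rng.random(len(actions)) < 0.1       # random episode ends
    return next_obs, rewards, dones


def n_step_returns(rewards, dones, bootstrap, gamma):
    """Discounted returns per actor; bootstrapping is cut where an episode ended."""
    returns = np.zeros_like(rewards)
    running = bootstrap
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running * (~dones[t])
        returns[t] = running
    return returns


def collect_rollout(obs):
    """Gather T_MAX synchronous steps from every actor with the current policy."""
    rewards = np.zeros((T_MAX, N_ENVS))
    values = np.zeros((T_MAX, N_ENVS))
    dones = np.zeros((T_MAX, N_ENVS), dtype=bool)
    for t in range(T_MAX):
        probs, values[t] = policy_and_value(obs)
        actions = np.array([rng.choice(N_ACTIONS, p=p) for p in probs])
        obs, rewards[t], dones[t] = toy_env_step(obs, actions)
    _, bootstrap = policy_and_value(obs)
    returns = n_step_returns(rewards, dones, bootstrap, GAMMA)
    advantages = returns - values  # advantage estimate used by actor-critic
    return obs, returns, advantages


obs0 = rng.normal(size=(N_ENVS, OBS_DIM))
next_obs, returns, advantages = collect_rollout(obs0)
print(returns.shape, advantages.shape)
```

In a full training loop, the returned batch would feed one gradient update before the next rollout begins, so all experience stays on-policy. The matrix product in `policy_and_value` is the step that a GPU implementation would accelerate, since it evaluates the model for every actor at once.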

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Adversarial Robustness in Machine Learning