Deep Reinforcement Learning that Matters

Peter Henderson; Riashat Islam; Philip Bachman; Joelle Pineau; Doina; Precup; David Meger

arXiv:1709.06560·cs.LG·January 31, 2019·365 cites

Deep Reinforcement Learning that Matters

Peter Henderson, Riashat Islam, Philip Bachman, Joelle Pineau, Doina, Precup, David Meger

PDF

Open Access 4 Repos

TL;DR

This paper discusses the challenges of reproducibility in deep reinforcement learning, emphasizing the need for standardized reporting and significance metrics to ensure meaningful progress in the field.

Contribution

It identifies key issues in reproducibility and proposes guidelines to improve experimental reporting and result interpretation in deep RL research.

Findings

01

Reproducibility issues cause variability in reported results.

02

Standardized reporting can improve comparison of deep RL methods.

03

Guidelines can help ensure meaningful progress in the field.

Abstract

In recent years, significant progress has been made in solving challenging problems across various domains using deep reinforcement learning (RL). Reproducing existing work and accurately judging the improvements offered by novel methods is vital to sustaining this progress. Unfortunately, reproducing results for state-of-the-art deep RL methods is seldom straightforward. In particular, non-determinism in standard benchmark environments, combined with variance intrinsic to the methods, can make reported results tough to interpret. Without significance metrics and tighter standardization of experimental reporting, it is difficult to determine whether improvements over the prior state-of-the-art are meaningful. In this paper, we investigate challenges posed by reproducibility, proper experimental techniques, and reporting procedures. We illustrate the variability in reported metrics and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics