A Dissection of Overfitting and Generalization in Continuous   Reinforcement Learning

Amy Zhang; Nicolas Ballas; Joelle Pineau

arXiv:1806.07937·cs.LG·June 26, 2018·93 cites

A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning

Amy Zhang, Nicolas Ballas, Joelle Pineau

PDF

Open Access

TL;DR

This paper explores overfitting in continuous deep reinforcement learning, proposing new diagnostic tools and strategies to improve generalization and reduce overfitting risks in complex environments.

Contribution

It introduces novel perspectives on diagnosing and preventing overfitting in continuous RL, emphasizing training diversity and practical insights for researchers.

Findings

01

Overfitting can be diagnosed in MDPs using new criteria.

02

Increasing training diversity reduces overfitting risks.

03

Deep RL methods are inherently brittle without proper regularization.

Abstract

The risks and perils of overfitting in machine learning are well known. However most of the treatment of this, including diagnostic tools and remedies, was developed for the supervised learning case. In this work, we aim to offer new perspectives on the characterization and prevention of overfitting in deep Reinforcement Learning (RL) methods, with a particular focus on continuous domains. We examine several aspects, such as how to define and diagnose overfitting in MDPs, and how to reduce risks by injecting sufficient training diversity. This work complements recent findings on the brittleness of deep RL methods and offers practical observations for RL researchers and practitioners.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Software Engineering Research