Investigating Generalisation in Continuous Deep Reinforcement Learning

Chenyang Zhao; Olivier Sigaud; Freek Stulp; Timothy M. Hospedales

arXiv:1902.07015·cs.LG·February 21, 2019·34 cites

Investigating Generalisation in Continuous Deep Reinforcement Learning

Chenyang Zhao, Olivier Sigaud, Freek Stulp, Timothy M. Hospedales

PDF

Open Access

TL;DR

This paper examines the generalisation challenges in deep reinforcement learning, highlighting the gap between training performance and real-world robustness, and evaluates techniques to improve generalisation under domain shifts.

Contribution

It characterises sources of uncertainty affecting generalisation, introduces a new benchmark, and empirically evaluates state-of-the-art methods for robustness in Deep RL.

Findings

01

Training performance alone is misleading for generalisation.

02

Common techniques vary in robustness under domain shift.

03

Certain methods significantly improve generalisation in Deep RL.

Abstract

Deep Reinforcement Learning has shown great success in a variety of control tasks. However, it is unclear how close we are to the vision of putting Deep RL into practice to solve real world problems. In particular, common practice in the field is to train policies on largely deterministic simulators and to evaluate algorithms through training performance alone, without a train/test distinction to ensure models generalise and are not overfitted. Moreover, it is not standard practice to check for generalisation under domain shift, although robustness to such system change between training and testing would be necessary for real-world Deep RL control, for example, in robotics. In this paper we study these issues by first characterising the sources of uncertainty that provide generalisation challenges in Deep RL. We then provide a new benchmark and thorough empirical evaluation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Machine Learning and Algorithms