Continual Model-Based Reinforcement Learning with Hypernetworks

Yizhou Huang; Kevin Xie; Homanga Bharadhwaj; Florian Shkurti

arXiv:2009.11997·cs.LG·March 31, 2021·6 cites

Continual Model-Based Reinforcement Learning with Hypernetworks

Yizhou Huang, Kevin Xie, Homanga Bharadhwaj, Florian Shkurti

PDF

Open Access 1 Repo

TL;DR

HyperCRL introduces a continual learning approach for model-based reinforcement learning using task-conditional hypernetworks, enabling efficient lifelong robot learning without revisiting all past data.

Contribution

It proposes a novel continual learning method with fixed-capacity hypernetworks that efficiently models non-stationary dynamics in sequential tasks.

Findings

01

Outperforms existing continual learning methods with fixed-capacity networks.

02

Achieves competitive results with methods that store all past experiences.

03

Effective in robot locomotion and manipulation tasks like pushing and door opening.

Abstract

Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is periodically re-trained from scratch on state transition experience collected from the beginning of environment interactions. This implies that the time required to train the dynamics model - and the pause required between plan executions - grows linearly with the size of the collected experience. We argue that this is too slow for lifelong robot learning and propose HyperCRL, a method that continually learns the encountered dynamics in a sequence of tasks using task-conditional hypernetworks. Our method has three main attributes: first, it includes dynamics learning sessions that do not revisit training data from previous tasks, so it only needs to store…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rvl-lab-utoronto/HyperCRL
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning