Continual Model-Based Reinforcement Learning with Hypernetworks
Yizhou Huang, Kevin Xie, Homanga Bharadhwaj, Florian Shkurti

TL;DR
HyperCRL introduces a continual learning approach for model-based reinforcement learning using task-conditional hypernetworks, enabling efficient lifelong robot learning without revisiting all past data.
Contribution
It proposes a novel continual learning method with fixed-capacity hypernetworks that efficiently models non-stationary dynamics in sequential tasks.
Findings
Outperforms existing continual learning methods with fixed-capacity networks.
Achieves competitive results with methods that store all past experiences.
Effective in robot locomotion and manipulation tasks like pushing and door opening.
Abstract
Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is periodically re-trained from scratch on state transition experience collected from the beginning of environment interactions. This implies that the time required to train the dynamics model - and the pause required between plan executions - grows linearly with the size of the collected experience. We argue that this is too slow for lifelong robot learning and propose HyperCRL, a method that continually learns the encountered dynamics in a sequence of tasks using task-conditional hypernetworks. Our method has three main attributes: first, it includes dynamics learning sessions that do not revisit training data from previous tasks, so it only needs to store…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning
