Deep Online Learning via Meta-Learning: Continual Adaptation for Model-Based RL
Anusha Nagabandi, Chelsea Finn, Sergey Levine

TL;DR
This paper introduces MOLe, a meta-learning approach enabling deep neural networks to perform continual online adaptation in non-stationary environments, particularly improving model-based reinforcement learning in dynamic real-world scenarios.
Contribution
The paper develops a meta-learning framework for online deep learning that effectively adapts to changing tasks and distributions, enhancing model-based RL performance.
Findings
MOLe outperforms prior methods in non-stationary environments.
Meta-training improves online adaptation of deep models.
Effective continual learning demonstrated in diverse RL tasks.
Abstract
Humans and animals can learn complex predictive models that allow them to accurately and reliably reason about real-world phenomena, and they can adapt such models extremely quickly in the face of unexpected changes. Deep neural network models allow us to represent very complex functions, but lack this capacity for rapid online adaptation. The goal in this paper is to develop a method for continual online learning from an incoming stream of data, using deep neural network models. We formulate an online learning procedure that uses stochastic gradient descent to update model parameters, and an expectation maximization algorithm with a Chinese restaurant process prior to develop and maintain a mixture of models to handle non-stationary task distributions. This allows for all models to be adapted as necessary, with new models instantiated for task changes and old models recalled when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Reinforcement Learning in Robotics
MethodsStochastic Gradient Descent
