A Unifying Bayesian View of Continual Learning
Sebastian Farquhar, Yarin Gal

TL;DR
This paper introduces a unifying Bayesian framework for continual learning that combines prior- and likelihood-focused methods, addressing challenges with posterior approximations in Bayesian neural networks.
Contribution
It proposes a new likelihood-focused approach and unifies it with prior-focused methods into a single Bayesian continual learning framework.
Findings
Likelihood-focused methods improve continual learning performance.
Unified framework bridges prior and likelihood approaches.
Addresses intractability of exact posterior evaluation.
Abstract
Some machine learning applications require continual learning - where data comes in a sequence of datasets, each is used for training and then permanently discarded. From a Bayesian perspective, continual learning seems straightforward: Given the model posterior one would simply use this as the prior for the next task. However, exact posterior evaluation is intractable with many models, especially with Bayesian neural networks (BNNs). Instead, posterior approximations are often sought. Unfortunately, when posterior approximations are used, prior-focused approaches do not succeed in evaluations designed to capture properties of realistic continual learning use cases. As an alternative to prior-focused methods, we introduce a new approximate Bayesian derivation of the continual learning loss. Our loss does not rely on the posterior from earlier tasks, and instead adapts the model itself…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and ELM
