A Unifying Bayesian View of Continual Learning

Sebastian Farquhar; Yarin Gal

arXiv:1902.06494·stat.ML·February 19, 2019·48 cites

A Unifying Bayesian View of Continual Learning

Sebastian Farquhar, Yarin Gal

PDF

Open Access 2 Repos

TL;DR

This paper introduces a unifying Bayesian framework for continual learning that combines prior- and likelihood-focused methods, addressing challenges with posterior approximations in Bayesian neural networks.

Contribution

It proposes a new likelihood-focused approach and unifies it with prior-focused methods into a single Bayesian continual learning framework.

Findings

01

Likelihood-focused methods improve continual learning performance.

02

Unified framework bridges prior and likelihood approaches.

03

Addresses intractability of exact posterior evaluation.

Abstract

Some machine learning applications require continual learning - where data comes in a sequence of datasets, each is used for training and then permanently discarded. From a Bayesian perspective, continual learning seems straightforward: Given the model posterior one would simply use this as the prior for the next task. However, exact posterior evaluation is intractable with many models, especially with Bayesian neural networks (BNNs). Instead, posterior approximations are often sought. Unfortunately, when posterior approximations are used, prior-focused approaches do not succeed in evaluations designed to capture properties of realistic continual learning use cases. As an alternative to prior-focused methods, we introduce a new approximate Bayesian derivation of the continual learning loss. Our loss does not rely on the posterior from earlier tasks, and instead adapts the model itself…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and ELM