On Sequential Maximum a Posteriori Inference for Continual Learning

Menghao Waiyan William Zhu; Ercan Engin Kuruo\u{g}lu

arXiv:2405.16498·cs.LG·March 11, 2025

On Sequential Maximum a Posteriori Inference for Continual Learning

Menghao Waiyan William Zhu, Ercan Engin Kuruo\u{g}lu

PDF

Open Access 1 Repo

TL;DR

This paper introduces two novel methods for continual learning based on sequential MAP inference, demonstrating their effectiveness on classical and image classification tasks with fixed feature extractors.

Contribution

It formulates continual learning as recursive loss function approximation and proposes two coreset-free methods, one quadratic and one neural, for this purpose.

Findings

01

Neural consolidation performs well on small input dimension tasks.

02

Autodiff quadratic consolidation is effective on image tasks with fixed features.

03

Both methods achieve performance comparable to joint MAP training.

Abstract

We formulate sequential maximum a posteriori inference as a recursion of loss functions and reduce the problem of continual learning to approximating the previous loss function. We then propose two coreset-free methods: autodiff quadratic consolidation, which uses an accurate and full quadratic approximation, and neural consolidation, which uses a neural network approximation. These methods are not scalable with respect to the neural network size, and we study them for classification tasks in combination with a fixed pre-trained feature extractor. We also introduce simple but challenging classical task sequences based on Iris and Wine datasets. We find that neural consolidation performs well in the classical task sequences, where the input dimension is small, while autodiff quadratic consolidation performs consistently well in image task sequences with a fixed pre-trained feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

blackblitz/bcl
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning