Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models

Lukas Thede; Karsten Roth; Olivier J. H\'enaff; Matthias Bethge; Zeynep Akata

arXiv:2406.09384·cs.LG·September 23, 2025

Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models

Lukas Thede, Karsten Roth, Olivier J. H\'enaff, Matthias Bethge, Zeynep Akata

PDF

Open Access

TL;DR

This paper critically examines rehearsal-free continual learning with pretrained models, revealing that many advanced methods rely on simple baselines and highlighting the importance of understanding their true effectiveness and relation to traditional techniques.

Contribution

The study provides a comprehensive analysis of P-RFCL methods, demonstrating their reliance on simple baselines and clarifying their relationship with classical continual learning approaches.

Findings

01

P-RFCL methods often collapse to simple PEFT baselines.

02

Input-conditional query mechanisms do not significantly improve performance.

03

Standard CL techniques like EWC or SI remain relevant in this context.

Abstract

With the advent and recent ubiquity of foundation models, continual learning (CL) has recently shifted from continual training from scratch to the continual adaptation of pretrained models, seeing particular success on rehearsal-free CL benchmarks (RFCL). To achieve this, most proposed methods adapt and restructure parameter-efficient finetuning techniques (PEFT) to suit the continual nature of the problem. Based most often on input-conditional query-mechanisms or regularizations on top of prompt- or adapter-based PEFT, these PEFT-style RFCL (P-RFCL) approaches report peak performances; often convincingly outperforming existing CL techniques. However, on the other end, critical studies have recently highlighted competitive results by training on just the first task or via simple non-parametric baselines. Consequently, questions arise about the relationship between methodological choices…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsElastic Weight Consolidation