Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models
Lukas Thede, Karsten Roth, Olivier J. H\'enaff, Matthias Bethge, Zeynep Akata

TL;DR
This paper critically examines rehearsal-free continual learning with pretrained models, revealing that many advanced methods rely on simple baselines and highlighting the importance of understanding their true effectiveness and relation to traditional techniques.
Contribution
The study provides a comprehensive analysis of P-RFCL methods, demonstrating their reliance on simple baselines and clarifying their relationship with classical continual learning approaches.
Findings
P-RFCL methods often collapse to simple PEFT baselines.
Input-conditional query mechanisms do not significantly improve performance.
Standard CL techniques like EWC or SI remain relevant in this context.
Abstract
With the advent and recent ubiquity of foundation models, continual learning (CL) has recently shifted from continual training from scratch to the continual adaptation of pretrained models, seeing particular success on rehearsal-free CL benchmarks (RFCL). To achieve this, most proposed methods adapt and restructure parameter-efficient finetuning techniques (PEFT) to suit the continual nature of the problem. Based most often on input-conditional query-mechanisms or regularizations on top of prompt- or adapter-based PEFT, these PEFT-style RFCL (P-RFCL) approaches report peak performances; often convincingly outperforming existing CL techniques. However, on the other end, critical studies have recently highlighted competitive results by training on just the first task or via simple non-parametric baselines. Consequently, questions arise about the relationship between methodological choices…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsElastic Weight Consolidation
