Fine-Tuning Regimes Define Distinct Continual Learning Problems
Paul-Tiberiu Iordache, Elena Burceanu

TL;DR
This paper demonstrates that the choice of fine-tuning regime significantly impacts continual learning evaluations, affecting method rankings, forgetting, and update magnitudes, thus advocating for regime-aware benchmarking.
Contribution
It formalizes adaptation regimes as projected optimization over trainable subspaces and shows their influence on continual learning performance and comparisons.
Findings
Method rankings vary across different fine-tuning regimes.
Deeper regimes lead to larger updates and more forgetting.
Evaluation protocols should consider trainable depth as an explicit factor.
Abstract
Continual learning (CL) studies how models acquire tasks sequentially while retaining previously learned knowledge. Despite substantial progress in benchmarking CL methods, comparative evaluations typically keep the fine-tuning regime fixed. In this paper, we argue that the fine-tuning regime, defined by the trainable parameter subspace, is itself a key evaluation variable. We formalize adaptation regimes as projected optimization over fixed trainable subspaces, showing that changing the trainable depth alters the effective update signal through which both current task fitting and knowledge preservation operate. This analysis motivates the hypothesis that method comparisons need not be invariant across regimes. We test this hypothesis in task incremental CL, five trainable depth regimes, and four standard methods: online EWC, LwF, SI, and GEM. Across five benchmark datasets, namely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
