An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates
Albin Soutif--Cormerais, Simone Magistri, Joost van de Weijer, Andew D. Bagdanov

TL;DR
This paper investigates how low-rank adaptation (LoRA) impacts forgetting in pretrained models during incremental learning, revealing that LoRA rank influences forgetting and introduces a novel 'contextual' forgetting behavior in vision transformers.
Contribution
It studies the effect of LoRA rank on model forgetting and introduces the concept of 'contextual' forgetting in vision transformers, a novel observation in continual learning.
Findings
LoRA rank significantly affects forgetting of pretraining and downstream tasks.
Vision transformers exhibit 'contextual' forgetting not seen in residual networks.
LoRA merging impacts model plasticity and retention during incremental updates.
Abstract
Broad, open source availability of large pretrained foundation models on the internet through platforms such as HuggingFace has taken the world of practical deep learning by storm. A classical pipeline for neural network training now typically consists of finetuning these pretrained network on a small target dataset instead of training from scratch. In the case of large models this can be done even on modest hardware using a low rank training technique known as Low-Rank Adaptation (LoRA). While Low Rank training has already been studied in the continual learning setting, existing works often consider storing the learned adapter along with the existing model but rarely attempt to modify the weights of the pretrained model by merging the LoRA with the existing weights after finishing the training of each task. In this article we investigate this setting and study the impact of LoRA rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms
MethodsAdapter
