An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates

Albin Soutif--Cormerais; Simone Magistri; Joost van de Weijer; Andew D. Bagdanov

arXiv:2405.18069·cs.LG·May 21, 2025

An Empirical Analysis of Forgetting in Pre-trained Models with Incremental Low-Rank Updates

Albin Soutif--Cormerais, Simone Magistri, Joost van de Weijer, Andew D. Bagdanov

PDF

Open Access 1 Repo

TL;DR

This paper investigates how low-rank adaptation (LoRA) impacts forgetting in pretrained models during incremental learning, revealing that LoRA rank influences forgetting and introduces a novel 'contextual' forgetting behavior in vision transformers.

Contribution

It studies the effect of LoRA rank on model forgetting and introduces the concept of 'contextual' forgetting in vision transformers, a novel observation in continual learning.

Findings

01

LoRA rank significantly affects forgetting of pretraining and downstream tasks.

02

Vision transformers exhibit 'contextual' forgetting not seen in residual networks.

03

LoRA merging impacts model plasticity and retention during incremental updates.

Abstract

Broad, open source availability of large pretrained foundation models on the internet through platforms such as HuggingFace has taken the world of practical deep learning by storm. A classical pipeline for neural network training now typically consists of finetuning these pretrained network on a small target dataset instead of training from scratch. In the case of large models this can be done even on modest hardware using a low rank training technique known as Low-Rank Adaptation (LoRA). While Low Rank training has already been studied in the continual learning setting, existing works often consider storing the learned adapter along with the existing model but rarely attempt to modify the weights of the pretrained model by merging the LoRA with the existing weights after finishing the training of each task. In this article we investigate this setting and study the impact of LoRA rank…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AlbinSou/lora_cl_analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms

MethodsAdapter