JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models

Alexandra Dragomir; Ioana Pintilie; Antonio Barbalau; Marius Dragoi; Florin Brad; Cristian Daniel Paduraru; Alexandru Tifrea; Elena Burceanu; Radu Tudor Ionescu

arXiv:2604.16171·cs.LG·April 29, 2026

JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models

Alexandra Dragomir, Ioana Pintilie, Antonio Barbalau, Marius Dragoi, Florin Brad, Cristian Daniel Paduraru, Alexandru Tifrea, Elena Burceanu, Radu Tudor Ionescu

PDF

TL;DR

JumpLoRA introduces a sparsity-inducing framework for LoRA adapters in continual learning, enhancing task isolation and outperforming existing methods like ELLA in large language models.

Contribution

It proposes JumpLoRA, a novel approach that adaptively induces sparsity in LoRA adapters using JumpReLU gating, improving continual learning performance.

Findings

01

JumpLoRA significantly boosts IncLoRA performance.

02

Outperforms the state-of-the-art CL method ELLA.

03

Achieves dynamic parameter isolation to prevent task interference.

Abstract

Adapter-based methods have become a cost-effective approach to continual learning (CL) for Large Language Models (LLMs), by sequentially learning a low-rank update matrix for each task. To mitigate catastrophic forgetting, state-of-the-art approaches impose constraints on new adapters with respect to the previous ones, by targeting either subspace or coordinate-wise interference. In this paper, we propose JumpLoRA, a novel framework to adaptively induce sparsity in the Low-Rank Adaptation (LoRA) blocks through the use of JumpReLU gating. The method achieves dynamic parameter isolation, which helps prevent task interference. We demonstrate that our method is highly modular and compatible with LoRA-based CL approaches. Specifically, it significantly boosts the performance of IncLoRA and outperforms the leading state-of-the-art CL method, ELLA.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.