Sparse Adapter Fusion for Continual Learning in NLP

Min Zeng; Xi Chen; Haiqin Yang; Yike Guo

arXiv:2602.02502 · cs.LG · February 4, 2026

Open Access

TL;DR

This paper introduces SAFM, a novel method for continual learning in NLP that dynamically fuses adapters to improve knowledge sharing, reduce parameter usage, and prevent catastrophic forgetting across tasks.

Contribution

SAFM is a new approach that adaptively decides to reuse, add, or fuse adapters, enhancing parameter efficiency and knowledge retention in continual NLP learning.

Findings

01. SAFM outperforms state-of-the-art methods in NLP continual learning tasks.

02. SAFM achieves comparable performance with less than 60% of the parameters.

03. SAFM effectively mitigates catastrophic forgetting across diverse tasks.

Abstract

Continual learning in natural language processing plays a crucial role in adapting to evolving data and preventing catastrophic forgetting. Despite significant progress, existing methods still face two challenges: inefficient parameter reuse across tasks, which risks catastrophic forgetting when tasks are dissimilar, and the unnecessary introduction of new parameters for each task, which hampers knowledge sharing among similar tasks. To tackle these issues, we propose a Sparse Adapter Fusion Method (SAFM), which dynamically fuses old and new adapters. SAFM operates in two stages: the decision stage and the tuning stage. In the decision stage, SAFM determines whether to incorporate a new adapter, reuse an existing one, or add an empty adapter. The architecture search procedure, designed to prioritize reusing or adding empty adapters, minimizes parameter…
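
The abstract is cut off before it explains how the decision stage is implemented, so the following is only a toy sketch of the three-way choice it names (reuse an existing adapter, add an empty adapter, or incorporate a new one). The usefulness scores, the `penalty` term, the tie-break order, and every function and constant name here are assumptions made for illustration; none of them come from the paper.

```python
# Toy illustration of a per-layer adapter decision. NOT the authors' algorithm:
# the scoring scheme and penalty are hypothetical stand-ins for the paper's
# architecture search, which the truncated abstract does not describe.

REUSE, EMPTY, NEW = "reuse", "empty", "new"

# Assumption: only a newly added adapter introduces trainable parameters.
PARAM_COST = {REUSE: 0.0, EMPTY: 0.0, NEW: 1.0}


def decide_layer(scores, penalty=0.1):
    """Choose one action for a single layer.

    scores: dict mapping each action to an estimated usefulness
            (higher is better); how such scores would be obtained is
            left open here.
    penalty: assumed cost subtracted per unit of new parameters, which
             biases the choice toward reuse or an empty adapter.
    """
    # max() returns the first maximal item, so ties resolve in this order:
    # reuse, then empty, then new.
    order = [REUSE, EMPTY, NEW]
    return max(order, key=lambda action: scores[action] - penalty * PARAM_COST[action])


def decide_model(per_layer_scores, penalty=0.1):
    """Apply the per-layer decision to every layer of a model."""
    return [decide_layer(scores, penalty) for scores in per_layer_scores]


def count_new_adapters(decisions):
    """Number of layers that would receive a new (parameter-adding) adapter."""
    return sum(1 for action in decisions if action == NEW)


if __name__ == "__main__":
    layers = [
        {REUSE: 0.50, EMPTY: 0.20, NEW: 0.55},  # similar task: reuse nearly as good
        {REUSE: 0.10, EMPTY: 0.10, NEW: 0.90},  # dissimilar task: new adapter needed
        {REUSE: 0.30, EMPTY: 0.30, NEW: 0.30},  # nothing helps: tie goes to reuse
    ]
    decisions = decide_model(layers)
    print(decisions, count_new_adapters(decisions))
```

In this sketch a larger `penalty` pushes more layers toward reuse or empty adapters, which is one simple way to express the stated preference for limiting new parameters.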

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications