Sparse Adapter Fusion for Continual Learning in NLP
Min Zeng, Xi Chen, Haiqin Yang, Yike Guo

TL;DR
This paper introduces SAFM, a novel method for continual learning in NLP that dynamically fuses adapters to improve knowledge sharing, reduce parameter usage, and prevent catastrophic forgetting across tasks.
Contribution
SAFM is a new approach that adaptively decides to reuse, add, or fuse adapters, enhancing parameter efficiency and knowledge retention in continual NLP learning.
Findings
SAFM outperforms state-of-the-art methods in NLP continual learning tasks.
Achieves comparable performance with less than 60% of the parameters.
Effectively mitigates catastrophic forgetting across diverse tasks.
Abstract
Continual learning in natural language processing plays a crucial role in adapting to evolving data and preventing catastrophic forgetting. Despite significant progress, existing methods still face challenges, such as inefficient parameter reuse across tasks, risking catastrophic forgetting when tasks are dissimilar, and the unnecessary introduction of new parameters for each task, which hampers knowledge sharing among similar tasks. To tackle these issues, we propose a Sparse Adapter Fusion Method (SAFM), which dynamically fuses old and new adapters to address these challenges. SAFM operates in two stages: the decision stage and the tuning stage. In the decision stage, SAFM determines whether to incorporate a new adapter, reuse an existing one, or add an empty adapter. The architecture search procedure, designed to prioritize reusing or adding empty adapters, minimizes parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
