Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning
Xinyuan Gao, Songlin Dong, Yuhang He, Qiang Wang, and Yihong Gong

TL;DR
This paper introduces Continual Adapter (C-ADA), a novel approach for rehearsal-free continual learning that extends model capabilities with adapters and transfer modules, avoiding key-query matching errors and improving performance.
Contribution
The paper proposes C-ADA, a flexible, parameter-extensible adapter framework with transfer modules and orthogonal loss, advancing rehearsal-free continual learning without relying on old sample storage.
Findings
Outperforms state-of-the-art methods in continual learning tasks.
Reduces training time significantly compared to existing approaches.
Effective in domain-incremental learning scenarios.
Abstract
The problem of Rehearsal-Free Continual Learning (RFCL) aims to continually learn new knowledge while preventing forgetting of the old knowledge, without storing any old samples and prototypes. The latest methods leverage large-scale pre-trained models as the backbone and use key-query matching to generate trainable prompts to learn new knowledge. However, the domain gap between the pre-training dataset and the downstream datasets can easily lead to inaccuracies in key-query matching prompt selection when directly generating queries using the pre-trained model, which hampers learning new knowledge. Thus, in this paper, we propose a beyond prompt learning approach to the RFCL task, called Continual Adapter (C-ADA). It mainly comprises a parameter-extensible continual adapter layer (CAL) and a scaling and shifting (S&S) module in parallel with the pre-trained model. C-ADA flexibly extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Advanced Data Compression Techniques
MethodsAdapter
