DATA: Decomposed Attention-based Task Adaptation for Rehearsal-Free Continual Learning
Huanxuan Liao, Shizhu He, Yupu Hao, Jun Zhao, Kang Liu

TL;DR
The paper introduces DATA, a decomposed attention-based method for rehearsal-free continual learning that balances plasticity and knowledge retention by dynamically adjusting task-specific adapters, achieving state-of-the-art results.
Contribution
It proposes a novel decomposed attention mechanism with task-specific adapters that adaptively balance plasticity and stability without rehearsal, advancing continual learning techniques.
Findings
Achieves state-of-the-art performance on three benchmarks.
Significantly improves model plasticity and reduces catastrophic forgetting.
Effective dynamic adjustment of adapter weights based on task relevance.
Abstract
Continual learning (CL) is essential for Large Language Models (LLMs) to adapt to evolving real-world demands, yet they are susceptible to catastrophic forgetting (CF). While traditional CF solutions rely on expensive data rehearsal, recent rehearsal-free methods employ model-based and regularization-based strategies to address this issue. However, these approaches often neglect the model's plasticity, which is crucial to achieving optimal performance on newly learned tasks. Consequently, a key challenge in CL is striking a balance between preserving plasticity and mitigating CF. To tackle this challenge, we propose the ecomposed ttention-based ask daptation (DATA), which explicitly decouples and learns both task-specific and task-shared knowledge using high-rank and low-rank task adapters (e.g., LoRAs). For new tasks, DATA dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Analog and Mixed-Signal Circuit Design · Neural Networks and Applications
