TL;DR
This paper introduces LoDA, a task-driven subspace decomposition method for LoRA-based continual learning that enhances knowledge sharing and task isolation, outperforming existing methods.
Contribution
LoDA proposes a novel energy-based decomposition approach that decouples task-shared and task-specific directions for improved continual learning performance.
Findings
LoDA outperforms existing CL methods in experiments.
LoDA effectively separates shared and task-specific knowledge.
LoDA's closed-form recalibration improves feature-level optimization.
Abstract
Continual Learning (CL) requires models to sequentially adapt to new tasks without forgetting old knowledge. Recently, Low-Rank Adaptation (LoRA), a representative Parameter-Efficient Fine-Tuning (PEFT) method, has gained increasing attention in CL. Several LoRA-based CL methods reduce interference across tasks by separating their update spaces, typically building the new space from the estimated null space of past tasks. However, they (i) overlook task-shared directions, which suppresses knowledge transfer, and (ii) fail to capture truly effective task-specific directions since these ``null bases" of old tasks can remain nearly inactive for new task under correlated tasks. To address this, we study LoRA learning capability from a projection energy perspective, and propose Low-rank Decomposition and Adaptation (LoDA). It performs a task-driven decomposition to build general and truly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
