CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning
Marco Paul E. Apolinario, Sakshi Choudhary, Kaushik Roy

TL;DR
CODE-CL introduces a novel conceptor-based gradient projection method that effectively balances knowledge retention and transfer in deep continual learning, outperforming existing approaches on benchmark tasks.
Contribution
The paper presents CODE-CL, a new method using conceptor matrices to adaptively manage correlated tasks, reducing catastrophic forgetting and enhancing forward knowledge transfer.
Findings
Superior performance on continual learning benchmarks
Reduced catastrophic forgetting compared to state-of-the-art methods
Improved forward knowledge transfer in correlated tasks
Abstract
Continual learning (CL) - the ability to progressively acquire and integrate new concepts - is essential to intelligent systems to adapt to dynamic environments. However, deep neural networks struggle with catastrophic forgetting (CF) when learning tasks sequentially, as training for new tasks often overwrites previously learned knowledge. To address this, recent approaches constrain updates to orthogonal subspaces using gradient projection, effectively preserving important gradient directions for previous tasks. While effective in reducing forgetting, these approaches inadvertently hinder forward knowledge transfer (FWT), particularly when tasks are highly correlated. In this work, we propose Conceptor-based gradient projection for Deep Continual Learning (CODE-CL), a novel method that leverages conceptor matrix representations, a form of regularized reconstruction, to adaptively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
