CLR: Channel-wise Lightweight Reprogramming for Continual Learning
Yunhao Ge, Yuecheng Li, Shuo Ni, Jiaping Zhao, Ming-Hsuan Yang,, Laurent Itti

TL;DR
The paper introduces CLR, a lightweight channel-wise reprogramming method for CNNs that effectively mitigates catastrophic forgetting in continual learning by using task-specific reprogramming parameters while keeping most of the model fixed.
Contribution
CLR proposes a novel, parameter-efficient reprogramming approach that enables CNNs to learn new tasks sequentially without forgetting previous tasks.
Findings
CLR requires less than 0.6% additional parameters per new task.
CLR outperforms 13 state-of-the-art continual learning methods on 53 datasets.
The method maintains high performance with minimal parameter overhead.
Abstract
Continual learning aims to emulate the human ability to continually accumulate knowledge over sequential tasks. The main challenge is to maintain performance on previously learned tasks after learning new tasks, i.e., to avoid catastrophic forgetting. We propose a Channel-wise Lightweight Reprogramming (CLR) approach that helps convolutional neural networks (CNNs) overcome catastrophic forgetting during continual learning. We show that a CNN model trained on an old task (or self-supervised proxy task) could be ``reprogrammed" to solve a new task by using our proposed lightweight (very cheap) reprogramming parameter. With the help of CLR, we have a better stability-plasticity trade-off to solve continual learning problems: To maintain stability and retain previous task ability, we use a common task-agnostic immutable part as the shared ``anchor" parameter set. We then add task-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
CLR: Channel-wise Lightweight Reprogramming for Continual Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
