ImpressLearn: Continual Learning via Combined Task Impressions
Dhrupad Bhardwaj, Julia Kempe, Artem Vysogorets, Angela M. Teng, and, Evaristus C. Ezekwem

TL;DR
ImpressLearn introduces a continual learning method that combines task-specific supermasks linearly, enabling neural networks to learn multiple tasks efficiently, retain previous knowledge, and adapt quickly to new or unseen tasks without extensive parameter overhead.
Contribution
This work demonstrates that linearly combining a small set of task impressions can match dedicated masks' performance, reducing parameter costs and enabling task-agnostic inference.
Findings
Achieves high accuracy on multiple tasks with fewer parameters.
Effectively adapts to unseen tasks using combined impressions.
Operates without task labels at inference time.
Abstract
This work proposes a new method to sequentially train deep neural networks on multiple tasks without suffering catastrophic forgetting, while endowing it with the capability to quickly adapt to unseen tasks. Starting from existing work on network masking (Wortsman et al., 2020), we show that simply learning a linear combination of a small number of task-specific supermasks (impressions) on a randomly initialized backbone network is sufficient to both retain accuracy on previously learned tasks, as well as achieve high accuracy on unseen tasks. In contrast to previous methods, we do not require to generate dedicated masks or contexts for each new task, instead leveraging transfer learning to keep per-task parameter overhead small. Our work illustrates the power of linearly combining individual impressions, each of which fares poorly in isolation, to achieve performance comparable to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
