Continual Learning in Low-rank Orthogonal Subspaces
Arslan Chaudhry, Naeemullah Khan, Puneet K. Dokania, Philip H. S. Torr

TL;DR
This paper introduces a novel continual learning method that learns tasks in orthogonal low-rank subspaces using isometric mappings, significantly reducing interference and outperforming existing approaches on standard benchmarks.
Contribution
It proposes a new approach to continual learning by learning task-specific orthogonal subspaces with isometric mappings, minimizing task interference.
Findings
Achieves strong results over experience-replay baselines.
Outperforms existing methods on standard classification benchmarks.
First to report such results with this approach.
Abstract
In continual learning (CL), a learner is faced with a sequence of tasks, arriving one after the other, and the goal is to remember all the tasks once the continual learning experience is finished. The prior art in CL uses episodic memory, parameter regularization or extensible network structures to reduce interference among tasks, but in the end, all the approaches learn different tasks in a joint vector space. We believe this invariably leads to interference among different tasks. We propose to learn tasks in different (low-rank) vector subspaces that are kept orthogonal to each other in order to minimize interference. Further, to keep the gradients of different tasks coming from these subspaces orthogonal to each other, we learn isometric mappings by posing network training as an optimization problem over the Stiefel manifold. To the best of our understanding, we report, for the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
