Gradient Projection Memory for Continual Learning

Gobinda Saha; Isha Garg; Kaushik Roy

arXiv:2103.09762 · cs.LG · March 18, 2021 · 27 citations

TL;DR

This paper introduces Gradient Projection Memory (GPM), a continual learning method that limits forgetting by restricting gradient updates to directions orthogonal to SVD-derived subspaces deemed important for past tasks. It reports competitive results without network growth or data replay.

Contribution

The paper proposes a continual learning approach in which new tasks are learned with gradient steps orthogonal to subspaces found by SVD of the network's activations, reducing interference with past tasks and hence forgetting.

Findings

01 GPM effectively mitigates forgetting in continual learning tasks.

02 GPM achieves comparable or superior performance to state-of-the-art methods.

03 The approach requires no network growth or data replay.

Abstract

The ability to learn continually without forgetting past tasks is a desired attribute for artificial learning systems. Existing approaches to enable such learning in artificial neural networks usually rely on network growth, importance-based weight updates, or replay of old data from memory. In contrast, we propose a novel approach where a neural network learns new tasks by taking gradient steps in the direction orthogonal to the gradient subspaces deemed important for the past tasks. We find the bases of these subspaces by analyzing network representations (activations) after learning each task with Singular Value Decomposition (SVD) in a single-shot manner and store them in memory as Gradient Projection Memory (GPM). With qualitative and quantitative analyses, we show that such orthogonal gradient descent induces minimum to no interference with the past tasks, thereby…
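The two steps the abstract describes (finding subspace bases by SVD of activations, then stepping orthogonally to them) can be illustrated with a minimal NumPy sketch for a single linear layer. This is not the authors' implementation. The function names `extend_basis` and `project_gradient`, the energy-threshold rule for choosing how many singular vectors to keep, and the (out_features, in_features) weight layout are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np


def extend_basis(activations, basis=None, energy=0.95):
    """Grow an orthonormal basis so it captures `energy` of the activations.

    activations: (features, samples) layer inputs collected after a task.
    basis: (features, k) orthonormal columns stored so far, or None.
    The energy rule is an assumption of this sketch.
    """
    total = np.sum(activations ** 2)
    if basis is None:
        residual, captured = activations, 0.0
    else:
        # Remove the part already explained by the stored basis.
        projected = basis @ (basis.T @ activations)
        residual = activations - projected
        captured = np.sum(projected ** 2)
    if captured >= energy * total:
        k = 0  # stored basis already explains enough
    else:
        U, S, _ = np.linalg.svd(residual, full_matrices=False)
        cumulative = captured + np.cumsum(S ** 2)
        k = min(int(np.searchsorted(cumulative, energy * total)) + 1, len(S))
    if k == 0:
        return basis if basis is not None else np.zeros((activations.shape[0], 0))
    new_dirs = U[:, :k]
    return new_dirs if basis is None else np.hstack([basis, new_dirs])


def project_gradient(grad, basis):
    """Remove the component of a weight gradient lying in the stored subspace.

    grad: (out_features, in_features); basis: (in_features, k).
    """
    return grad - grad @ basis @ basis.T


# Toy demonstration with synthetic data (hypothetical shapes and values).
rng = np.random.default_rng(0)
task1_acts = rng.normal(size=(8, 3)) @ rng.normal(size=(3, 50))  # rank-3 inputs
basis = extend_basis(task1_acts, energy=0.99)

grad = rng.normal(size=(4, 8))           # stand-in gradient from a new task
safe_grad = project_gradient(grad, basis)

task2_acts = rng.normal(size=(8, 50))    # second task spans more directions
basis2 = extend_basis(task2_acts, basis, energy=0.99)
```

Under these assumptions, `safe_grad @ basis` is numerically zero, so a step along `safe_grad` leaves the layer's response to inputs inside the stored subspace unchanged. The basis only grows across tasks, which is why the memory cost depends on how many directions each task needs rather than on stored data.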

Code & Models

Repositories

sahagobinda/GPM (PyTorch, official)

Videos

Gradient Projection Memory for Continual Learning · SlidesLive

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications