Addressing Loss of Plasticity and Catastrophic Forgetting in Continual   Learning

Mohamed Elsayed; A. Rupam Mahmood

arXiv:2404.00781·cs.LG·May 2, 2024·1 cites

Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning

Mohamed Elsayed, A. Rupam Mahmood

PDF

Open Access 1 Repo

TL;DR

This paper introduces UPGD, a novel continual learning method that balances plasticity and stability by perturbing gradients based on unit usefulness, outperforming existing approaches in streaming learning and reinforcement learning tasks.

Contribution

The paper proposes Utility-based Perturbed Gradient Descent (UPGD), a new method that simultaneously addresses catastrophic forgetting and loss of plasticity in continual learning.

Findings

01

UPGD outperforms existing methods in streaming learning benchmarks.

02

UPGD maintains high accuracy over multiple tasks with non-stationarities.

03

In reinforcement learning, UPGD prevents performance drops seen with Adam.

Abstract

Deep representation learning methods struggle with continual learning, suffering from both catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful units. While many methods address these two issues separately, only a few currently deal with both simultaneously. In this paper, we introduce Utility-based Perturbed Gradient Descent (UPGD) as a novel approach for the continual learning of representations. UPGD combines gradient updates with perturbations, where it applies smaller modifications to more useful units, protecting them from forgetting, and larger modifications to less useful units, rejuvenating their plasticity. We use a challenging streaming learning setup where continual learning problems have hundreds of non-stationarities and unknown task boundaries. We show that many existing methods suffer from at least one of the issues,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mohmdelsayed/upgd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHigher Education Learning Practices · Education and Critical Thinking Development · Educator Training and Historical Pedagogy

MethodsEntropy Regularization · Proximal Policy Optimization · Adam