Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning
Mohamed Elsayed, A. Rupam Mahmood

TL;DR
This paper introduces UPGD, a novel continual learning method that balances plasticity and stability by perturbing gradients based on unit usefulness, outperforming existing approaches in streaming learning and reinforcement learning tasks.
Contribution
The paper proposes Utility-based Perturbed Gradient Descent (UPGD), a new method that simultaneously addresses catastrophic forgetting and loss of plasticity in continual learning.
Findings
UPGD outperforms existing methods in streaming learning benchmarks.
UPGD maintains high accuracy over multiple tasks with non-stationarities.
In reinforcement learning, UPGD prevents performance drops seen with Adam.
Abstract
Deep representation learning methods struggle with continual learning, suffering from both catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful units. While many methods address these two issues separately, only a few currently deal with both simultaneously. In this paper, we introduce Utility-based Perturbed Gradient Descent (UPGD) as a novel approach for the continual learning of representations. UPGD combines gradient updates with perturbations, where it applies smaller modifications to more useful units, protecting them from forgetting, and larger modifications to less useful units, rejuvenating their plasticity. We use a challenging streaming learning setup where continual learning problems have hundreds of non-stationarities and unknown task boundaries. We show that many existing methods suffer from at least one of the issues,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHigher Education Learning Practices · Education and Critical Thinking Development · Educator Training and Historical Pedagogy
MethodsEntropy Regularization · Proximal Policy Optimization · Adam
