Continual Backprop: Stochastic Gradient Descent with Persistent   Randomness

Shibhansh Dohare; Richard S. Sutton; A. Rupam Mahmood

arXiv:2108.06325·cs.LG·May 6, 2022·1 cites

Continual Backprop: Stochastic Gradient Descent with Persistent Randomness

Shibhansh Dohare, Richard S. Sutton, A. Rupam Mahmood

PDF

Open Access 1 Repo

TL;DR

This paper identifies performance degradation in traditional Backprop during continual learning and introduces Continual Backprop, an algorithm that maintains learning ability over time by injecting persistent randomness.

Contribution

The paper presents Continual Backprop, a novel extension of Backprop that sustains continual learning by injecting persistent randomness, addressing degradation issues.

Findings

01

Backprop performance degrades over time in continual learning.

02

Continual Backprop maintains adaptability in supervised and reinforcement learning.

03

Continual Backprop has similar computational complexity to standard Backprop.

Abstract

The Backprop algorithm for learning in neural networks utilizes two mechanisms: first, stochastic gradient descent and second, initialization with small random weights, where the latter is essential to the effectiveness of the former. We show that in continual learning setups, Backprop performs well initially, but over time its performance degrades. Stochastic gradient descent alone is insufficient to learn continually; the initial randomness enables only initial learning but not continual learning. To the best of our knowledge, ours is the first result showing this degradation in Backprop's ability to learn. To address this degradation in Backprop's plasticity, we propose an algorithm that continually injects random features alongside gradient descent using a new generate-and-test process. We call this the \textit{Continual Backprop} algorithm. We show that, unlike Backprop, Continual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shibhansh/loss-of-plasticity
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Adversarial Robustness in Machine Learning