Alleviating catastrophic forgetting using context-dependent gating and   synaptic stabilization

Nicolas Y. Masse; Gregory D. Grant; David J. Freedman

arXiv:1802.01569·cs.LG·April 4, 2019

Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization

Nicolas Y. Masse, Gregory D. Grant, David J. Freedman

PDF

3 Repos

TL;DR

This paper introduces a neuroscience-inspired method combining context-dependent gating with synaptic stabilization to reduce catastrophic forgetting in neural networks, enabling better sequential task learning.

Contribution

It proposes a novel, easy-to-implement approach that uses sparse, task-specific activation patterns alongside weight stabilization to improve continual learning in ANNs.

Findings

01

Enhanced performance on sequential tasks

02

Reduced interference between tasks

03

Low computational overhead

Abstract

Humans and most animals can learn new tasks without forgetting old ones. However, training artificial neural networks (ANNs) on new tasks typically cause it to forget previously learned tasks. This phenomenon is the result of "catastrophic forgetting", in which training an ANN disrupts connection weights that were important for solving previous tasks, degrading task performance. Several recent studies have proposed methods to stabilize connection weights of ANNs that are deemed most important for solving a task, which helps alleviate catastrophic forgetting. Here, drawing inspiration from algorithms that are believed to be implemented in vivo, we propose a complementary method: adding a context-dependent gating signal, such that only sparse, mostly non-overlapping patterns of units are active for any one task. This method is easy to implement, requires little computational overhead, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.