Beneficial perturbation network for continual learning

Shixian Wen; Laurent Itti

arXiv:1906.10528·cs.LG·June 26, 2019·1 cites

Beneficial perturbation network for continual learning

Shixian Wen, Laurent Itti

PDF

Open Access

TL;DR

The paper introduces Beneficial Perturbation Networks (BPN), a novel continual learning method that uses task-specific biasing units to prevent catastrophic forgetting without storing past data, showing strong results on standard datasets.

Contribution

BPN is a new continual learning approach that employs task-dependent biasing units and beneficial perturbations, improving efficiency and avoiding data storage.

Findings

01

BPN outperforms state-of-the-art methods on MNIST, CIFAR-10, CIFAR-100.

02

BPN does not require storing previous task data.

03

BPN is more parameter-efficient than network expansion methods.

Abstract

Sequential learning of multiple tasks in artificial neural networks using gradient descent leads to catastrophic forgetting, whereby previously learned knowledge is erased during learning of new, disjoint knowledge. Here, we propose a fundamentally new type of method - Beneficial Perturbation Network (BPN). We add task-dependent memory (biasing) units to allow the network to operate in different regimes for different tasks. We compute the most beneficial directions to train these units, in a manner inspired by recent work on adversarial examples. At test time, beneficial perturbations for a given task bias the network toward that task to overcome catastrophic forgetting. BPN is not only more parameter-efficient than network expansion methods, but also does not need to store any data from previous tasks, in contrast with episodic memory methods. Experiments on variants of the MNIST,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition