Continual Learning with Adaptive Weights (CLAW)

Tameem Adel; Han Zhao; Richard E. Turner

arXiv:1911.09514·stat.ML·June 17, 2020·27 cites

Continual Learning with Adaptive Weights (CLAW)

Tameem Adel, Han Zhao, Richard E. Turner

PDF

Open Access

TL;DR

CLAW is a novel continual learning method that adaptively determines which network parts to share across tasks, effectively balancing transfer learning and catastrophic forgetting, and achieving state-of-the-art results.

Contribution

Introduces CLAW, a probabilistic and variational inference-based approach that adaptively shares network components across tasks in continual learning.

Findings

01

Achieves state-of-the-art performance on six benchmarks.

02

Effectively mitigates catastrophic forgetting.

03

Balances transfer learning and model size.

Abstract

Approaches to continual learning aim to successfully learn a set of related tasks that arrive in an online manner. Recently, several frameworks have been developed which enable deep learning to be deployed in this learning scenario. A key modelling decision is to what extent the architecture should be shared across tasks. On the one hand, separately modelling each task avoids catastrophic forgetting but it does not support transfer learning and leads to large models. On the other hand, rigidly specifying a shared component and a task-specific part enables task transfer and limits the model size, but it is vulnerable to catastrophic forgetting and restricts the form of task-transfer that can occur. Ideally, the network should adaptively identify which parts of the network to share in a data driven way. Here we introduce such an approach called Continual Learning with Adaptive Weights…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Machine Learning and Algorithms