Avoiding Forgetting and Allowing Forward Transfer in Continual Learning   via Sparse Networks

Ghada Sokar; Decebal Constantin Mocanu; Mykola Pechenizkiy

arXiv:2110.05329·cs.LG·July 7, 2022·1 cites

Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

Ghada Sokar, Decebal Constantin Mocanu, Mykola Pechenizkiy

PDF

Open Access 1 Repo

TL;DR

This paper introduces AFAF, a continual learning method that uses sparse networks to prevent forgetting and facilitate forward transfer, especially in class-incremental learning with fixed-capacity models.

Contribution

AFAF allocates task-specific sub-networks within a fixed-capacity model to balance knowledge retention and transfer, addressing class ambiguity issues in class-IL.

Findings

01

AFAF outperforms state-of-the-art methods on multiple benchmarks.

02

It effectively balances forgetting prevention and forward transfer.

03

AFAF handles class ambiguities due to semantic similarities.

Abstract

Using task-specific components within a neural network in continual learning (CL) is a compelling strategy to address the stability-plasticity dilemma in fixed-capacity models without access to past data. Current methods focus only on selecting a sub-network for a new task that reduces forgetting of past tasks. However, this selection could limit the forward transfer of relevant past knowledge that helps in future learning. Our study reveals that satisfying both objectives jointly is more challenging when a unified classifier is used for all classes of seen tasks-class-Incremental Learning (class-IL)-as it is prone to ambiguities between classes across tasks. Moreover, the challenge increases when the semantic similarity of classes across tasks increases. To address this challenge, we propose a new CL method, named AFAF, that aims to Avoid Forgetting and Allow Forward transfer in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GhadaSokar/AFAF
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsAttentive Walk-Aggregating Graph Neural Network