Plasticity-Optimized Complementary Networks for Unsupervised Continual   Learning

Alex Gomez-Villa; Bartlomiej Twardowski; Kai Wang; Joost van de Weijer

arXiv:2309.06086·cs.LG·September 13, 2023

Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning

Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel approach for unsupervised continual learning that trains expert networks to improve plasticity and combines them to prevent forgetting, outperforming existing exemplar-free methods.

Contribution

It proposes training expert networks focused on new tasks and combining them with previous networks to enhance continual learning without exemplars.

Findings

01

Outperforms other exemplar-free CURL methods in multiple task settings.

02

Effectively adapts to semi-supervised continual learning, surpassing other methods.

03

Achieves comparable results to exemplar-based approaches in some scenarios.

Abstract

Continuous unsupervised representation learning (CURL) research has greatly benefited from improvements in self-supervised learning (SSL) techniques. As a result, existing CURL methods using SSL can learn high-quality representations without any labels, but with a notable performance drop when learning on a many-tasks data stream. We hypothesize that this is caused by the regularization losses that are imposed to prevent forgetting, leading to a suboptimal plasticity-stability trade-off: they either do not adapt fully to the incoming data (low plasticity), or incur significant forgetting when allowed to fully adapt to a new SSL pretext-task (low stability). In this work, we propose to train an expert network that is relieved of the duty of keeping the previous knowledge and can focus on performing optimally on the new tasks (optimizing plasticity). In the second phase, we combine this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alviur/pocon_wacv2024
pytorchOfficial

Videos

Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning· youtube

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsFocus