Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning
Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer

TL;DR
This paper introduces a novel approach for unsupervised continual learning that trains expert networks to improve plasticity and combines them to prevent forgetting, outperforming existing exemplar-free methods.
Contribution
It proposes training expert networks focused on new tasks and combining them with previous networks to enhance continual learning without exemplars.
Findings
Outperforms other exemplar-free CURL methods in multiple task settings.
Effectively adapts to semi-supervised continual learning, surpassing other methods.
Achieves comparable results to exemplar-based approaches in some scenarios.
Abstract
Continuous unsupervised representation learning (CURL) research has greatly benefited from improvements in self-supervised learning (SSL) techniques. As a result, existing CURL methods using SSL can learn high-quality representations without any labels, but with a notable performance drop when learning on a many-tasks data stream. We hypothesize that this is caused by the regularization losses that are imposed to prevent forgetting, leading to a suboptimal plasticity-stability trade-off: they either do not adapt fully to the incoming data (low plasticity), or incur significant forgetting when allowed to fully adapt to a new SSL pretext-task (low stability). In this work, we propose to train an expert network that is relieved of the duty of keeping the previous knowledge and can focus on performing optimally on the new tasks (optimizing plasticity). In the second phase, we combine this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning· youtube
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsFocus
