Self-Supervised Models are Continual Learners
Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa, Ricci, Karteek Alahari, Julien Mairal

TL;DR
This paper introduces a framework that adapts self-supervised learning methods for continual learning, significantly enhancing representation quality and compatibility with existing objectives, with minimal hyperparameter tuning.
Contribution
It proposes converting self-supervised loss functions into distillation mechanisms for continual learning by adding a predictor network, enabling effective lifelong visual representation learning.
Findings
Improved quality of learned representations in continual learning scenarios
Compatibility with multiple self-supervised objectives
Requires minimal hyperparameter tuning
Abstract
Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale. However, their efficacy is catastrophically reduced in a Continual Learning (CL) scenario where data is presented to the model sequentially. In this paper, we show that self-supervised loss functions can be seamlessly converted into distillation mechanisms for CL by adding a predictor network that maps the current state of the representations to their past state. This enables us to devise a framework for Continual self-supervised visual representation Learning that (i) significantly improves the quality of the learned representations, (ii) is compatible with several state-of-the-art self-supervised objectives, and (iii) needs little to no hyperparameter tuning. We demonstrate the effectiveness of our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
