Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill, Florian Strub, Florent Altch\'e, Corentin Tallec,, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires,, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu,, R\'emi Munos, Michal Valko

TL;DR
BYOL introduces a self-supervised learning method using two networks that learn from each other without negative pairs, achieving state-of-the-art image classification accuracy.
Contribution
It presents a novel self-supervised learning approach that outperforms previous methods relying on negative pairs, using a bootstrap mechanism with online and target networks.
Findings
Achieves 74.3% top-1 accuracy on ImageNet with ResNet-50
Performs on par or better on transfer and semi-supervised benchmarks
Does not require negative pairs for effective learning
Abstract
We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an augmented view of an image, we train the online network to predict the target network representation of the same image under a different augmented view. At the same time, we update the target network with a slow-moving average of the online network. While state-of-the art methods rely on negative pairs, BYOL achieves a new state of the art without them. BYOL reaches top-1 classification accuracy on ImageNet using a linear evaluation with a ResNet-50 architecture and with a larger ResNet. We show that BYOL performs on par or better than the current state of the art on both transfer and semi-supervised benchmarks. Our implementation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis
Methods1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Bottleneck Residual Block · Batch Normalization · Average Pooling · Max Pooling · Global Average Pooling · Residual Connection · Kaiming Initialization · Convolution
