TL;DR
This paper unifies various self-supervised learning methods into a single gradient-based framework, revealing their similarities and proposing a new effective gradient form called UniGrad that achieves state-of-the-art results.
Contribution
The work introduces a unified gradient framework for self-supervised learning methods and proposes UniGrad, a simple gradient form that outperforms existing approaches without needing additional components.
Findings
Little performance gap among different methods when unified
Momentum encoder significantly boosts performance
UniGrad achieves state-of-the-art results without extra modules
Abstract
Self-supervised learning has shown its great potential to extract powerful visual representations without human annotations. Various works are proposed to deal with self-supervised learning from different perspectives: (1) contrastive learning methods (e.g., MoCo, SimCLR) utilize both positive and negative samples to guide the training direction; (2) asymmetric network methods (e.g., BYOL, SimSiam) get rid of negative samples via the introduction of a predictor network and the stop-gradient operation; (3) feature decorrelation methods (e.g., Barlow Twins, VICReg) instead aim to reduce the redundancy between feature dimensions. These methods appear to be quite different in the designed loss functions from various motivations. The final accuracy numbers also vary, where different networks and tricks are utilized in different works. In this work, we demonstrate that these methods can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning · Bootstrap Your Own Latent · InfoNCE · Barlow Twins · Batch Normalization · Momentum Contrast
