A Simple Framework for Contrastive Learning of Visual Representations
Ting Chen, Simon Kornblith, Mohammad Norouzi, Geoffrey Hinton

TL;DR
SimCLR introduces a straightforward contrastive learning framework that significantly improves visual representation quality without complex architectures, leveraging data augmentation, nonlinear transformations, and large batch training.
Contribution
The paper simplifies contrastive self-supervised learning, systematically studies key components, and achieves state-of-the-art results on ImageNet with a simple framework.
Findings
Data augmentation composition is crucial for effective learning.
Learnable nonlinear transformation enhances representation quality.
Larger batch sizes and more training steps improve contrastive learning.
Abstract
This paper presents SimCLR: a simple framework for contrastive learning of visual representations. We simplify recently proposed contrastive self-supervised learning algorithms without requiring specialized architectures or a memory bank. In order to understand what enables the contrastive prediction tasks to learn useful representations, we systematically study the major components of our framework. We show that (1) composition of data augmentations plays a critical role in defining effective predictive tasks, (2) introducing a learnable nonlinear transformation between the representation and the contrastive loss substantially improves the quality of the learned representations, and (3) contrastive learning benefits from larger batch sizes and more training steps compared to supervised learning. By combining these findings, we are able to considerably outperform previous methods for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗microsoft/BiomedVLP-CXR-BERT-specializedmodel· 117k dl· ♡ 34117k dl♡ 34
- 🤗wgcban/mix-btmodel· ♡ 1♡ 1
- 🤗lightly-ai/simclrv1-imagenet1k-resnet50-1xmodel
- 🤗lightly-ai/simclrv1-imagenet1k-resnet50-2xmodel
- 🤗lightly-ai/simclrv1-imagenet1k-resnet50-4xmodel· ♡ 1♡ 1
- 🤗lightly-ai/in1k-benchmark-logsmodel
- 🤗AnonRes/PrimusM-OpenMind-MAEmodel· 1 dl1 dl
- 🤗AnonRes/ResEncL-OpenMind-MAEmodel· 19 dl· ♡ 119 dl♡ 1
- 🤗AnonRes/ResEncL-OpenMind-S3Dmodel· 11 dl11 dl
- 🤗AnonRes/ResEncL-OpenMind-VFmodel· 6 dl6 dl
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsDense Connections · Normalized Temperature-scaled Cross Entropy Loss · Random Resized Crop · Random Gaussian Blur · Color Jitter · Feedforward Network · Linear Warmup With Linear Decay · Weight Decay · LARS · SimCLR
