A Simple Framework for Contrastive Learning of Visual Representations

Ting Chen; Simon Kornblith; Mohammad Norouzi; Geoffrey Hinton

arXiv:2002.05709 · cs.LG · July 2, 2020 · 7.3k cites

TL;DR

SimCLR introduces a straightforward contrastive learning framework that significantly improves visual representation quality without complex architectures, leveraging data augmentation, nonlinear transformations, and large batch training.

Contribution

The paper simplifies contrastive self-supervised learning, systematically studies key components, and achieves state-of-the-art results on ImageNet with a simple framework.

Findings

1. Data augmentation composition is crucial for effective learning.

2. Learnable nonlinear transformation enhances representation quality.

3. Larger batch sizes and more training steps improve contrastive learning.

Abstract

This paper presents SimCLR: a simple framework for contrastive learning of visual representations. We simplify recently proposed contrastive self-supervised learning algorithms without requiring specialized architectures or a memory bank. In order to understand what enables the contrastive prediction tasks to learn useful representations, we systematically study the major components of our framework. We show that (1) composition of data augmentations plays a critical role in defining effective predictive tasks, (2) introducing a learnable nonlinear transformation between the representation and the contrastive loss substantially improves the quality of the learned representations, and (3) contrastive learning benefits from larger batch sizes and more training steps compared to supervised learning. By combining these findings, we are able to considerably outperform previous methods for…
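The contrastive objective mentioned in the abstract can be illustrated with a short sketch of the normalized temperature-scaled cross entropy (NT-Xent) loss, applied to the output of a small nonlinear projection head. This is a minimal NumPy sketch under stated assumptions, not the authors' implementation: the function names `projection_head` and `nt_xent_loss`, the bias-free two-layer ReLU head, and the default temperature of 0.5 are all hypothetical choices made for illustration.

```python
import numpy as np


def projection_head(h, w1, w2):
    """Nonlinear map g(h) = relu(h @ w1) @ w2.

    Assumption: a bias-free two-layer MLP; w1 and w2 are hypothetical weights.
    """
    return np.maximum(h @ w1, 0.0) @ w2


def nt_xent_loss(z_a, z_b, temperature=0.5):
    """NT-Xent loss for N positive pairs (z_a[i], z_b[i]).

    The other 2N - 2 embeddings in the batch act as negatives for each row.
    The temperature default is an illustrative assumption.
    """
    n = z_a.shape[0]
    z = np.concatenate([z_a, z_b], axis=0)                # (2N, d)
    norms = np.linalg.norm(z, axis=1, keepdims=True)
    z = z / np.maximum(norms, 1e-12)                      # unit length, so dot = cosine
    sim = (z @ z.T) / temperature                         # (2N, 2N) scaled similarities
    np.fill_diagonal(sim, -np.inf)                        # a sample is never its own negative
    # Row i is paired with row i + N, and row i + N with row i.
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(0, n)])
    # Numerically stable log-softmax over each row.
    row_max = sim.max(axis=1, keepdims=True)
    shifted = sim - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(2 * n), pos].mean()


# Usage: correctly paired views give a lower loss than mismatched pairs.
views_a = np.eye(4)
aligned = nt_xent_loss(views_a, np.eye(4))
mismatched = nt_xent_loss(views_a, np.roll(np.eye(4), 1, axis=0))
```

Because every other sample in the batch serves as a negative, the number of negatives per positive pair grows with batch size, which is consistent with the abstract's point that contrastive learning benefits from larger batches.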

Code & Models

Videos

Adrien Gaidon — Advancing ML Research in Autonomous Vehicles· youtube

A Simple Framework for Contrastive Learning of Visual Representations· slideslive

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

Methods: Dense Connections · Normalized Temperature-scaled Cross Entropy Loss · Random Resized Crop · Random Gaussian Blur · Color Jitter · Feedforward Network · Linear Warmup With Linear Decay · Weight Decay · LARS · SimCLR