Contrasting Contrastive Self-Supervised Representation Learning   Pipelines

Klemen Kotar; Gabriel Ilharco; Ludwig Schmidt; Kiana Ehsani; Roozbeh; Mottaghi

arXiv:2103.14005·cs.CV·August 20, 2021

Contrasting Contrastive Self-Supervised Representation Learning Pipelines

Klemen Kotar, Gabriel Ilharco, Ludwig Schmidt, Kiana Ehsani, Roozbeh, Mottaghi

PDF

Open Access 1 Repo

TL;DR

This paper provides a comprehensive analysis of contrastive self-supervised learning methods, examining how training algorithms, datasets, and tasks influence performance, supported by extensive experiments and a new benchmark.

Contribution

It offers a detailed empirical study of contrastive self-supervised learning, including a large-scale experiment analysis and the introduction of the Visual Representation Benchmark (ViRB).

Findings

01

Contrastive models often outperform supervised models on downstream tasks.

02

Pre-training datasets significantly impact end task performance.

03

Current benchmarks may not fully capture model capabilities.

Abstract

In the past few years, we have witnessed remarkable breakthroughs in self-supervised representation learning. Despite the success and adoption of representations learned through this paradigm, much is yet to be understood about how different training methods and datasets influence performance on downstream tasks. In this paper, we analyze contrastive approaches as one of the most successful and popular variants of self-supervised representation learning. We perform this analysis from the perspective of the training algorithms, pre-training datasets and end tasks. We examine over 700 training experiments including 30 encoders, 4 pre-training datasets and 20 diverse downstream tasks. Our experiments address various questions regarding the performance of self-supervised models compared to their supervised counterparts, current benchmarks used for evaluation, and the effect of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/ViRB
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition