Self-Supervised Learning of Pretext-Invariant Representations

Ishan Misra; Laurens van der Maaten

arXiv:1912.01991·cs.CV·December 5, 2019

Self-Supervised Learning of Pretext-Invariant Representations

Ishan Misra, Laurens van der Maaten

PDF

5 Repos

TL;DR

PIRL is a self-supervised learning method that learns invariant image representations, significantly improving semantic quality and outperforming supervised pre-training on benchmarks like object detection.

Contribution

We introduce PIRL, a novel self-supervised approach that enforces invariance to transformations, enhancing semantic representation quality beyond existing methods.

Findings

01

PIRL achieves state-of-the-art results on self-supervised benchmarks.

02

PIRL outperforms supervised pre-training in object detection tasks.

03

PIRL improves the semantic invariance of learned representations.

Abstract

The goal of self-supervised learning from images is to construct image representations that are semantically meaningful via pretext tasks that do not require semantic annotations for a large training set of images. Many pretext tasks lead to representations that are covariant with image transformations. We argue that, instead, semantic representations ought to be invariant under such transformations. Specifically, we develop Pretext-Invariant Representation Learning (PIRL, pronounced as "pearl") that learns invariant representations based on pretext tasks. We use PIRL with a commonly used pretext task that involves solving jigsaw puzzles. We find that PIRL substantially improves the semantic quality of the learned image representations. Our approach sets a new state-of-the-art in self-supervised learning from images on several popular benchmarks for self-supervised learning. Despite…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsNPID · PIRL · NPID++ · Jigsaw · Average Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block