Unsupervised Representation Learning by Predicting Image Rotations

Spyros Gidaris; Praveer Singh; Nikos Komodakis

arXiv:1803.07728·cs.CV·March 22, 2018·1.5k cites

Unsupervised Representation Learning by Predicting Image Rotations

Spyros Gidaris, Praveer Singh, Nikos Komodakis

PDF

Open Access 5 Repos

TL;DR

This paper introduces a simple yet effective unsupervised learning method for image features by training ConvNets to recognize image rotations, achieving state-of-the-art results across multiple benchmarks.

Contribution

Proposes predicting image rotations as a self-supervised task to learn semantic features without manual labels, significantly improving unsupervised learning performance.

Findings

01

Achieves state-of-the-art unsupervised performance on multiple benchmarks.

02

Close gap between unsupervised and supervised feature learning.

03

Effective transfer of learned features to various vision tasks.

Abstract

Over the last years, deep convolutional neural networks (ConvNets) have transformed the field of computer vision thanks to their unparalleled capacity to learn high level semantic image features. However, in order to successfully learn those features, they usually require massive amounts of manually labeled data, which is both expensive and impractical to scale. Therefore, unsupervised semantic feature learning, i.e., learning without requiring manual annotation effort, is of crucial importance in order to successfully harvest the vast amount of visual data that are available today. In our work we propose to learn image features by training ConvNets to recognize the 2d rotation that is applied to the image that it gets as input. We demonstrate both qualitatively and quantitatively that this apparently simple task actually provides a very powerful supervisory signal for semantic feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

Methods1x1 Convolution · Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax · How do I speak to a person at Expedia?-/+/