Self-Supervised Ranking for Representation Learning

Ali Varamesh; Ali Diba; Tinne Tuytelaars; Luc Van Gool

arXiv:2010.07258·cs.CV·November 23, 2020·5 cites

Self-Supervised Ranking for Representation Learning

Ali Varamesh, Ali Diba, Tinne Tuytelaars, Luc Van Gool

PDF

Open Access

TL;DR

This paper introduces S2R2, a self-supervised image representation learning framework that uses ranking instead of contrastive pairs, leading to improved performance on diverse datasets without relying on object-centric data.

Contribution

The paper proposes a novel ranking-based self-supervised learning framework, S2R2, which outperforms existing contrastive methods and reduces dependence on curated datasets.

Findings

01

S2R2 outperforms SimCLR and SwAV on STL10 and MS-COCO datasets.

02

S2R2 is conceptually simpler and easier to implement.

03

S2R2 is more effective on diverse scenes, reducing the need for object-centric datasets.

Abstract

We present a new framework for self-supervised representation learning by formulating it as a ranking problem in an image retrieval context on a large number of random views (augmentations) obtained from images. Our work is based on two intuitions: first, a good representation of images must yield a high-quality image ranking in a retrieval task; second, we would expect random views of an image to be ranked closer to a reference view of that image than random views of other images. Hence, we model representation learning as a learning to rank problem for image retrieval. We train a representation encoder by maximizing average precision (AP) for ranking, where random views of an image are considered positively related, and that of the other images considered negatives. The new framework, dubbed S2R2, enables computing a global objective on multiple views, compared to the local objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

MethodsContrastive Learning · 1x1 Convolution · Average Pooling · Batch Normalization · Residual Connection · Residual Block · Bottleneck Residual Block · Max Pooling · Convolution · Global Average Pooling