Explicit homography estimation improves contrastive self-supervised   learning

David Torpey; Richard Klein

arXiv:2101.04713·cs.CV·January 14, 2021

Explicit homography estimation improves contrastive self-supervised learning

David Torpey, Richard Klein

PDF

Open Access

TL;DR

This paper introduces an explicit homography estimation module as an additional objective in contrastive self-supervised learning, enhancing performance and convergence speed without enforcing invariance to all transformations.

Contribution

It proposes a novel module for homography regression in contrastive learning, improving effectiveness and training efficiency across multiple datasets.

Findings

01

Homography estimation improves contrastive learning performance.

02

Affine transformation performs better than general homography in all tested cases.

03

The method accelerates learning without enforcing invariance to all transformation components.

Abstract

The typical contrastive self-supervised algorithm uses a similarity measure in latent space as the supervision signal by contrasting positive and negative images directly or indirectly. Although the utility of self-supervised algorithms has improved recently, there are still bottlenecks hindering their widespread use, such as the compute needed. In this paper, we propose a module that serves as an additional objective in the self-supervised contrastive learning paradigm. We show how the inclusion of this module to regress the parameters of an affine transformation or homography, in addition to the original contrastive objective, improves both performance and learning speed. Importantly, we ensure that this module does not enforce invariance to the various components of the affine transform, as this is not always ideal. We demonstrate the effectiveness of the additional objective on two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques

MethodsContrastive Learning