Explicit homography estimation improves contrastive self-supervised learning
David Torpey, Richard Klein

TL;DR
This paper introduces an explicit homography estimation module as an additional objective in contrastive self-supervised learning, enhancing performance and convergence speed without enforcing invariance to all transformations.
Contribution
It proposes a novel module for homography regression in contrastive learning, improving effectiveness and training efficiency across multiple datasets.
Findings
Homography estimation improves contrastive learning performance.
Affine transformation performs better than general homography in all tested cases.
The method accelerates learning without enforcing invariance to all transformation components.
Abstract
The typical contrastive self-supervised algorithm uses a similarity measure in latent space as the supervision signal by contrasting positive and negative images directly or indirectly. Although the utility of self-supervised algorithms has improved recently, there are still bottlenecks hindering their widespread use, such as the compute needed. In this paper, we propose a module that serves as an additional objective in the self-supervised contrastive learning paradigm. We show how the inclusion of this module to regress the parameters of an affine transformation or homography, in addition to the original contrastive objective, improves both performance and learning speed. Importantly, we ensure that this module does not enforce invariance to the various components of the affine transform, as this is not always ideal. We demonstrate the effectiveness of the additional objective on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsContrastive Learning
