Learning Contrastive Representation for Semantic Correspondence

Taihong Xiao; Sifei Liu; Shalini De Mello; Zhiding Yu; Jan Kautz,; Ming-Hsuan Yang

arXiv:2109.10967·cs.CV·March 11, 2022

Learning Contrastive Representation for Semantic Correspondence

Taihong Xiao, Sifei Liu, Shalini De Mello, Zhiding Yu, Jan Kautz,, Ming-Hsuan Yang

PDF

Open Access

TL;DR

This paper introduces a multi-level contrastive learning method for semantic correspondence that does not depend on ImageNet pretraining, effectively handling variations and reducing labeling efforts.

Contribution

It proposes a novel self-supervised contrastive learning framework for pixel-level semantic matching without relying on pretrained models.

Findings

01

Outperforms state-of-the-art on PF-PASCAL, PF-WILLOW, and SPair-71k datasets.

02

Effectively handles large appearance, scale, and pose variations.

03

Reduces dependence on labor-intensive pixel-level annotations.

Abstract

Dense correspondence across semantically related images has been extensively studied, but still faces two challenges: 1) large variations in appearance, scale and pose exist even for objects from the same category, and 2) labeling pixel-level dense correspondences is labor intensive and infeasible to scale. Most existing approaches focus on designing various matching approaches with fully-supervised ImageNet pretrained networks. On the other hand, while a variety of self-supervised approaches are proposed to explicitly measure image-level similarities, correspondence matching the pixel level remains under-explored. In this work, we propose a multi-level contrastive learning approach for semantic matching, which does not rely on any ImageNet pretrained model. We show that image-level contrastive learning is a key component to encourage the convolutional features to find correspondence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning