Learning Semantic Correspondence Exploiting an Object-level Prior

Junghyup Lee; Dohyung Kim; Wonkyung Lee; Jean Ponce; Bumsub Ham

arXiv:1911.12914·cs.CV·July 23, 2020

Learning Semantic Correspondence Exploiting an Object-level Prior

Junghyup Lee, Dohyung Kim, Wonkyung Lee, Jean Ponce, Bumsub Ham

PDF

TL;DR

This paper introduces SFNet, a CNN that uses object masks and synthetic deformations to improve dense semantic correspondence between images, outperforming previous methods.

Contribution

The paper proposes a novel CNN architecture, SFNet, incorporating an object-level prior and a differentiable argmax for end-to-end training in semantic correspondence.

Findings

01

SFNet significantly outperforms state-of-the-art methods on benchmark datasets.

02

Using object masks as supervision improves correspondence accuracy.

03

The differentiable argmax enables effective end-to-end training of the network.

Abstract

We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal provides an object-level prior for the semantic correspondence task and offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.