SCNet: Learning Semantic Correspondence

Kai Han; Rafael S. Rezende; Bumsub Ham; Kwan-Yee K. Wong; Minsu Cho,; Cordelia Schmid; Jean Ponce

arXiv:1705.04043·cs.CV·August 18, 2017·24 cites

SCNet: Learning Semantic Correspondence

Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho,, Cordelia Schmid, Jean Ponce

PDF

Open Access 1 Repo

TL;DR

This paper introduces SCNet, a convolutional neural network that learns semantic correspondences between images by incorporating geometric consistency, outperforming previous methods on standard benchmarks.

Contribution

SCNet is a novel CNN architecture that explicitly models geometric consistency for semantic correspondence, using region proposals as matching primitives.

Findings

01

SCNet outperforms recent deep learning architectures.

02

SCNet surpasses previous hand-crafted feature methods.

03

The approach achieves superior results on standard benchmarks.

Abstract

This paper addresses the problem of establishing semantic correspondences between images depicting different instances of the same object or scene category. Previous approaches focus on either combining a spatial regularizer with hand-crafted features, or learning a correspondence model for appearance only. We propose instead a convolutional neural network architecture, called SCNet, for learning a geometrically plausible model for semantic correspondence. SCNet uses region proposals as matching primitives, and explicitly incorporates geometric consistency in its loss function. It is trained on image pairs obtained from the PASCAL VOC 2007 keypoint dataset, and a comparative evaluation on several standard benchmarks demonstrates that the proposed approach substantially outperforms both recent deep learning architectures and previous methods based on hand-crafted features.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

k-han/SCNet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Face recognition and analysis