SPair-71k: A Large-scale Benchmark for Semantic Correspondence

Juhong Min; Jongmin Lee; Jean Ponce; Minsu Cho

arXiv:1908.10543·cs.CV·August 29, 2019·69 cites

SPair-71k: A Large-scale Benchmark for Semantic Correspondence

Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho

PDF

Open Access 1 Datasets

TL;DR

SPair-71k is a large-scale, richly annotated dataset designed to advance research in semantic correspondence by providing diverse, challenging image pairs for benchmarking computer vision algorithms.

Contribution

The paper introduces SPair-71k, a significantly larger and more diverse dataset for semantic correspondence, with accurate annotations to facilitate research and evaluation.

Findings

01

Recent methods evaluated as baselines

02

Dataset covers large viewpoint and scale variations

03

Provides a reliable testbed for semantic matching research

Abstract

Establishing visual correspondences under large intra-class variations, which is often referred to as semantic correspondence or semantic matching, remains a challenging problem in computer vision. Despite its significance, however, most of the datasets for semantic correspondence are limited to a small amount of image pairs with similar viewpoints and scales. In this paper, we present a new large-scale benchmark dataset of semantically paired images, SPair-71k, which contains 70,958 image pairs with diverse variations in viewpoint and scale. Compared to previous datasets, it is significantly larger in number and contains more accurate and richer annotations. We believe this dataset will provide a reliable testbed to study the problem of semantic correspondence and will help to advance research in this area. We provide the results of recent methods on our new dataset as baselines for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

0jl/SPair-71k
dataset· 91 dl
91 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning