Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels

Olaf Dünkel; Thomas Wimmer; Christian Theobalt; Christian Rupprecht; Adam Kortylewski

arXiv:2506.05312 · cs.CV · September 24, 2025

Open Access

TL;DR

This paper introduces a 3D-aware pseudo-labeling method to improve semantic correspondence estimation in images, reducing annotation needs and achieving state-of-the-art results on SPair-71k.

Contribution

It proposes a novel 3D-aware pseudo-labeling approach with an adapter for feature refinement, enhancing semantic matching without extensive dataset annotations.

Findings

01

Achieved an absolute gain of over 4% on SPair-71k, and of over 7% compared to methods with similar supervision

02

Reduced reliance on dataset-specific annotations

03

Demonstrated generality across different data sources

Abstract

Finding correspondences between semantically similar points across images and object instances is one of the everlasting challenges in computer vision. While large pre-trained vision models have recently been demonstrated as effective priors for semantic matching, they still suffer from ambiguities for symmetric objects or repeated object parts. We propose improving semantic correspondence estimation through 3D-aware pseudo-labeling. Specifically, we train an adapter to refine off-the-shelf features using pseudo-labels obtained via 3D-aware chaining, filtering wrong labels through relaxed cyclic consistency, and 3D spherical prototype mapping constraints. While reducing the need for dataset-specific annotations compared to prior work, we establish a new state-of-the-art on SPair-71k, achieving an absolute gain of over 4% and of over 7% compared to methods with similar supervision…
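The filtering idea named in the abstract can be illustrated with a generic sketch. The code below is not the authors' implementation: it omits the 3D-aware chaining and the spherical prototype constraint entirely, and only shows one common reading of "relaxed" cyclic consistency, where a match from image A to image B is kept if matching back from B to A lands within a tolerance of the starting point. The function names, the cosine nearest-neighbor matcher, and the `radius` parameter are all assumptions made for illustration.

```python
import numpy as np


def nearest_neighbors(src, dst):
    """For each row of src, return the index of the most cosine-similar row of dst."""
    src_n = src / np.linalg.norm(src, axis=1, keepdims=True)
    dst_n = dst / np.linalg.norm(dst, axis=1, keepdims=True)
    return np.argmax(src_n @ dst_n.T, axis=1)


def relaxed_cycle_filter(feat_a, feat_b, pos_a, radius):
    """Hypothetical helper: keep A->B matches whose round trip A->B->A
    returns within `radius` (in pixels) of the starting location.

    feat_a: (Na, D) features of points in image A
    feat_b: (Nb, D) features of points in image B
    pos_a:  (Na, 2) pixel coordinates of the points in image A
    """
    fwd = nearest_neighbors(feat_a, feat_b)  # A -> B
    bwd = nearest_neighbors(feat_b, feat_a)  # B -> A
    back = bwd[fwd]                          # A -> B -> A
    # Distance between each start point and where its round trip ends.
    err = np.linalg.norm(pos_a[back] - pos_a, axis=1)
    return fwd, err <= radius
```

With `radius=0` this reduces to strict cycle consistency, which discards any match whose round trip ends at a different point, even a neighboring one. A positive radius keeps such near misses as pseudo-labels while still rejecting matches that return to a distant location, which is the typical failure for symmetric objects or repeated parts.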

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Robotics and Sensor-Based Localization

Methods: Adapter · Sparse Evolutionary Training