Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels
Olaf D\"unkel, Thomas Wimmer, Christian Theobalt, Christian Rupprecht, Adam Kortylewski

TL;DR
This paper introduces a 3D-aware pseudo-labeling method to improve semantic correspondence estimation in images, reducing annotation needs and achieving state-of-the-art results on SPair-71k.
Contribution
It proposes a novel 3D-aware pseudo-labeling approach with an adapter for feature refinement, enhancing semantic matching without extensive dataset annotations.
Findings
Achieved over 4% absolute improvement on SPair-71k
Reduced reliance on dataset-specific annotations
Demonstrated generality across different data sources
Abstract
Finding correspondences between semantically similar points across images and object instances is one of the everlasting challenges in computer vision. While large pre-trained vision models have recently been demonstrated as effective priors for semantic matching, they still suffer from ambiguities for symmetric objects or repeated object parts. We propose improving semantic correspondence estimation through 3D-aware pseudo-labeling. Specifically, we train an adapter to refine off-the-shelf features using pseudo-labels obtained via 3D-aware chaining, filtering wrong labels through relaxed cyclic consistency, and 3D spherical prototype mapping constraints. While reducing the need for dataset-specific annotations compared to prior work, we establish a new state-of-the-art on SPair-71k, achieving an absolute gain of over 4% and of over 7% compared to methods with similar supervision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
MethodsAdapter · Sparse Evolutionary Training
