Shape-of-You: Fused Gromov-Wasserstein Optimal Transport for Semantic Correspondence in-the-Wild
Jiin Im, Sisung Liu, Je Hyeong Hong

TL;DR
Shape-of-You introduces a novel approach using Fused Gromov-Wasserstein optimal transport to improve semantic correspondence in challenging in-the-wild images by jointly considering feature similarity and structural consistency.
Contribution
It reformulates pseudo-label generation as a Fused Gromov-Wasserstein problem and proposes an efficient approximation method, enabling robust unsupervised learning of semantic correspondence.
Findings
Achieves state-of-the-art results on SPair-71k and AP-10k datasets.
Establishes a new benchmark in semantic correspondence without explicit geometric annotations.
Demonstrates robustness to structural ambiguities in in-the-wild images.
Abstract
Semantic correspondence is essential for handling diverse in-the-wild images lacking explicit correspondence annotations. While recent 2D foundation models offer powerful features, adapting them for unsupervised learning via nearest-neighbor pseudo-labels has key limitations: it operates locally, ignoring structural relationships, and consequently its reliance on 2D appearance fails to resolve geometric ambiguities arising from symmetries or repetitive features. In this work, we address this by reformulating pseudo-label generation as a Fused Gromov-Wasserstein (FGW) problem, which jointly optimizes inter-feature similarity and intra-structural consistency. Our framework, Shape-of-You (SoY), leverages a 3D foundation model to define this intra-structure in the geometric space, resolving abovementioned ambiguity. However, since FGW is a computationally prohibitive quadratic problem, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
