TL;DR
This paper introduces a novel method for establishing semantic correspondences between 2D image segments and 3D shape parts by distilling deep features and identifying Best Segmentation Buddies, demonstrating robustness across diverse pairs.
Contribution
The work presents a new approach to cross-modality segmentation correspondence by linking pixels to shape vertices through feature similarity and a novel buddy identification process.
Findings
Accurately establishes semantic correspondences across diverse image-shape pairs.
Effectively segments 3D shapes directly using distilled 2D features.
Demonstrates robustness and generality in challenging real-world scenarios.
Abstract
Finding correspondences is a fundamental and extensively researched problem in computer vision and graphics. In this work, we examine the underexplored task of estimating segmentation-to-segmentation correspondence between images in the wild and untextured 3D shapes. This task is highly challenging due to substantial differences in appearance, geometry, and viewpoint. Our approach bridges the cross-modality gap by linking pixels in the image segment to vertices in the corresponding semantic part of the 3D shape. To achieve this, we first distill deep visual features from a 2D vision model onto the 3D shape surface, allowing for the computation of feature similarity between image pixels and shape vertices. Then, we identify Best Segmentation Buddies, vertices whose most similar image pixel lies within the image segmentation region, enabling the reliable discovery of vertices in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
