PASR: Pose-Aware 3D Shape Retrieval from Occluded Single Views
Jiaxin Shi, Guofeng Zhang, Wufei Ma, Naifu Liang, Adam Kortylewski, Alan Yuille

TL;DR
PASR is a novel pose-aware framework for 3D shape retrieval from occluded single views, leveraging analysis-by-synthesis and knowledge distillation from a 2D foundation model to improve robustness and accuracy.
Contribution
The paper introduces PASR, a new method that formulates 3D shape retrieval as a feature analysis-by-synthesis problem, enhancing robustness to occlusion and fine details.
Findings
PASR significantly outperforms existing methods on multiple datasets.
It demonstrates robustness to partial occlusion in shape retrieval.
Achieves strong multi-task performance including shape retrieval, pose estimation, and classification.
Abstract
Single-view 3D shape retrieval is a fundamental yet challenging task that is increasingly important with the growth of available 3D data. Existing approaches largely fall into two categories: those using contrastive learning to map point cloud features into existing vision-language spaces and those that learn a common embedding space for 2D images and 3D shapes. However, these feed-forward, holistic alignments are often difficult to interpret, which in turn limits their robustness and generalization to real-world applications. To address this problem, we propose Pose-Aware 3D Shape Retrieval (PASR), a framework that formulates retrieval as a feature-level analysis-by-synthesis problem by distilling knowledge from a 2D foundation model (DINOv3) into a 3D encoder. By aligning pose-conditioned 3D projections with 2D feature maps, our method bridges the gap between real-world images and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
