Accurate and efficient zero-shot 6D pose estimation with frozen foundation models
Andrea Caraffa, Davide Boscaini, Fabio Poiesi

TL;DR
FreeZeV2 is a training-free, highly generalizable 6D pose estimation method leveraging foundation models, achieving state-of-the-art accuracy and efficiency on unseen objects without task-specific training.
Contribution
It introduces a training-free approach using foundation models, with a sparse feature extraction, feature-aware scoring, and modular design for robustness and speed.
Findings
Establishes new state-of-the-art on BOP datasets for unseen objects.
Achieves 8x speedup and 5% accuracy improvement over previous FreeZe.
Gains additional accuracy with ensemble segmentation models.
Abstract
Estimating the 6D pose of objects from RGBD data is a fundamental problem in computer vision, with applications in robotics and augmented reality. A key challenge is achieving generalization to novel objects that were not seen during training. Most existing approaches address this by scaling up training on synthetic data tailored to the task, a process that demands substantial computational resources. But is task-specific training really necessary for accurate and efficient 6D pose estimation of novel objects? To answer No!, we introduce FreeZeV2, the second generation of FreeZe: a training-free method that achieves strong generalization to unseen objects by leveraging geometric and vision foundation models pre-trained on unrelated data. FreeZeV2 improves both accuracy and efficiency over FreeZe through three key contributions: (i) a sparse feature extraction strategy that reduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis
