HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation
Yibo Liu, Zhaodong Jiang, Binbin Xu, Guile Wu, Yuan Ren, Tongtong Cao,, Bingbing Liu, Rui Heng Yang, Amir Rasouli, Jinjun Shan

TL;DR
HIPPo introduces a model-free, zero-shot 6D pose estimation framework that leverages image-to-3D priors from diffusion models, enabling rapid, accurate pose estimation without curated models or reference images.
Contribution
The paper presents HIPPo, a novel framework that uses diffusion-based image-to-3D priors for instant, model-free 6D pose estimation of unseen objects in real-time scenarios.
Findings
HIPPo outperforms state-of-the-art methods with limited prior references.
HIPPo can generate 3D meshes of unseen objects in seconds.
HIPPo effectively refines object meshes through online observations.
Abstract
This work focuses on model-free zero-shot 6D object pose estimation for robotics applications. While existing methods can estimate the precise 6D pose of objects, they heavily rely on curated CAD models or reference images, the preparation of which is a time-consuming and labor-intensive process. Moreover, in real-world scenarios, 3D models or reference images may not be available in advance and instant robot reaction is desired. In this work, we propose a novel framework named HIPPo, which eliminates the need for curated CAD models and reference images by harnessing image-to-3D priors from Diffusion Models, enabling model-free zero-shot 6D pose estimation. Specifically, we construct HIPPo Dreamer, a rapid image-to-mesh model built on a multiview Diffusion Model and a 3D reconstruction foundation model. Our HIPPo Dreamer can generate a 3D mesh of any unseen objects from a single glance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Optical Sensing Technologies · Advanced Vision and Imaging
MethodsDiffusion
