ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models
Weihao Cheng, Yan-Pei Cao, Ying Shan

TL;DR
ID-Pose introduces a novel, training-free approach that leverages pre-trained diffusion models to accurately estimate camera poses from sparse views by inverting the denoising process, outperforming existing methods.
Contribution
The paper presents ID-Pose, a new method that inverts diffusion models for camera pose estimation without training, extending to multiple images and generalizing to open-world scenarios.
Findings
ID-Pose outperforms state-of-the-art methods on various datasets.
The approach effectively estimates poses from casual photos and rendered images.
It handles multiple images through triangular relations.
Abstract
Given sparse views of a 3D object, estimating their camera poses is a long-standing and intractable problem. Toward this goal, we consider harnessing the pre-trained diffusion model of novel views conditioned on viewpoints (Zero-1-to-3). We present ID-Pose which inverses the denoising diffusion process to estimate the relative pose given two input images. ID-Pose adds a noise to one image, and predicts the noise conditioned on the other image and a hypothesis of the relative pose. The prediction error is used as the minimization objective to find the optimal pose with the gradient descent method. We extend ID-Pose to handle more than two images and estimate each pose with multiple image pairs from triangular relations. ID-Pose requires no training and generalizes to open-world images. We conduct extensive experiments using casually captured photos and rendered images with random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
MethodsDiffusion
