TL;DR
This paper introduces the first fully end-to-end trainable neural network for solving the challenging blind Perspective-n-Point problem, integrating differentiable geometric optimization techniques for improved accuracy and efficiency.
Contribution
It presents a novel end-to-end trainable framework that combines differentiable geometric optimization with neural networks to solve blind PnP without prior pose or correspondences.
Findings
Outperforms existing methods on synthetic data
Achieves state-of-the-art results on real datasets
Efficiently handles large search spaces without priors
Abstract
Blind Perspective-n-Point (PnP) is the problem of estimating the position and orientation of a camera relative to a scene, given 2D image points and 3D scene points, without prior knowledge of the 2D-3D correspondences. Solving for pose and correspondences simultaneously is extremely challenging since the search space is very large. Fortunately it is a coupled problem: the pose can be found easily given the correspondences and vice versa. Existing approaches assume that noisy correspondences are provided, that a good pose prior is available, or that the problem size is small. We instead propose the first fully end-to-end trainable network for solving the blind PnP problem efficiently and globally, that is, without the need for pose priors. We make use of recent results in differentiating optimization problems to incorporate geometric model fitting into an end-to-end learning framework,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
