ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation
Hao Tang, Weiyao Wang, Pierre Gleize, and Matt Feiszli

TL;DR
ADen introduces an adaptive density representation framework that unifies regression and probabilistic methods for sparse-view camera pose estimation, improving accuracy and efficiency in challenging scenarios.
Contribution
The paper proposes ADen, a novel framework combining generator-discriminator architecture to model multi-modal pose distributions efficiently.
Findings
Achieves higher accuracy than previous methods.
Reduces runtime while maintaining precision.
Effectively handles symmetry and multi-modal ambiguities.
Abstract
Recovering camera poses from a set of images is a foundational task in 3D computer vision, which powers key applications such as 3D scene/object reconstructions. Classic methods often depend on feature correspondence, such as keypoints, which require the input images to have large overlap and small viewpoint changes. Such requirements present considerable challenges in scenarios with sparse views. Recent data-driven approaches aim to directly output camera poses, either through regressing the 6DoF camera poses or formulating rotation as a probability distribution. However, each approach has its limitations. On one hand, directly regressing the camera poses can be ill-posed, since it assumes a single mode, which is not true under symmetry and leads to sub-optimal solutions. On the other hand, probabilistic approaches are capable of modeling the symmetry ambiguity, yet they sample the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Optical measurement and interference techniques
MethodsSparse Evolutionary Training
