6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model
Matteo Bortolon, Theodore Tsesmelis, Stuart James, Fabio Poiesi,, Alessio Del Bue

TL;DR
6DGS introduces a novel, non-iterative method for 6DoF camera pose estimation from a single image using 3D Gaussian Splatting, achieving high accuracy and real-time performance without pose initialization.
Contribution
The paper presents a closed-form, initialization-free approach for 6D pose estimation leveraging 3D Gaussian Splatting, improving accuracy and efficiency over existing methods.
Findings
12% improvement in rotational accuracy over baselines
22% improvement in translation accuracy
Operates at 15fps on consumer hardware
Abstract
We propose 6DGS to estimate the camera pose of a target RGB image given a 3D Gaussian Splatting (3DGS) model representing the scene. 6DGS avoids the iterative process typical of analysis-by-synthesis methods (e.g. iNeRF) that also require an initialization of the camera pose in order to converge. Instead, our method estimates a 6DoF pose by inverting the 3DGS rendering process. Starting from the object surface, we define a radiant Ellicell that uniformly generates rays departing from each ellipsoid that parameterize the 3DGS model. Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn is used to obtain the best bindings between the target image pixels and the cast rays. These pixel-ray bindings are then ranked to select the best scoring bundle of rays, which their intersection provides the camera center and, in turn, the camera rotation. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Object Detection Techniques · Optical measurement and interference techniques · Advanced Vision and Imaging
