Level Set-Based Camera Pose Estimation From Multiple 2D/3D Ellipse-Ellipsoid Correspondences
Matthieu Zins, Gilles Simon, Marie-Odile Berger

TL;DR
This paper introduces a novel level set-based method for camera pose estimation using 2D-3D ellipse correspondences, improving robustness to partial visibility and incorporating uncertainty for better accuracy.
Contribution
It develops a new ellipse-ellipse cost function based on level set sampling and integrates predictive uncertainty to enhance pose estimation accuracy.
Findings
The proposed method handles partial object visibility effectively.
It outperforms traditional metrics in ellipse-ellipse correspondence accuracy.
Incorporating uncertainty improves pose estimation robustness.
Abstract
In this paper, we propose an object-based camera pose estimation from a single RGB image and a pre-built map of objects, represented with ellipsoidal models. We show that contrary to point correspondences, the definition of a cost function characterizing the projection of a 3D object onto a 2D object detection is not straightforward. We develop an ellipse-ellipse cost based on level sets sampling, demonstrate its nice properties for handling partially visible objects and compare its performance with other common metrics. Finally, we show that the use of a predictive uncertainty on the detected ellipses allows a fair weighting of the contribution of the correspondences which improves the computed pose. The code is released at https://gitlab.inria.fr/tangram/level-set-based-camera-pose-estimation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Object Detection Techniques · Robotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage
