Correct-by-Construction Vision-based Pose Estimation using Geometric Generative Models
Ulices Santa Cruz, Mahmoud Elfar, Yasser Shoukry

TL;DR
This paper introduces a framework for certifiable vision-based pose estimation that combines geometric generative models with neural networks, providing provable guarantees crucial for safety-critical autonomous systems.
Contribution
It presents a novel approach integrating physics-driven geometric models with learning-based neural networks to achieve certified pose estimation in cluttered environments.
Findings
Effective pose estimation with certified error bounds
Successful application to synthetic and real images
Robust detection and estimation in cluttered scenes
Abstract
We consider the problem of vision-based pose estimation for autonomous systems. While deep neural networks have been successfully used for vision-based tasks, they inherently lack provable guarantees on the correctness of their output, which is crucial for safety-critical applications. We present a framework for designing certifiable neural networks (NNs) for perception-based pose estimation that integrates physics-driven modeling with learning-based estimation. The proposed framework begins by leveraging the known geometry of planar objects commonly found in the environment, such as traffic signs and runway markings, referred to as target objects. At its core, it introduces a geometric generative model (GGM), a neural-network-like model whose parameters are derived from the image formation process of a target object observed by a camera. Once designed, the GGM can be used to train…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Robot Manipulation and Learning
