ImPosing: Implicit Pose Encoding for Efficient Visual Localization
Arthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan, Stanciulescu, Arnaud de La Fortelle

TL;DR
ImPosing introduces a compact, learning-based visual localization method that encodes images and poses into a shared latent space, enabling real-time, accurate localization in city-scale environments without large map storage.
Contribution
The paper presents a novel implicit pose encoding approach that improves localization accuracy and efficiency, especially in large-scale environments, by using neural networks to embed images and poses into a common latent space.
Findings
Achieves real-time localization in city-scale environments.
Significantly outperforms prior methods in accuracy.
Reduces storage requirements regardless of map size.
Abstract
We propose a novel learning-based formulation for visual localization of vehicles that can operate in real-time in city-scale environments. Visual localization algorithms determine the position and orientation from which an image has been captured, using a set of geo-referenced images or a 3D scene representation. Our new localization paradigm, named Implicit Pose Encoding (ImPosing), embeds images and camera poses into a common latent representation with 2 separate neural networks, such that we can compute a similarity score for each image-pose pair. By evaluating candidates through the latent space in a hierarchical manner, the camera position and orientation are not directly regressed but incrementally refined. Very large environments force competitors to store gigabytes of map data, whereas our method is very compact independently of the reference database size. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
ImPosing: Implicit Pose Encoding for Efficient Visual Localization· youtube
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Robotic Path Planning Algorithms
