Robust Image Retrieval-based Visual Localization using Kapture
Martin Humenberger, Yohann Cabon, Nicolas Guerin, Julien, Morat, Vincent Leroy, J\'er\^ome Revaud, Philippe Rerole, No\'e, Pion, Cesar de Souza, Gabriela Csurka

TL;DR
This paper introduces kapture, a flexible data format and toolbox for visual localization, enabling evaluation across multiple datasets and demonstrating high performance with various configurations.
Contribution
We present kapture, a unified data format and toolbox that simplifies dataset integration and evaluation for visual localization and structure-from-motion tasks.
Findings
Our pipeline achieves top rankings on eight public datasets.
Kapture enables versatile use of features, 3D data, and sensor inputs.
Open-source release facilitates future research and benchmarking.
Abstract
Visual localization tackles the challenge of estimating the camera pose from images by using correspondence analysis between query images and a map. This task is computation and data intensive which poses challenges on thorough evaluation of methods on various datasets. However, in order to further advance in the field, we claim that robust visual localization algorithms should be evaluated on multiple datasets covering a broad domain variety. To facilitate this, we introduce kapture, a new, flexible, unified data format and toolbox for visual localization and structure-from-motion (SFM). It enables easy usage of different datasets as well as efficient and reusable data processing. To demonstrate this, we present a versatile pipeline for visual localization that facilitates the use of different local and global features, 3D data (e.g. depth maps), non-vision sensor data (e.g. IMU, GPS,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
