Is Geometry Enough for Matching in Visual Localization?
Qunjie Zhou, S\'ergio Agostinho, Aljosa Osep, Laura Leal-Taix\'e

TL;DR
This paper introduces GoMatch, a geometry-only approach for visual localization that reduces storage needs and privacy concerns while maintaining high accuracy, by using a novel bearing vectors representation of 3D points.
Contribution
GoMatch presents a new geometry-based matching method that eliminates the need for visual descriptors, addressing storage and privacy issues in large-scale localization.
Findings
Achieves comparable accuracy to visual descriptor methods with significantly less storage.
Reduces median pose errors by over 10 meters and 95 degrees on benchmark datasets.
Demonstrates feasibility for city-scale localization without visual descriptor storage.
Abstract
In this paper, we propose to go beyond the well-established approach to vision-based localization that relies on visual descriptor matching between a query image and a 3D point cloud. While matching keypoints via visual descriptors makes localization highly accurate, it has significant storage demands, raises privacy concerns and requires update to the descriptors in the long-term. To elegantly address those practical challenges for large-scale localization, we present GoMatch, an alternative to visual-based matching that solely relies on geometric information for matching image keypoints to maps, represented as sets of bearing vectors. Our novel bearing vectors representation of 3D points, significantly relieves the cross-modal challenge in geometric-based matching that prevented prior work to tackle localization in a realistic environment. With additional careful architecture design,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Image and Video Retrieval Techniques
