Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition
Sergio Izquierdo, Javier Civera

TL;DR
This paper introduces CliqueMining, a novel sampling strategy that enhances the geographic distance sensitivity of visual place recognition embeddings, leading to significant improvements in recall rates on key benchmarks.
Contribution
We propose CliqueMining, a graph-based sampling method that improves the geographic distance sensitivity of VPR embeddings in a single-stage framework.
Findings
Recall@1 increased from 75% to 82% on MSLS Challenge.
Recall@1 increased from 76% to 90% on Nordland.
Significant state-of-the-art improvements achieved.
Abstract
Visual Place Recognition (VPR) plays a critical role in many localization and mapping pipelines. It consists of retrieving the closest sample to a query image, in a certain embedding space, from a database of geotagged references. The image embedding is learned to effectively describe a place despite variations in visual appearance, viewpoint, and geometric changes. In this work, we formulate how limitations in the Geographic Distance Sensitivity of current VPR embeddings result in a high probability of incorrectly sorting the top-k retrievals, negatively impacting the recall. In order to address this issue in single-stage VPR, we propose a novel mining strategy, CliqueMining, that selects positive and negative examples by sampling cliques from a graph of visually similar images. Our approach boosts the sensitivity of VPR embeddings at small distance ranges, significantly improving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods
