Towards Test-time Efficient Visual Place Recognition via Asymmetric Query Processing
Jaeyoon Kim, Yoonki Cho, Sung-Eui Yoon

TL;DR
This paper presents an efficient asymmetric visual place recognition framework that reduces computational costs by using a high-capacity gallery model and a lightweight query network, leveraging a geographical memory bank and implicit embedding augmentation.
Contribution
It introduces a novel asymmetric VPR approach with a geographical memory bank and implicit augmentation, enabling resource-efficient deployment without expensive compatibility training.
Findings
Significantly reduces computational costs compared to existing methods.
Outperforms existing asymmetric retrieval techniques in accuracy.
Demonstrates effectiveness in resource-constrained environments.
Abstract
Visual Place Recognition (VPR) has advanced significantly with high-capacity foundation models like DINOv2, achieving remarkable performance. Nonetheless, their substantial computational cost makes deployment on resource-constrained devices impractical. In this paper, we introduce an efficient asymmetric VPR framework that incorporates a high-capacity gallery model for offline feature extraction with a lightweight query network for online processing. A key challenge in this setting is ensuring compatibility between these heterogeneous networks, which conventional approaches address through computationally expensive k-NN-based compatible training. To overcome this, we propose a geographical memory bank that structures gallery features using geolocation metadata inherent in VPR databases, eliminating the need for exhaustive k-NN computations. Additionally, we introduce an implicit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization
