Register assisted aggregation for Visual Place Recognition

Xuan Yu; Zhenyong Fu

arXiv:2405.11526·cs.CV·May 21, 2024

Register assisted aggregation for Visual Place Recognition

Xuan Yu, Zhenyong Fu

PDF

Open Access

TL;DR

This paper introduces a register-assisted feature aggregation method for Visual Place Recognition that improves the discrimination of place features by separating stable from unstable features, outperforming existing methods.

Contribution

A novel register-based approach to enhance feature aggregation in VPR, effectively distinguishing useful features and improving recognition accuracy.

Findings

01

Registers help separate stable from unstable features.

02

The method outperforms state-of-the-art VPR techniques.

03

Experimental results validate the effectiveness of the approach.

Abstract

Visual Place Recognition (VPR) refers to the process of using computer vision to recognize the position of the current query image. Due to the significant changes in appearance caused by season, lighting, and time spans between query images and database images for retrieval, these differences increase the difficulty of place recognition. Previous methods often discarded useless features (such as sky, road, vehicles) while uncontrolled discarding features that help improve recognition accuracy (such as buildings, trees). To preserve these useful features, we propose a new feature aggregation method to address this issue. Specifically, in order to obtain global and local features that contain discriminative place information, we added some registers on top of the original image tokens to assist in model training. After reallocating attention weights, these registers were discarded. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging