SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models

Bingxi Liu; Pengju Zhang; Li He; Hao Chen; Shiyi Guo; Yihong Wu; Jinqiang Cui; Hong Zhang

arXiv:2506.13073·cs.CV·June 17, 2025

SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models

Bingxi Liu, Pengju Zhang, Li He, Hao Chen, Shiyi Guo, Yihong Wu, Jinqiang Cui, Hong Zhang

PDF

Open Access

TL;DR

SuperPlace revives classical feature aggregation techniques for visual place recognition, integrating foundation model concepts, and introduces novel training and aggregation strategies that outperform recent methods on benchmark datasets.

Contribution

The paper introduces a unified training framework, a novel G$^2$M aggregation method, and a fine-tuning strategy for NetVLAD, advancing classical methods in the era of foundation models.

Findings

01

G$^2$M achieves high performance with fewer feature dimensions.

02

NVL-FT$^2$ ranks first on the MSLS leaderboard.

03

SuperPlace outperforms recent state-of-the-art methods.

Abstract

Recent visual place recognition (VPR) approaches have leveraged foundation models (FM) and introduced novel aggregation techniques. However, these methods have failed to fully exploit key concepts of FM, such as the effective utilization of extensive training sets, and they have overlooked the potential of classical aggregation methods, such as GeM and NetVLAD. Building on these insights, we revive classical feature aggregation methods and develop more fundamental VPR models, collectively termed SuperPlace. First, we introduce a supervised label alignment method that enables training across various VPR datasets within a unified framework. Second, we propose G $^{2}$ M, a compact feature aggregation method utilizing two GeMs, where one GeM learns the principal components of feature maps along the channel dimension and calibrates the output of the other. Third, we propose the secondary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Image Retrieval and Classification Techniques