FE-Fusion-VPR: Attention-based Multi-Scale Network Architecture for Visual Place Recognition by Fusing Frames and Events
Kuanxu Hou, Delei Kong, Junjie Jiang, Hao Zhuang, Xinjie Huang and, Zheng Fang

TL;DR
This paper introduces FE-Fusion-VPR, an attention-based multi-scale network that fuses frames and events from cameras to improve visual place recognition, outperforming existing methods on multiple datasets.
Contribution
It presents the first end-to-end network that directly fuses frame and event data for VPR, leveraging multi-scale features and attention mechanisms.
Findings
Achieves 29.26% and 33.59% higher Recall@1 on two datasets compared to state-of-the-art methods.
Outperforms existing event-based and frame-based VPR methods significantly.
Demonstrates robustness in challenging conditions like glare and high-speed motion.
Abstract
Traditional visual place recognition (VPR), usually using standard cameras, is easy to fail due to glare or high-speed motion. By contrast, event cameras have the advantages of low latency, high temporal resolution, and high dynamic range, which can deal with the above issues. Nevertheless, event cameras are prone to failure in weakly textured or motionless scenes, while standard cameras can still provide appearance information in this case. Thus, exploiting the complementarity of standard cameras and event cameras can effectively improve the performance of VPR algorithms. In the paper, we propose FE-Fusion-VPR, an attention-based multi-scale network architecture for VPR by fusing frames and events. First, the intensity frame and event volume are fed into the two-stream feature extraction network for shallow feature fusion. Next, the three-scale features are obtained through the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Advanced Image and Video Retrieval Techniques
Methodsfail
