Leveraging BEV Representation for 360-degree Visual Place Recognition
Xuecheng Xu, Yanmei Jiao, Sha Lu, Xiaqing Ding, Rong Xiong, Yue Wang

TL;DR
This paper introduces a novel BEV-based approach for 360-degree visual place recognition, demonstrating improved performance through spatially aware feature extraction, deformation compensation, and rotation-invariant aggregation, validated on diverse datasets.
Contribution
It is the first to employ BEV representation in visual place recognition, integrating vision-LiDAR fusion and rotation-invariant features for enhanced accuracy.
Findings
BEV representation outperforms baseline methods in VPR tasks.
Deformable attention improves robustness to camera misalignments.
Rotation-invariant aggregation enhances recognition in varied orientations.
Abstract
This paper investigates the advantages of using Bird's Eye View (BEV) representation in 360-degree visual place recognition (VPR). We propose a novel network architecture that utilizes the BEV representation in feature extraction, feature aggregation, and vision-LiDAR fusion, which bridges visual cues and spatial awareness. Our method extracts image features using standard convolutional networks and combines the features according to pre-defined 3D grid spatial points. To alleviate the mechanical and time misalignments between cameras, we further introduce deformable attention to learn the compensation. Upon the BEV feature representation, we then employ the polar transform and the Discrete Fourier transform for aggregation, which is shown to be rotation-invariant. In addition, the image and point cloud cues can be easily stated in the same coordinates, which benefits sensor fusion for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Indoor and Outdoor Localization Technologies · Advanced Image and Video Retrieval Techniques
