LiDAR Registration with Visual Foundation Models
Niclas V\"odisch, Giovanni Cioffi, Marco Cannici, Wolfram Burgard,, Davide Scaramuzza

TL;DR
This paper introduces a novel LiDAR registration method using visual foundation model features from surround-view images, achieving robust 6DoF alignment even with significant domain shifts and without retraining.
Contribution
The authors propose using DINOv2 features from images as point descriptors for LiDAR registration, outperforming complex baselines and not requiring domain-specific retraining.
Findings
Outperforms baseline methods by +24.8 and +17.3 registration recall on NCLT and Oxford datasets.
Handles both sparse LiDAR scans and dense 3D maps effectively.
Does not require retraining or domain-specific adjustments.
Abstract
LiDAR registration is a fundamental task in robotic mapping and localization. A critical component of aligning two point clouds is identifying robust point correspondences using point descriptors. This step becomes particularly challenging in scenarios involving domain shifts, seasonal changes, and variations in point cloud structures. These factors substantially impact both handcrafted and learning-based approaches. In this paper, we address these problems by proposing to use DINOv2 features, obtained from surround-view images, as point descriptors. We demonstrate that coupling these descriptors with traditional registration algorithms, such as RANSAC or ICP, facilitates robust 6DoF alignment of LiDAR scans with 3D maps, even when the map was recorded more than a year before. Although conceptually straightforward, our method substantially outperforms more complex baseline techniques.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage
