TL;DR
BEVCALIB introduces a novel LiDAR-camera calibration method using bird's-eye view features, achieving state-of-the-art accuracy and efficiency in autonomous driving datasets.
Contribution
It is the first model to utilize BEV features for LiDAR-camera calibration directly from raw data, with a geometric feature selector for improved performance.
Findings
Outperforms existing methods by over 47% in translation accuracy on KITTI.
Achieves 82% rotation accuracy improvement over baselines.
Demonstrates robustness under various noise conditions.
Abstract
Accurate LiDAR-camera calibration is fundamental to fusing multi-modal perception in autonomous driving and robotic systems. Traditional calibration methods require extensive data collection in controlled environments and cannot compensate for the transformation changes during the vehicle/robot movement. In this paper, we propose the first model that uses bird's-eye view (BEV) features to perform LiDAR camera calibration from raw data, termed BEVCALIB. To achieve this, we extract camera BEV features and LiDAR BEV features separately and fuse them into a shared BEV feature space. To fully utilize the geometric information from the BEV feature, we introduce a novel feature selector to filter the most important features in the transformation decoder, which reduces memory consumption and enables efficient training. Extensive evaluations on KITTI, NuScenes, and our own dataset demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
