SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection
Haimei Zhao, Qiming Zhang, Shanshan Zhao, Zhe Chen, Jing Zhang,, Dacheng Tao

TL;DR
SimDistill introduces a novel multi-modal distillation approach that enables camera-only 3D object detection models to learn from LiDAR data, significantly improving accuracy while maintaining low-cost deployment.
Contribution
The paper proposes a new SimDistill method with architecture and distillation strategies that bridge the modality gap, enabling effective multi-modal knowledge transfer from LiDAR to camera-based models.
Findings
Achieves 4.8% mAP improvement over baseline
Achieves 4.1% NDS improvement over baseline
Outperforms state-of-the-art methods in 3D detection
Abstract
Multi-view camera-based 3D object detection has become popular due to its low cost, but accurately inferring 3D geometry solely from camera data remains challenging and may lead to inferior performance. Although distilling precise 3D geometry knowledge from LiDAR data could help tackle this challenge, the benefits of LiDAR information could be greatly hindered by the significant modality gap between different sensory modalities. To address this issue, we propose a Simulated multi-modal Distillation (SimDistill) method by carefully crafting the model architecture and distillation strategy. Specifically, we devise multi-modal architectures for both teacher and student models, including a LiDAR-camera fusion-based teacher and a simulated fusion-based student. Owing to the ``identical'' architecture design, the student can mimic the teacher to generate multi-modal features with merely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Optical Sensing Technologies
MethodsKnowledge Distillation
