LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk, Kum

TL;DR
LabelDistill introduces a novel label-guided cross-modal knowledge distillation method that enhances camera-based 3D object detection by effectively leveraging LiDAR data while mitigating its imperfections, leading to significant performance improvements.
Contribution
The paper proposes a new approach that uses label guidance and feature partitioning to improve cross-modal knowledge transfer in 3D detection, addressing LiDAR measurement ambiguities.
Findings
Improves mAP by 5.1 points over baseline
Enhances NDS by 4.9 points
Demonstrates effective knowledge transfer from LiDAR to camera-based detectors
Abstract
Recent advancements in camera-based 3D object detection have introduced cross-modal knowledge distillation to bridge the performance gap with LiDAR 3D detectors, leveraging the precise geometric information in LiDAR point clouds. However, existing cross-modal knowledge distillation methods tend to overlook the inherent imperfections of LiDAR, such as the ambiguity of measurements on distant or occluded objects, which should not be transferred to the image detector. To mitigate these imperfections in LiDAR teacher, we propose a novel method that leverages aleatoric uncertainty-free features from ground truth labels. In contrast to conventional label guidance approaches, we approximate the inverse function of the teacher's head to effectively embed label inputs into feature space. This approach provides additional accurate guidance alongside LiDAR teacher, thereby boosting the performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsKnowledge Distillation
