Towards Efficient 3D Object Detection with Knowledge Distillation
Jihan Yang, Shaoshuai Shi, Runyu Ding, Zhe Wang, Xiaojuan Qi

TL;DR
This paper explores knowledge distillation techniques to develop efficient 3D object detectors that balance accuracy and computational cost, achieving high performance with reduced model complexity.
Contribution
It introduces a novel KD pipeline with an enhanced logit method and teacher-guided initialization, along with a benchmark for 3D detection KD methods.
Findings
Achieved 65.75% LEVEL 2 mAPH on Waymo dataset, surpassing teacher models.
Developed models running at 51 FPS on NVIDIA A100, 2.2x faster than PointPillar.
Reduced FLOPs to 44% of teacher models while maintaining high accuracy.
Abstract
Despite substantial progress in 3D object detection, advanced 3D detectors often suffer from heavy computation overheads. To this end, we explore the potential of knowledge distillation (KD) for developing efficient 3D object detectors, focusing on popular pillar- and voxel-based detectors.In the absence of well-developed teacher-student pairs, we first study how to obtain student models with good trade offs between accuracy and efficiency from the perspectives of model compression and input resolution reduction. Then, we build a benchmark to assess existing KD methods developed in the 2D domain for 3D object detection upon six well-constructed teacher-student pairs. Further, we propose an improved KD pipeline incorporating an enhanced logit KD method that performs KD on only a few pivotal positions determined by teacher classification response, and a teacher-guided student model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · Visual Attention and Saliency Detection
MethodsKnowledge Distillation
