Towards Efficient 3D Object Detection with Knowledge Distillation

Jihan Yang; Shaoshuai Shi; Runyu Ding; Zhe Wang; Xiaojuan Qi

arXiv:2205.15156 · cs.CV · October 17, 2022 · 28 cites

TL;DR

This paper explores knowledge distillation techniques to develop efficient 3D object detectors that balance accuracy and computational cost, achieving high performance with reduced model complexity.

Contribution

The paper introduces an improved KD pipeline that combines an enhanced logit KD method with teacher-guided student model initialization, along with a benchmark that assesses KD methods from the 2D domain on 3D object detection.

Findings

01. The best-performing model achieves 65.75% LEVEL 2 mAPH on the Waymo dataset, surpassing its teacher model.

02. The most efficient model runs at 51 FPS on an NVIDIA A100, 2.2x faster than PointPillar.

03. FLOPs are reduced to 44% of the teacher model's while maintaining high accuracy.

Abstract

Despite substantial progress in 3D object detection, advanced 3D detectors often suffer from heavy computation overheads. To this end, we explore the potential of knowledge distillation (KD) for developing efficient 3D object detectors, focusing on popular pillar- and voxel-based detectors. In the absence of well-developed teacher-student pairs, we first study how to obtain student models with good trade-offs between accuracy and efficiency from the perspectives of model compression and input resolution reduction. Then, we build a benchmark to assess existing KD methods developed in the 2D domain for 3D object detection upon six well-constructed teacher-student pairs. Further, we propose an improved KD pipeline incorporating an enhanced logit KD method that performs KD on only a few pivotal positions determined by teacher classification response, and a teacher-guided student model…
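
The idea of restricting logit KD to a few pivotal positions can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `pivotal_logit_kd_loss`, the `(C, H, W)` layout of the classification maps, the top-k selection rule, the value of `k`, and the mean-squared-error loss on sigmoid outputs are all assumptions chosen for illustration. The sketch only shows that positions are ranked by the teacher's classification response and that the student is matched to the teacher at the selected positions alone.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def pivotal_logit_kd_loss(teacher_logits, student_logits, k=4):
    """Hypothetical sketch of logit KD on teacher-selected positions.

    teacher_logits, student_logits: arrays of shape (C, H, W) holding
    per-class classification logits on a bird's-eye-view grid (assumed layout).
    k: number of pivotal positions to keep (assumed hyperparameter).
    Returns (loss, flat indices of the selected positions).
    """
    assert teacher_logits.shape == student_logits.shape
    num_classes = teacher_logits.shape[0]

    t_prob = sigmoid(teacher_logits).reshape(num_classes, -1)  # (C, H*W)
    s_prob = sigmoid(student_logits).reshape(num_classes, -1)  # (C, H*W)

    # Teacher classification response: highest class score at each position.
    response = t_prob.max(axis=0)  # (H*W,)

    # Keep only the k positions with the strongest teacher response.
    k = max(1, min(k, response.size))
    pivotal = np.argpartition(-response, k - 1)[:k]

    # Assumed loss: mean squared error between teacher and student scores,
    # evaluated at the pivotal positions only.
    diff = t_prob[:, pivotal] - s_prob[:, pivotal]
    return float(np.mean(diff ** 2)), pivotal
```

Under these assumptions, a student that agrees with the teacher at the pivotal positions incurs zero loss regardless of its predictions elsewhere, which is what separates this form from dense logit KD over the full map.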

Code & Models

Repositories

cvmi-lab/sparsekd (PyTorch, official)

Videos

Towards Efficient 3D Object Detection with Knowledge Distillation · SlidesLive

Taxonomy

Topics: Advanced Neural Network Applications · Medical Imaging and Analysis · Visual Attention and Saliency Detection

Methods: Knowledge Distillation