Distilling Object Detectors with Task Adaptive Regularization
Ruoyu Sun, Fuhui Tang, Xiaopeng Zhang, Hongkai Xiong, Qi Tian

TL;DR
This paper introduces a task adaptive knowledge distillation framework for object detectors, selectively transferring knowledge to improve efficiency and accuracy, especially on low-end devices.
Contribution
It proposes a novel adaptive distillation method that transfers knowledge based on task-specific priors and introduces a distillation decay strategy to enhance model generalization.
Findings
Achieves 39.0% mAP on COCO with ResNet-50, surpassing the baseline and even the teacher model.
Effectively transfers knowledge at multiple levels of the detector architecture.
Demonstrates improved detection performance with reduced computational costs.
Abstract
Current state-of-the-art object detectors are at the expense of high computational costs and are hard to deploy to low-end devices. Knowledge distillation, which aims at training a smaller student network by transferring knowledge from a larger teacher model, is one of the promising solutions for model miniaturization. In this paper, we investigate each module of a typical detector in depth, and propose a general distillation framework that adaptively transfers knowledge from teacher to student according to the task specific priors. The intuition is that simply distilling all information from teacher to student is not advisable, instead we should only borrow priors from the teacher model where the student cannot perform well. Towards this goal, we propose a region proposal sharing mechanism to interflow region responses between the teacher and student models. Based on this, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification
MethodsSoftmax · Region Proposal Network · Convolution · 1x1 Convolution · RoIPool · Feature Pyramid Network · Faster R-CNN
