Distilling Object Detectors with Feature Richness
Zhixing Du, Rui Zhang, Ming Chang, Xishan Zhang, Shaoli Liu, Tianshi, Chen, Yunji Chen

TL;DR
This paper introduces a novel feature-richness score method for distilling object detectors, effectively selecting important features outside bounding boxes and removing detrimental ones, leading to improved small model performance.
Contribution
The paper proposes the Feature-Richness Score (FRS) method for better feature selection during distillation, enhancing small detector accuracy beyond existing methods.
Findings
Achieves 39.7% mAP on COCO2017 with RetinaNet and ResNet-50.
Surpasses the teacher detector performance with ResNet-101.
Effective for both anchor-based and anchor-free detectors.
Abstract
In recent years, large-scale deep models have achieved great success, but the huge computational complexity and massive storage requirements make it a great challenge to deploy them in resource-limited devices. As a model compression and acceleration method, knowledge distillation effectively improves the performance of small models by transferring the dark knowledge from the teacher detector. However, most of the existing distillation-based detection methods mainly imitating features near bounding boxes, which suffer from two limitations. First, they ignore the beneficial features outside the bounding boxes. Second, these methods imitate some features which are mistakenly regarded as the background by the teacher detector. To address the above issues, we propose a novel Feature-Richness Score (FRS) method to choose important features that improve generalized detectability during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsFeature Pyramid Network · Knowledge Distillation · 1x1 Convolution · Convolution · Focal Loss · RetinaNet
