General Instance Distillation for Object Detection
Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu,, Erjin Zhou

TL;DR
This paper introduces a novel general instance distillation method for object detection that leverages relation and feature knowledge, significantly improving lightweight model performance across various frameworks.
Contribution
It proposes a general instance distillation approach that does not rely on ground truth labels and effectively utilizes relation-based knowledge for better detection accuracy.
Findings
Student models outperform teachers in various detection frameworks.
GID improves mAP of RetinaNet with ResNet-50 from 36.2% to 39.1%.
Student models can surpass teacher models in detection performance.
Abstract
In recent years, knowledge distillation has been proved to be an effective solution for model compression. This approach can make lightweight student models acquire the knowledge extracted from cumbersome teacher models. However, previous distillation methods of detection have weak generalization for different detection frameworks and rely heavily on ground truth (GT), ignoring the valuable relation information between instances. Thus, we propose a novel distillation method for detection tasks based on discriminative instances without considering the positive or negative distinguished by GT, which is called general instance distillation (GID). Our approach contains a general instance selection module (GISM) to make full use of feature-based, relation-based and response-based knowledge for distillation. Extensive results demonstrate that the student model achieves significant AP…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsKnowledge Distillation · Convolution · 1x1 Convolution · Feature Pyramid Network · Focal Loss · RetinaNet
