General Instance Distillation for Object Detection

Xing Dai; Zeren Jiang; Zhao Wu; Yiping Bao; Zhicheng Wang; Si Liu,; Erjin Zhou

arXiv:2103.02340·cs.CV·May 3, 2021·6 cites

General Instance Distillation for Object Detection

Xing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu,, Erjin Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel general instance distillation method for object detection that leverages relation and feature knowledge, significantly improving lightweight model performance across various frameworks.

Contribution

It proposes a general instance distillation approach that does not rely on ground truth labels and effectively utilizes relation-based knowledge for better detection accuracy.

Findings

01

Student models outperform teachers in various detection frameworks.

02

GID improves mAP of RetinaNet with ResNet-50 from 36.2% to 39.1%.

03

Student models can surpass teacher models in detection performance.

Abstract

In recent years, knowledge distillation has been proved to be an effective solution for model compression. This approach can make lightweight student models acquire the knowledge extracted from cumbersome teacher models. However, previous distillation methods of detection have weak generalization for different detection frameworks and rely heavily on ground truth (GT), ignoring the valuable relation information between instances. Thus, we propose a novel distillation method for detection tasks based on discriminative instances without considering the positive or negative distinguished by GT, which is called general instance distillation (GID). Our approach contains a general instance selection module (GISM) to make full use of feature-based, relation-based and response-based knowledge for distillation. Extensive results demonstrate that the student model achieves significant AP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

daixinghome/Distill_GID_detectron2
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsKnowledge Distillation · Convolution · 1x1 Convolution · Feature Pyramid Network · Focal Loss · RetinaNet