Focal and Global Knowledge Distillation for Detectors
Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei, Zhao, Chun Yuan

TL;DR
This paper introduces Focal and Global Distillation (FGD), a novel knowledge distillation method for object detection that improves student detector performance by focusing on critical pixels and global relations, applicable across various detectors.
Contribution
The paper proposes FGD, a new distillation approach that separates foreground and background and incorporates global pixel relations, addressing challenges in distilling complex detection models.
Findings
Significant mAP improvements on COCO2017 across multiple detectors.
FGD outperforms baseline models by 2.9% to 3.6% mAP.
Applicable to various detector architectures with only feature map loss calculation.
Abstract
Knowledge distillation has been applied to image classification successfully. However, object detection is much more sophisticated and most knowledge distillation methods have failed on it. In this paper, we point out that in object detection, the features of the teacher and student vary greatly in different areas, especially in the foreground and background. If we distill them equally, the uneven differences between feature maps will negatively affect the distillation. Thus, we propose Focal and Global Distillation (FGD). Focal distillation separates the foreground and background, forcing the student to focus on the teacher's critical pixels and channels. Global distillation rebuilds the relation between different pixels and transfers it from teachers to students, compensating for missing global information in focal distillation. As our method only needs to calculate the loss on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
MethodsFeature Pyramid Network · Knowledge Distillation · 1x1 Convolution · Convolution · Focal Loss · RetinaNet · RepPoints
