AMD: Adaptive Masked Distillation for Object Detection
Guang Yang, Yin Tang, Jun Li, Jianhua Xu, Xili Wan

TL;DR
This paper introduces AMD, a novel feature distillation framework for object detection that uses spatial-channel adaptive masking to improve student model performance, outperforming previous methods.
Contribution
The paper proposes a spatial-channel adaptive masked distillation network that enhances feature reconstruction for object detection, surpassing prior distillation techniques.
Findings
Student models achieve higher mAP scores with AMD.
AMD outperforms previous distillation methods like FGD and MGD.
The method improves object perception and detection capabilities.
Abstract
As a general model compression paradigm, feature-based knowledge distillation allows the student model to learn expressive features from the teacher counterpart. In this paper, we mainly focus on designing an effective feature-distillation framework and propose a spatial-channel adaptive masked distillation (AMD) network for object detection. More specifically, in order to accurately reconstruct important feature regions, we first perform attention-guided feature masking on the feature map of the student network, such that we can identify the important features via spatially adaptive feature masking instead of random masking in the previous methods. In addition, we employ a simple and efficient module to allow the student network channel to be adaptive, improving its model capability in object perception and detection. In contrast to the previous methods, more crucial object-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsConvolution · 1x1 Convolution · Focal Loss · Feature Pyramid Network · RepPoints · RetinaNet · Knowledge Distillation
