DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection
Zhourui Zhang, Jun Li, Zhijian Wu, Jifeng Shen, Jianhua Xu

TL;DR
This paper introduces DFMSD, a novel dual feature masking stage-wise knowledge distillation framework that effectively bridges the gap between heterogeneous teacher and student networks for improved object detection performance.
Contribution
The paper proposes a stage-wise adaptation module and masking enhancement strategy to improve heterogeneous knowledge distillation in object detection.
Findings
DFMSD outperforms state-of-the-art distillation methods.
Stage-wise adaptation effectively bridges network discrepancies.
Object-aware masking enhances feature reconstruction quality.
Abstract
In recent years, current mainstream feature masking distillation methods mainly function by reconstructing selectively masked regions of a student network from the feature maps of a teacher network. In these methods, attention mechanisms can help to identify spatially important regions and crucial object-aware channel clues, such that the reconstructed features are encoded with sufficient discriminative and representational power similar to teacher features. However, previous feature-masking distillation methods mainly address homogeneous knowledge distillation without fully taking into account the heterogeneous knowledge distillation scenario. In particular, the huge discrepancy between the teacher and the student frameworks within the heterogeneous distillation paradigm is detrimental to feature masking, leading to deteriorating reconstructed student features. In this study, a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Industrial Vision Systems and Defect Detection
MethodsSoftmax · Attention Is All You Need · Knowledge Distillation
