DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation
Guang Yang, Yin Tang, Zhijian Wu, Jun Li, Jianhua Xu, Xili Wan

TL;DR
This paper introduces DMKD, a novel feature-based knowledge distillation method for object detection that uses dual masking to capture both spatial and channel-wise informative clues, leading to improved student network performance.
Contribution
The study proposes a dual attention guided masking framework that captures comprehensive feature information for better knowledge distillation in object detection.
Findings
Achieved 4.1% and 4.3% performance improvements on RetinaNet and Cascade Mask R-CNN.
Outperformed existing state-of-the-art distillation methods.
Effective fusion of spatial and channel-wise features enhances detection accuracy.
Abstract
Recent mainstream masked distillation methods function by reconstructing selectively masked areas of a student network from the feature map of its teacher counterpart. In these methods, the masked regions need to be properly selected, such that reconstructed features encode sufficient discrimination and representation capability like the teacher feature. However, previous masked distillation methods only focus on spatial masking, making the resulting masked areas biased towards spatial importance without encoding informative channel clues. In this study, we devise a Dual Masked Knowledge Distillation (DMKD) framework which can capture both spatially important and channel-wise informative clues for comprehensive masked feature reconstruction. More specifically, we employ dual attention mechanism for guiding the respective masking branches, leading to reconstructed feature encoding dual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Image Enhancement Techniques
Methods1x1 Convolution · Feature Pyramid Network · Region Proposal Network · Softmax · Focal Loss · Cascade Mask R-CNN · RetinaNet · Focus · RoIAlign · Convolution
