DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation

Guang Yang; Yin Tang; Zhijian Wu; Jun Li; Jianhua Xu; Xili Wan

arXiv:2309.02719 · cs.CV · September 8, 2023 · 1 citation

TL;DR

This paper introduces DMKD, a novel feature-based knowledge distillation method for object detection that uses dual masking to capture both spatial and channel-wise informative clues, leading to improved student network performance.

Contribution

The study proposes a dual-attention-guided masking framework that captures both spatial and channel-wise feature information for better knowledge distillation in object detection.

Findings

01. Achieved performance improvements of 4.1% and 4.3% with RetinaNet and Cascade Mask R-CNN, respectively.

02. Outperformed existing state-of-the-art distillation methods.

03. Effective fusion of spatial and channel-wise features enhances detection accuracy.

Abstract

Recent mainstream masked distillation methods work by reconstructing selectively masked areas of a student network from the feature map of its teacher counterpart. In these methods, the masked regions need to be properly selected, so that the reconstructed features encode sufficient discrimination and representation capability, like the teacher feature. However, previous masked distillation methods focus only on spatial masking, which biases the resulting masked areas towards spatial importance without encoding informative channel clues. In this study, we devise a Dual Masked Knowledge Distillation (DMKD) framework which can capture both spatially important and channel-wise informative clues for comprehensive masked feature reconstruction. More specifically, we employ a dual attention mechanism for guiding the respective masking branches, leading to reconstructed feature encoding dual…
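
The dual masking idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation; every name and design choice in it is an assumption made for illustration. In particular, the attention definitions (softmax over mean absolute teacher activations), the rule of masking the top-attention fraction, the identity `reconstruct` placeholder standing in for a learned generation block, and the equal-weight average of the two branch losses are all hypothetical choices that the abstract does not specify.

```python
import numpy as np


def softmax(x, temperature=1.0):
    """Numerically stable softmax over a 1-D array."""
    z = (x - x.max()) / temperature
    e = np.exp(z)
    return e / e.sum()


def spatial_attention(feat, temperature=0.5):
    """Assumed spatial attention: softmax over per-position mean |activation|."""
    _, h, w = feat.shape
    energy = np.abs(feat).mean(axis=0)                      # (H, W)
    return (h * w * softmax(energy.ravel(), temperature)).reshape(h, w)


def channel_attention(feat, temperature=0.5):
    """Assumed channel attention: softmax over per-channel mean |activation|."""
    c = feat.shape[0]
    energy = np.abs(feat).mean(axis=(1, 2))                 # (C,)
    return c * softmax(energy, temperature)


def top_fraction_mask(attention, fraction):
    """Return 0 for the top `fraction` of attention values (masked), 1 elsewhere."""
    k = int(round(fraction * attention.size))
    mask = np.ones(attention.size)
    if k > 0:
        mask[np.argsort(attention.ravel())[-k:]] = 0.0
    return mask.reshape(attention.shape)


def dual_masked_loss(student, teacher, fraction=0.5, reconstruct=lambda x: x):
    """Mask the student feature in two branches and compare each to the teacher.

    `reconstruct` is a hypothetical stand-in for a learned generation block;
    the identity default only keeps the sketch self-contained.
    """
    spatial_mask = top_fraction_mask(spatial_attention(teacher), fraction)   # (H, W)
    channel_mask = top_fraction_mask(channel_attention(teacher), fraction)   # (C,)
    spatial_branch = reconstruct(student * spatial_mask[None, :, :])
    channel_branch = reconstruct(student * channel_mask[:, None, None])
    loss_spatial = np.mean((spatial_branch - teacher) ** 2)
    loss_channel = np.mean((channel_branch - teacher) ** 2)
    # Equal weighting is an assumption, not the paper's fusion strategy.
    return 0.5 * (loss_spatial + loss_channel), spatial_mask, channel_mask


rng = np.random.default_rng(0)
teacher = rng.normal(size=(8, 4, 4))                        # toy (C, H, W) feature map
student = teacher + 0.1 * rng.normal(size=(8, 4, 4))
loss, spatial_mask, channel_mask = dual_masked_loss(student, teacher)
```

In this sketch the spatial mask is shared across all channels and the channel mask is shared across all positions, so the two branches hide complementary parts of the student feature. In a real training setup, `reconstruct` would be a trainable module and the loss would be added to the detector's task loss.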

Taxonomy

Topics: Advanced Neural Network Applications · Visual Attention and Saliency Detection · Image Enhancement Techniques

Methods: 1x1 Convolution · Feature Pyramid Network · Region Proposal Network · Softmax · Focal Loss · Cascade Mask R-CNN · RetinaNet · Focus · RoIAlign · Convolution