FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation
Jianlong Yuan, Qian Qi, Fei Du, Zhibin Wang, Fan Wang, Yifan Liu

TL;DR
FAKD introduces feature-space augmentations for knowledge distillation in semantic segmentation, improving performance by generating diverse augmented features and optimizing them simultaneously, without significant computational overhead.
Contribution
The paper proposes a novel feature-space augmentation technique for knowledge distillation, enhancing semantic segmentation performance beyond existing methods.
Findings
Boosts segmentation accuracy across four benchmarks.
Achieves performance gains without additional computational costs.
Effectively generates diverse feature augmentations for student training.
Abstract
In this work, we explore data augmentations for knowledge distillation on semantic segmentation. To avoid over-fitting to the noise in the teacher network, a large number of training examples is essential for knowledge distillation. Imagelevel argumentation techniques like flipping, translation or rotation are widely used in previous knowledge distillation framework. Inspired by the recent progress on semantic directions on feature-space, we propose to include augmentations in feature space for efficient distillation. Specifically, given a semantic direction, an infinite number of augmentations can be obtained for the student in the feature space. Furthermore, the analysis shows that those augmentations can be optimized simultaneously by minimizing an upper bound for the losses defined by augmentations. Based on the observation, a new algorithm is developed for knowledge distillation in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation· youtube
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsKnowledge Distillation
