Class-aware Information for Logit-based Knowledge Distillation
Shuoxi Zhang, Hanpeng Liu, John E. Hopcroft, Kun He

TL;DR
This paper introduces a class-aware logit knowledge distillation method that enhances semantic transfer by incorporating class-level information, outperforming existing logit-based and feature-based distillation techniques.
Contribution
The paper proposes CLKD, a novel logit-based distillation approach that captures class-level correlations, improving semantic transfer and distillation performance.
Findings
CLKD outperforms existing logit-based methods.
CLKD surpasses feature-based distillation methods.
Empirical results show improved accuracy on visual classification tasks.
Abstract
Knowledge distillation aims to transfer knowledge to the student model by utilizing the predictions or features of the teacher model, and feature-based distillation has recently shown its superiority over logit-based distillation. However, because the extra feature transformations are cumbersome to compute and store, the training overhead of feature-based methods is much higher than that of logit-based distillation. In this work, we revisit logit-based knowledge distillation and observe that existing logit-based distillation methods treat the prediction logits only at the instance level, while much other useful semantic information is overlooked. To address this issue, we propose a Class-aware Logit Knowledge Distillation (CLKD) method that extends logit distillation to both the instance level and the class level. CLKD enables the student model to mimic higher semantic information from…
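The distinction between instance-level and class-level logit distillation can be illustrated with a small sketch. The abstract above is truncated and does not state the loss, so the following is not the authors' CLKD formulation; it is one plausible reading, assuming a batch of logits arranged as a samples-by-classes matrix, where the instance-level term compares each row (one sample's distribution over classes) and the class-level term compares each column (one class's distribution over the samples in the batch). The function names, the `temperature` value, and the `class_weight` parameter are all hypothetical.

```python
import math


def softmax(values, temperature=1.0):
    """Numerically stable softmax over a list of floats."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def transpose(matrix):
    """Turn a samples-by-classes matrix into classes-by-samples."""
    return [list(column) for column in zip(*matrix)]


def instance_level_loss(teacher_logits, student_logits, temperature=4.0):
    """Mean KL over rows: each sample's softened distribution over classes."""
    losses = [
        kl_divergence(softmax(t, temperature), softmax(s, temperature))
        for t, s in zip(teacher_logits, student_logits)
    ]
    return sum(losses) / len(losses)


def class_level_loss(teacher_logits, student_logits, temperature=4.0):
    """Mean KL over columns: each class's distribution over the batch.

    Assumption: this column-wise view is an illustration of "class-level"
    information, not the definition used in the paper.
    """
    return instance_level_loss(
        transpose(teacher_logits), transpose(student_logits), temperature
    )


def class_aware_logit_loss(teacher_logits, student_logits,
                           temperature=4.0, class_weight=1.0):
    """Hypothetical combined loss: instance term plus weighted class term."""
    return (
        instance_level_loss(teacher_logits, student_logits, temperature)
        + class_weight * class_level_loss(teacher_logits, student_logits, temperature)
    )


if __name__ == "__main__":
    # Toy batch: 2 samples, 3 classes.
    teacher = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]
    student = [[1.0, 1.0, 0.0], [0.5, 0.5, 0.5]]
    print(class_aware_logit_loss(teacher, student))
```

Both terms operate only on the logit matrix, so a loss of this shape needs no extra feature-transformation layers, which is the overhead argument the abstract makes for logit-based methods.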
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods · Image Enhancement Techniques
Methods: Knowledge Distillation
