Local Dense Logit Relations for Enhanced Knowledge Distillation
Liuchi Xu, Kang Liu, Jinshuai Liu, Lu Wang, Lisheng Xu, Jun Cheng

TL;DR
This paper introduces a novel knowledge distillation method called Local Dense Relational Logit Distillation (LDRLD) that captures fine-grained inter-class relationships and adaptively emphasizes critical class pairs to improve student model performance.
Contribution
The paper proposes LDRLD, a new method for logit distillation that models detailed class relationships and uses adaptive weighting strategies for enhanced knowledge transfer.
Findings
LDRLD outperforms existing logit distillation methods on CIFAR-100, ImageNet-1K, and Tiny-ImageNet.
The adaptive decay weight strategy effectively emphasizes important class relationships.
Recursive decoupling improves the clarity and detail of transferred knowledge.
Abstract
State-of-the-art logit distillation methods exhibit versatility, simplicity, and efficiency. Despite the advances, existing studies have yet to delve thoroughly into fine-grained relationships within logit knowledge. In this paper, we propose Local Dense Relational Logit Distillation (LDRLD), a novel method that captures inter-class relationships through recursively decoupling and recombining logit information, thereby providing more detailed and clearer insights for student learning. To further optimize the performance, we introduce an Adaptive Decay Weight (ADW) strategy, which can dynamically adjust the weights for critical category pairs using Inverse Rank Weighting (IRW) and Exponential Rank Decay (ERD). Specifically, IRW assigns weights inversely proportional to the rank differences between pairs, while ERD adaptively controls weight decay based on total ranking scores of category…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
