Semi-Online Knowledge Distillation
Zhiqiang Liu, Yanxia Liu, Chengkai Huang

TL;DR
This paper introduces Semi-Online Knowledge Distillation (SOKD), a unified framework combining knowledge distillation and deep mutual learning to enhance model compression and performance, validated by extensive experiments on CIFAR-100 and ImageNet.
Contribution
It proposes a novel SOKD method that integrates KD and DML, improving student and teacher performance through peer-teaching and supervision signals.
Findings
Achieves state-of-the-art results on CIFAR-100.
Demonstrates significant performance improvements on ImageNet.
Framework can be extended to feature-based distillation methods.
Abstract
Knowledge distillation is an effective and stable method for model compression via knowledge transfer. Conventional knowledge distillation (KD) is to transfer knowledge from a large and well pre-trained teacher network to a small student network, which is a one-way process. Recently, deep mutual learning (DML) has been proposed to help student networks learn collaboratively and simultaneously. However, to the best of our knowledge, KD and DML have never been jointly explored in a unified framework to solve the knowledge distillation problem. In this paper, we investigate that the teacher model supports more trustworthy supervision signals in KD, while the student captures more similar behaviors from the teacher in DML. Based on these observations, we first propose to combine KD with DML in a unified framework. Furthermore, we propose a Semi-Online Knowledge Distillation (SOKD) method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
MethodsKnowledge Distillation
