Highlight Every Step: Knowledge Distillation via Collaborative Teaching
Haoran Zhao, Xin Sun, Junyu Dong, Changrui Chen, Zihe Dong

TL;DR
This paper introduces a collaborative teaching approach for knowledge distillation that uses two specialized teachers to guide a student network, significantly improving performance on multiple datasets.
Contribution
It proposes a novel collaborative teaching strategy employing a scratch-trained teacher and a pre-trained expert teacher to enhance knowledge distillation.
Findings
Achieves state-of-the-art performance on CIFAR-10, CIFAR-100, SVHN, and Tiny ImageNet.
Significantly improves student network accuracy through dual-teacher guidance.
Demonstrates efficiency and effectiveness of the proposed method.
Abstract
High storage and computational costs obstruct deep neural networks to be deployed on resource-constrained devices. Knowledge distillation aims to train a compact student network by transferring knowledge from a larger pre-trained teacher model. However, most existing methods on knowledge distillation ignore the valuable information among training process associated with training results. In this paper, we provide a new Collaborative Teaching Knowledge Distillation (CTKD) strategy which employs two special teachers. Specifically, one teacher trained from scratch (i.e., scratch teacher) assists the student step by step using its temporary outputs. It forces the student to approach the optimal path towards the final logits with high accuracy. The other pre-trained teacher (i.e., expert teacher) guides the student to focus on a critical region which is more useful for the task. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI
MethodsKnowledge Distillation
