Knowledge Condensation Distillation

Chenxin Li; Mingbao Lin; Zhiyuan Ding; Nie Lin; Yihong Zhuang; Yue; Huang; Xinghao Ding; Liujuan Cao

arXiv:2207.05409·cs.CV·July 13, 2022·1 cites

Knowledge Condensation Distillation

Chenxin Li, Mingbao Lin, Zhiyuan Ding, Nie Lin, Yihong Zhuang, Yue, Huang, Xinghao Ding, Liujuan Cao

PDF

Open Access 2 Repos

TL;DR

Knowledge Condensation Distillation (KCD) enhances traditional knowledge distillation by dynamically estimating knowledge value and iteratively condensing a compact knowledge set, leading to more efficient and effective student learning.

Contribution

KCD introduces a dynamic, EM-based framework for condensing teacher knowledge, improving distillation efficiency without extra parameters.

Findings

01

Boosts student model performance on benchmarks.

02

Achieves higher distillation efficiency.

03

No significant additional computational overhead.

Abstract

Knowledge Distillation (KD) transfers the knowledge from a high-capacity teacher network to strengthen a smaller student. Existing methods focus on excavating the knowledge hints and transferring the whole knowledge to the student. However, the knowledge redundancy arises since the knowledge shows different values to the student at different learning stages. In this paper, we propose Knowledge Condensation Distillation (KCD). Specifically, the knowledge value on each sample is dynamically estimated, based on which an Expectation-Maximization (EM) framework is forged to iteratively condense a compact knowledge set from the teacher to guide the student learning. Our approach is easy to build on top of the off-the-shelf KD methods, with no extra training parameters and negligible computation overhead. Thus, it presents one new perspective for KD, in which the student that actively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Neural Networks and Applications · Advanced Neural Network Applications