Improved Knowledge Distillation via Adversarial Collaboration

Zhiqiang Liu; Chengkai Huang; Yanxia Liu

arXiv:2111.14356·cs.CV·November 30, 2021·1 cites

Improved Knowledge Distillation via Adversarial Collaboration

Zhiqiang Liu, Chengkai Huang, Yanxia Liu

PDF

Open Access

TL;DR

This paper introduces ACKD, a novel knowledge distillation method that uses adversarial collaboration and auxiliary learners to enhance student model capacity, effectively reducing the capacity gap issue.

Contribution

It proposes a new adversarial collaborative learning framework with auxiliary learners to improve knowledge distillation performance.

Findings

01

ACKD outperforms existing methods on four classification tasks.

02

The use of adversarial collaborative module enhances student capacity.

03

Auxiliary learners contribute to better knowledge transfer.

Abstract

Knowledge distillation has become an important approach to obtain a compact yet effective model. To achieve this goal, a small student model is trained to exploit the knowledge of a large well-trained teacher model. However, due to the capacity gap between the teacher and the student, the student's performance is hard to reach the level of the teacher. Regarding this issue, existing methods propose to reduce the difficulty of the teacher's knowledge via a proxy way. We argue that these proxy-based methods overlook the knowledge loss of the teacher, which may cause the student to encounter capacity bottlenecks. In this paper, we alleviate the capacity gap problem from a new perspective with the purpose of averting knowledge loss. Instead of sacrificing part of the teacher's knowledge, we propose to build a more powerful student via adversarial collaborative learning. To this end, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsKnowledge Distillation