AI-KD: Adversarial learning and Implicit regularization for   self-Knowledge Distillation

Hyungmin Kim; Sungho Suh; Sunghyun Baek; Daehwan Kim; Daun Jeong,; Hansang Cho; and Junmo Kim

arXiv:2211.10938·cs.CV·March 22, 2024

AI-KD: Adversarial learning and Implicit regularization for self-Knowledge Distillation

Hyungmin Kim, Sungho Suh, Sunghyun Baek, Daehwan Kim, Daun Jeong,, Hansang Cho, and Junmo Kim

PDF

Open Access

TL;DR

AI-KD introduces an adversarial and regularization-based self-knowledge distillation method that enhances model training by aligning predictive distributions with pre-trained models, leading to improved performance.

Contribution

The paper proposes a novel AI-KD method combining adversarial learning and implicit regularization for self-knowledge distillation, which better aligns predictive distributions.

Findings

01

Achieves superior performance over state-of-the-art methods.

02

Effectively distills deterministic and progressive knowledge.

03

Improves distribution alignment between models.

Abstract

We present a novel adversarial penalized self-knowledge distillation method, named adversarial learning and implicit regularization for self-knowledge distillation (AI-KD), which regularizes the training procedure by adversarial learning and implicit distillations. Our model not only distills the deterministic and progressive knowledge which are from the pre-trained and previous epoch predictive probabilities but also transfers the knowledge of the deterministic predictive distributions using adversarial learning. The motivation is that the self-knowledge distillation methods regularize the predictive probabilities with soft targets, but the exact distributions may be hard to predict. Our method deploys a discriminator to distinguish the distributions between the pre-trained and student models while the student model is trained to fool the discriminator in the trained procedure. Thus,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning

MethodsALIGN