AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation

Zihao Tang; Zheqi Lv; Shengyu Zhang; Yifan Zhou; Xinyu Duan; Fei Wu; Kun Kuang

arXiv:2403.07030·cs.LG·March 19, 2024·1 cites

TL;DR

AuG-KD introduces an anchor-based mixup technique guided by uncertainty to improve out-of-domain knowledge distillation, effectively transferring relevant teacher knowledge to student models without access to original training data.

Contribution

The paper proposes a novel anchor-based mixup method that selectively transfers the teacher knowledge aligned with the student domain in data-free settings.
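
As a rough illustration of the mixup idea named above, the sketch below interpolates between a student-domain sample and an anchor using a coefficient `lam`. It shows only generic mixup (a convex combination of two vectors). How AuG-KD learns its anchors, how uncertainty guides the process, and whether mixing happens in input or latent space are not described on this page, so the function name `mixup`, the variable names, and the toy vectors are all assumptions made for illustration.

```python
def mixup(sample, anchor, lam):
    """Generic mixup: element-wise lam * sample + (1 - lam) * anchor.

    Hypothetical helper for illustration only; this is not the
    AuG-KD procedure, whose anchor learning is not covered here.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    if len(sample) != len(anchor):
        raise ValueError("sample and anchor must have the same length")
    return [lam * s + (1.0 - lam) * a for s, a in zip(sample, anchor)]


# Toy vectors standing in for a student-domain sample and an anchor
# that represents the teacher domain (both made up for this sketch).
student_sample = [0.2, 0.8, 0.5]
teacher_anchor = [1.0, 0.0, 0.5]

# lam = 1.0 returns the sample unchanged; lam = 0.0 returns the anchor.
mixed = mixup(student_sample, teacher_anchor, lam=0.75)
```

Sweeping `lam` between 0 and 1 moves the mixed point along the line between the two vectors, which is one simple way to trade off how much of each domain the result reflects.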

Findings

01. Outperforms existing DFKD methods across multiple datasets

02. Demonstrates stability and robustness in various settings

03. Effectively balances OOD knowledge transfer and domain-specific learning

Abstract

Due to privacy or patent concerns, a growing number of large models are released without granting access to their training data, making transferring their knowledge inefficient and problematic. In response, Data-Free Knowledge Distillation (DFKD) methods have emerged as direct solutions. However, simply adopting models derived from DFKD for real-world applications suffers significant performance degradation, due to the discrepancy between teachers' training data and real-world scenarios (student domain). The degradation stems from the portions of teachers' knowledge that are not applicable to the student domain. They are specific to the teacher domain and would undermine students' performance. Hence, selectively transferring teachers' appropriate knowledge becomes the primary challenge in DFKD. In this work, we propose a simple but effective method AuG-KD. It utilizes an…

Code & Models

Repositories

ishikura-a/aug-kd (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

Methods: Knowledge Distillation · ALIGN · Mixup