Selective Cross-Task Distillation
Su Lu, Han-Jia Ye, De-Chuan Zhan

TL;DR
This paper introduces a selective cross-task distillation method that efficiently identifies and utilizes the most relevant pre-trained teachers across different tasks by bridging label spaces with optimal transport, improving knowledge reuse.
Contribution
It proposes a novel framework for selecting and reusing teachers from diverse tasks via optimal transport, addressing efficiency and semantic gap challenges in knowledge distillation.
Findings
Effective teacher assessment without enumerating all models
Bridging label spaces improves cross-task knowledge transfer
Demonstrated superior performance on benchmark tasks
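As a rough illustration of what bridging two label spaces with optimal transport can look like, the sketch below computes an entropy-regularized transport plan between teacher classes and student classes using Sinkhorn iterations, then uses the row-normalized plan to project teacher probabilities into the student's label space. This is not the authors' algorithm. The class prototypes, the squared-Euclidean cost, the uniform class marginals, and the names `sinkhorn` and `map_teacher_probs` are all assumptions made for the example.

```python
import numpy as np


def sinkhorn(cost, a, b, reg=0.2, n_iters=500):
    """Entropy-regularized OT plan between marginals a and b (Sinkhorn iterations)."""
    kernel = np.exp(-cost / reg)          # Gibbs kernel, shape (n_teacher, n_student)
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)            # rescale to match column marginals
        u = a / (kernel @ v)              # rescale to match row marginals
    return u[:, None] * kernel * v[None, :]


def map_teacher_probs(teacher_probs, plan):
    """Project teacher class probabilities into the student label space."""
    # Row-normalizing the plan gives, per teacher class, a distribution over student classes.
    cond = plan / plan.sum(axis=1, keepdims=True)
    return teacher_probs @ cond


rng = np.random.default_rng(0)

# Hypothetical class prototypes (e.g. mean embeddings); assumed for illustration only.
teacher_protos = rng.normal(size=(5, 8))  # 5 teacher classes
student_protos = rng.normal(size=(3, 8))  # 3 student classes

# Squared Euclidean cost between every teacher class and every student class.
diff = teacher_protos[:, None, :] - student_protos[None, :, :]
cost = (diff ** 2).sum(axis=-1)
cost = cost / cost.max()                  # normalize so reg has a stable scale

a = np.full(5, 1.0 / 5)                   # assumed uniform teacher class marginal
b = np.full(3, 1.0 / 3)                   # assumed uniform student class marginal
plan = sinkhorn(cost, a, b)

# Four made-up teacher predictions, mapped to soft targets over the 3 student classes.
teacher_probs = rng.dirichlet(np.ones(5), size=4)
student_targets = map_teacher_probs(teacher_probs, plan)
```

Each row of `student_targets` is a probability vector over the student's classes, so it could serve as a soft target in a standard distillation loss. The total transport cost `(plan * cost).sum()` is one plausible way to score how well a teacher's label space aligns with the target task without training a student, though whether the paper scores teachers this way is not stated in the summary above.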
Abstract
The outpouring of various pre-trained models empowers knowledge distillation by providing abundant teacher resources, but a well-developed mechanism for using these teachers adequately is still lacking. With a massive model repository composed of teachers pre-trained on diverse tasks, we must surmount two obstacles when using knowledge distillation to learn a new task. First, given a fixed computing budget, it is not affordable to try each teacher and train the student repeatedly, making it necessary to seek out the most contributive teacher precisely and efficiently. Second, semantic gaps exist between the teachers and the target student since they are trained on different tasks. Thus, we need to extract knowledge from a general label space that may be different from the student's. Faced with these two challenges, we study a new setting named selective cross-task distillation that includes…
Taxonomy
Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
Methods: Knowledge Distillation
