Divergence-Based Domain Transferability for Zero-Shot Classification
Alexander Pugantsov, Richard McCreadie

TL;DR
This paper introduces divergence-based statistical measures to efficiently identify beneficial task pairs for zero-shot transfer learning, reducing tuning time while maintaining performance.
Contribution
It proposes a novel divergence-based approach to estimate task transferability, enabling selective fine-tuning without extensive task pair testing.
Findings
Statistical measures can effectively predict beneficial task pairs.
The method reduces tuning runtime by up to 40%.
Experiments on 58 tasks and 6,600 task pairs validate the approach.
Abstract
Transferring learned patterns from pretrained neural language models has been shown to significantly improve effectiveness across a variety of language-based tasks, meanwhile further tuning on intermediate tasks has been demonstrated to provide additional performance benefits, provided the intermediate task is sufficiently related to the target task. However, how to identify related tasks is an open problem, and brute-force searching effective task combinations is prohibitively expensive. Hence, the question arises, are we able to improve the effectiveness and efficiency of tasks with no training examples through selective fine-tuning? In this paper, we explore statistical measures that approximate the divergence between domain representations as a means to estimate whether tuning using one task pair will exhibit performance benefits over tuning another. This estimation can then be used…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Speech Recognition and Synthesis
