Towards Estimating Transferability using Hard Subsets

Tarun Ram Menta; Surgan Jandial; Akash Patil; Vimal KB; Saketh Bachu,; Balaji Krishnamurthy; Vineeth N. Balasubramanian; Chirag Agarwal; Mausoom; Sarkar

arXiv:2301.06928·cs.LG·January 18, 2023

Towards Estimating Transferability using Hard Subsets

Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu,, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom, Sarkar

PDF

Open Access

TL;DR

This paper introduces HASTE, a method to estimate transferability of source models to target tasks using hard data subsets, improving the reliability of transferability metrics without expensive fine-tuning.

Contribution

HASTE is a novel approach that leverages hard data subsets and internal model representations to enhance transferability estimation accuracy.

Findings

01

HASTE improves the reliability of existing transferability metrics.

02

HASTE performs consistently better or on par with state-of-the-art metrics.

03

Theoretical bounds are validated empirically across multiple datasets.

Abstract

As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a particular target task using only a harder subset of target data. By leveraging the internal and output representations of model, we introduce two techniques, one class agnostic and another class specific, to identify harder subsets and show that HASTE can be used with any existing transferability metric to improve their reliability. We further analyze the relation between HASTE and the optimal average log likelihood as well as negative conditional entropy and empirically validate our theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Topic Modeling