Occam's model: Selecting simpler representations for better   transferability estimation

Prabhant Singh; Sibylle Hess; Joaquin Vanschoren

arXiv:2502.06925·cs.LG·February 12, 2025

Occam's model: Selecting simpler representations for better transferability estimation

Prabhant Singh, Sibylle Hess, Joaquin Vanschoren

PDF

Open Access

TL;DR

This paper introduces two novel metrics for transferability estimation of pre-trained models, demonstrating improved accuracy and robustness across diverse tasks, supported by theoretical insights and empirical evaluations.

Contribution

The paper proposes new transferability metrics based on model representations, with theoretical analysis and superior empirical performance over existing methods.

Findings

01

Metrics increase Kendall's Tau by up to 32%

02

Demonstrates robustness across diverse problem settings

03

Provides theoretical insights into transferability estimation

Abstract

Fine-tuning models that have been pre-trained on large datasets has become a cornerstone of modern machine learning workflows. With the widespread availability of online model repositories, such as Hugging Face, it is now easier than ever to fine-tune pre-trained models for specific tasks. This raises a critical question: which pre-trained model is most suitable for a given task? This problem is called transferability estimation. In this work, we introduce two novel and effective metrics for estimating the transferability of pre-trained models. Our approach is grounded in viewing transferability as a measure of how easily a pre-trained model's representations can be trained to separate target classes, providing a unique perspective on transferability estimation. We rigorously evaluate the proposed metrics against state-of-the-art alternatives across diverse problem settings,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis