Back to the Basics on Predicting Transfer Performance
Levy Chaves, Eduardo Valle, Alceu Bissoto, Sandra Avila

TL;DR
This paper evaluates various transferability scorers for pre-trained models, proposing benchmark guidelines and a method to combine scorers, improving transfer performance prediction across diverse datasets.
Contribution
It introduces robust benchmark guidelines and a novel technique to combine multiple transferability scorers, enhancing their predictive reliability.
Findings
Few scorers outperform raw ImageNet accuracy.
All predictors perform poorly on medical datasets.
Combining scorers improves transferability predictions.
Abstract
In the evolving landscape of deep learning, selecting the best pre-trained models from a growing number of choices is a challenge. Transferability scorers propose alleviating this scenario, but their recent proliferation, ironically, poses the challenge of their own assessment. In this work, we propose both robust benchmark guidelines for transferability scorers, and a well-founded technique to combine multiple scorers, which we show consistently improves their results. We extensively evaluate 13 scorers from literature across 11 datasets, comprising generalist, fine-grained, and medical imaging datasets. We show that few scorers match the predictive performance of the simple raw metric of models on ImageNet, and that all predictors suffer on medical datasets. Our results highlight the potential of combining different information sources for reliably predicting transferability across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac Valve Diseases and Treatments
