Transferability Estimation using Bhattacharyya Class Separability
Michal P\'andy, Andrea Agostinelli, Jasper Uijlings, Vittorio, Ferrari, Thomas Mensink

TL;DR
This paper introduces GBC, a new method to estimate transferability of pre-trained models to target tasks by measuring class separability in feature space, enabling better model and dataset selection.
Contribution
The paper proposes Gaussian Bhattacharyya Coefficient (GBC), a novel, efficient transferability metric applicable to classification and segmentation tasks, outperforming existing methods in various settings.
Findings
GBC outperforms state-of-the-art metrics in semantic segmentation transferability.
GBC matches top methods for dataset transferability in image classification.
GBC excels in architecture selection for image classification.
Abstract
Transfer learning has become a popular method for leveraging pre-trained models in computer vision. However, without performing computationally expensive fine-tuning, it is difficult to quantify which pre-trained source models are suitable for a specific target task, or, conversely, to which tasks a pre-trained source model can be easily adapted to. In this work, we propose Gaussian Bhattacharyya Coefficient (GBC), a novel method for quantifying transferability between a source model and a target dataset. In a first step we embed all target images in the feature space defined by the source model, and represent them with per-class Gaussians. Then, we estimate their pairwise class separability using the Bhattacharyya coefficient, yielding a simple and effective measure of how well the source model transfers to the target task. We evaluate GBC on image classification tasks in the context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications
