PACTran: PAC-Bayesian Metrics for Estimating the Transferability of   Pretrained Models to Classification Tasks

Nan Ding; Xi Chen; Tomer Levinboim; Beer Changpinyo; Radu Soricut

arXiv:2203.05126·cs.LG·July 20, 2022·1 cites

PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks

Nan Ding, Xi Chen, Tomer Levinboim, Beer Changpinyo, Radu Soricut

PDF

Open Access 1 Repo

TL;DR

PACTran introduces theoretically grounded PAC-Bayesian metrics for selecting pretrained models, demonstrating superior transferability measurement across vision and language tasks compared to existing heuristics.

Contribution

The paper develops PACTran, a novel family of transferability metrics derived from PAC-Bayesian bounds, providing a theoretically justified approach for pretrained model selection.

Findings

01

PACTran outperforms existing heuristics in transferability measurement.

02

Empirical evaluation on VTAB and OKVQA shows PACTran's effectiveness.

03

PACTran offers a more consistent transferability assessment.

Abstract

With the increasing abundance of pretrained models in recent years, the problem of selecting the best pretrained checkpoint for a particular downstream classification task has been gaining increased attention. Although several methods have recently been proposed to tackle the selection problem (e.g. LEEP, H-score), these methods resort to applying heuristics that are not well motivated by learning theory. In this paper we present PACTran, a theoretically grounded family of metrics for pretrained model selection and transferability measurement. We first show how to derive PACTran metrics from the optimal PAC-Bayesian bound under the transfer learning setting. We then empirically evaluate three metric instantiations of PACTran on a number of vision tasks (VTAB) as well as a language-and-vision (OKVQA) task. An analysis of the results shows PACTran is a more consistent and effective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/pactran_metrics
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Explainable Artificial Intelligence (XAI)