LogME: Practical Assessment of Pre-trained Models for Transfer Learning

Kaichao You; Yong Liu; Jianmin Wang; Mingsheng Long

arXiv:2102.11005·cs.LG·June 24, 2021·39 cites

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

Kaichao You, Yong Liu, Jianmin Wang, Mingsheng Long

PDF

Open Access 1 Repo 1 Video

TL;DR

LogME is a fast, accurate, and general method for assessing pre-trained models' suitability for transfer learning across various tasks and modalities without fine-tuning, significantly speeding up model selection.

Contribution

The paper introduces LogME, a practical and universal assessment method for pre-trained models that is immune to overfitting and applicable to diverse tasks and modalities.

Findings

01

LogME outperforms prior methods in accuracy and speed.

02

LogME achieves up to 3000x speedup over brute-force fine-tuning.

03

It is applicable to supervised and unsupervised pre-trained models, and to classification and regression tasks.

Abstract

This paper studies task adaptive pre-trained model selection, an underexplored problem of assessing pre-trained models for the target task and select best ones from the model zoo \emph{without fine-tuning}. A few pilot works addressed the problem in transferring supervised pre-trained models to classification tasks, but they cannot handle emerging unsupervised pre-trained models or regression tasks. In pursuit of a practical assessment method, we propose to estimate the maximum value of label evidence given features extracted by pre-trained models. Unlike the maximum likelihood, the maximum evidence is \emph{immune to over-fitting}, while its expensive computation can be dramatically reduced by our carefully designed algorithm. The Logarithm of Maximum Evidence (LogME) can be used to assess pre-trained models for transfer learning: a pre-trained model with a high LogME value is likely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thuml/LogME
pytorchOfficial

Videos

LogME: Practical Assessment of Pre-trained Models for Transfer Learning· slideslive

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques