LogME: Practical Assessment of Pre-trained Models for Transfer Learning
Kaichao You, Yong Liu, Jianmin Wang, Mingsheng Long

TL;DR
LogME is a fast, accurate, and general method for assessing pre-trained models' suitability for transfer learning across various tasks and modalities without fine-tuning, significantly speeding up model selection.
Contribution
The paper introduces LogME, a practical and universal assessment method for pre-trained models that is immune to overfitting and applicable to diverse tasks and modalities.
Findings
LogME outperforms prior methods in accuracy and speed.
LogME achieves up to 3000x speedup over brute-force fine-tuning.
It is applicable to supervised and unsupervised pre-trained models, and to classification and regression tasks.
Abstract
This paper studies task adaptive pre-trained model selection, an underexplored problem of assessing pre-trained models for the target task and select best ones from the model zoo \emph{without fine-tuning}. A few pilot works addressed the problem in transferring supervised pre-trained models to classification tasks, but they cannot handle emerging unsupervised pre-trained models or regression tasks. In pursuit of a practical assessment method, we propose to estimate the maximum value of label evidence given features extracted by pre-trained models. Unlike the maximum likelihood, the maximum evidence is \emph{immune to over-fitting}, while its expensive computation can be dramatically reduced by our carefully designed algorithm. The Logarithm of Maximum Evidence (LogME) can be used to assess pre-trained models for transfer learning: a pre-trained model with a high LogME value is likely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
