Assessing Pre-Trained Models for Transfer Learning Through Distribution of Spectral Components
Tengxue Zhang, Yang Shu, Xinyang Chen, Yifei Long, Chenjuan Guo, Bin, Yang

TL;DR
This paper introduces DISCO, a spectral component distribution-based assessment method for pre-trained models, which predicts transferability by analyzing the spectral features' singular values, improving model selection for downstream tasks.
Contribution
It proposes a novel spectral distribution approach for evaluating pre-trained models' transferability, incorporating downstream labels for more accurate assessment.
Findings
Achieves state-of-the-art model selection performance.
Effective across classification and regression tasks.
Demonstrates robustness on multiple benchmarks.
Abstract
Pre-trained model assessment for transfer learning aims to identify the optimal candidate for the downstream tasks from a model hub, without the need of time-consuming fine-tuning. Existing advanced works mainly focus on analyzing the intrinsic characteristics of the entire features extracted by each pre-trained model or how well such features fit the target labels. This paper proposes a novel perspective for pre-trained model assessment through the Distribution of Spectral Components (DISCO). Through singular value decomposition of features extracted from pre-trained models, we investigate different spectral components and observe that they possess distinct transferability, contributing diversely to the fine-tuning performance. Inspired by this, we propose an assessment method based on the distribution of spectral components which measures the proportions of their corresponding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Speech Recognition and Synthesis · Neural Networks and Applications
MethodsFocus
