Pre-Trained Model Recommendation for Downstream Fine-tuning

Jiameng Bai; Sai Wu; Jie Song; Junbo Zhao; Gang Chen

arXiv:2403.06382 · cs.CV · March 12, 2024 · 3 cites

TL;DR

This paper introduces Fennec, a transfer learning framework that maps models and tasks into a transfer space for efficient model selection, leveraging a large repository and a novel encoding method, archi2vec.

Contribution

The paper proposes a new transferability-based model selection framework, Fennec, with a novel model encoding method, archi2vec, and provides a comprehensive benchmark for evaluation.

Findings

1. Fennec effectively ranks models with minimal computation.

2. The transfer score computation is O(1) in time complexity.

3. Benchmark results validate the framework's effectiveness.

Abstract

As a fundamental problem in transfer learning, model selection aims to rank off-the-shelf pre-trained models and select the most suitable one for the new target task. Existing model selection techniques are often constrained in their scope and tend to overlook the nuanced relationships between models and tasks. In this paper, we present a pragmatic framework, Fennec, delving into a diverse, large-scale model repository while meticulously considering the intricate connections between tasks and models. The key insight is to map all models and historical tasks into a transfer-related subspace, where the distance between model vectors and task vectors represents the magnitude of transferability. A large vision model, as a proxy, infers a new task's representation in the transfer space, thereby circumventing the computational burden of extensive forward passes. We also investigate…
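The key insight in the abstract, ranking models by how close their vectors lie to a task vector in a shared space, can be illustrated with a minimal sketch. This is not the authors' implementation: the function `rank_models`, the model names, the three-dimensional embeddings, the choice of Euclidean distance, and the convention that a smaller distance means higher transferability are all assumptions made for illustration.

```python
import math


def rank_models(task_vec, model_vecs):
    """Return (name, distance) pairs sorted by distance to the task vector.

    Assumption: Euclidean distance, with closer meaning more transferable.
    The abstract does not specify the metric or its direction.
    """
    scored = [(name, math.dist(task_vec, vec)) for name, vec in model_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1])


# Hypothetical embeddings, invented for this example only.
model_vecs = {
    "model_a": [0.9, 0.1, 0.0],
    "model_b": [0.2, 0.8, 0.1],
    "model_c": [0.5, 0.5, 0.5],
}
task_vec = [0.3, 0.7, 0.0]

ranking = rank_models(task_vec, model_vecs)
for name, dist in ranking:
    print(f"{name}: {dist:.3f}")
```

With these made-up vectors, `model_b` ranks first because it is nearest to the task vector. Once the vectors exist, scoring a single model is one distance computation over a fixed-size vector and needs no forward pass through that model.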

Taxonomy

Topics: Real-time simulation and control systems · Model Reduction and Neural Networks