Towards Efficient Task-Driven Model Reprogramming with Foundation Models

Shoukai Xu; Jiangchao Yao; Ran Luo; Shuhai Zhang; Zihao Lian; Mingkui; Tan; Bo Han; Yaowei Wang

arXiv:2304.02263·cs.CV·May 9, 2023·1 cites

Towards Efficient Task-Driven Model Reprogramming with Foundation Models

Shoukai Xu, Jiangchao Yao, Ran Luo, Shuhai Zhang, Zihao Lian, Mingkui, Tan, Bo Han, Yaowei Wang

PDF

Open Access

TL;DR

This paper introduces a Task-Driven Model Reprogramming framework that enables efficient knowledge transfer from large vision foundation models to smaller downstream models, addressing domain mismatch and limited data issues.

Contribution

The proposed TDMR framework reprograms foundation models into a proxy space and uses progressive distillation, allowing effective transfer to various target models with limited data.

Findings

01

TDMR improves transfer efficiency across CNN and transformer models.

02

The method outperforms traditional fine-tuning and knowledge distillation approaches.

03

TDMR is compatible with different model architectures and limited target data.

Abstract

Vision foundation models exhibit impressive power, benefiting from the extremely large model capacity and broad training data. However, in practice, downstream scenarios may only support a small model due to the limited computational resources or efficiency considerations. Moreover, the data used for pretraining foundation models are usually invisible and very different from the target data of downstream tasks. This brings a critical challenge for the real-world application of foundation models: one has to transfer the knowledge of a foundation model to the downstream task that has a quite different architecture with only downstream target data. Existing transfer learning or knowledge distillation methods depend on either the same model structure or finetuning of the foundation model. Thus, naively introducing these methods can be either infeasible or very inefficient. To address this,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsKnowledge Distillation