Towards Few-Shot Adaptation of Foundation Models via Multitask   Finetuning

Zhuoyan Xu; Zhenmei Shi; Junyi Wei; Fangzhou Mu; Yin Li; Yingyu Liang

arXiv:2402.15017·cs.LG·February 26, 2024·1 cites

Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

Zhuoyan Xu, Zhenmei Shi, Junyi Wei, Fangzhou Mu, Yin Li, Yingyu Liang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper provides a theoretical and empirical study on how multitask finetuning of foundation models with diverse related tasks improves adaptation to new, low-label target tasks, and proposes a task selection algorithm.

Contribution

It offers a theoretical analysis linking task diversity and consistency to improved target task performance and introduces a practical task selection method.

Findings

01

Multitask finetuning reduces error on target tasks compared to direct adaptation.

02

Diversity and consistency metrics predict finetuning effectiveness.

03

The proposed task selection algorithm improves model performance on target tasks.

Abstract

Foundation models have emerged as a powerful tool for many AI problems. Despite the tremendous success of foundation models, effective adaptation to new tasks, particularly those with limited labels, remains an open question and lacks theoretical understanding. An emerging solution with recent success in vision and NLP involves finetuning a foundation model on a selection of relevant tasks, before its adaptation to a target task with limited labeled samples. In this paper, we study the theoretical justification of this multitask finetuning approach. Our theoretical analysis reveals that with a diverse set of related tasks, this multitask finetuning leads to reduced error in the target task, in comparison to directly adapting the same pretrained model. We quantify the relationship between finetuning tasks and target tasks by diversity and consistency metrics, and further propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oliverxuzy/foudation-model_multitask
pytorchOfficial

Videos

Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning· slideslive

Taxonomy

TopicsDam Engineering and Safety · Infrastructure Maintenance and Monitoring · Geotechnical Engineering and Analysis

MethodsSparse Evolutionary Training