Rethinking Task Sampling for Few-shot Vision-Language Transfer Learning

Zhenhailong Wang; Hang Yu; Manling Li; Han Zhao; Heng Ji

arXiv:2203.04904 · cs.MM · July 18, 2022

TL;DR

This paper argues that task sampling is an essential factor in few-shot vision-language transfer learning. To demonstrate this, it introduces Model-Agnostic Multitask Fine-tuning (MAMF), a simple algorithm that outperforms classical fine-tuning on five tasks.

Contribution

The paper highlights the importance of task sampling in few-shot learning and proposes MAMF, a straightforward algorithm that enhances transfer performance without complex optimization.

Findings

01. MAMF outperforms classical fine-tuning on five tasks.

02. Task sampling significantly impacts few-shot transfer success.

03. Bi-level optimization in MAML is sensitive to zero-shot task performance.

Abstract

Despite achieving state-of-the-art zero-shot performance, existing vision-language models still fall short in few-shot transfer ability on domain-specific problems. Classical fine-tuning often fails to prevent highly expressive models from exploiting spurious correlations. Although model-agnostic meta-learning (MAML) presents itself as a natural alternative for few-shot transfer learning, the expensive computation due to implicit second-order optimization limits its use on large-scale vision-language models such as CLIP. While much of the literature has been devoted to exploring alternative optimization strategies, we identify another essential aspect of effective few-shot transfer learning: task sampling, which was previously viewed only as part of data pre-processing in MAML. To show the impact of task sampling, we propose a simple algorithm, Model-Agnostic Multitask Fine-tuning (MAMF),…
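
For readers unfamiliar with the term, "task sampling" in meta-learning usually means drawing small N-way K-shot classification tasks from a labeled pool. The sketch below is a generic illustration of that idea using only the Python standard library. It is not taken from the paper or its repository; the function name `sample_task`, its parameters, and the toy labels are all hypothetical.

```python
import random
from collections import defaultdict


def sample_task(labels, n_way, k_shot, n_query, rng):
    """Draw one N-way K-shot task from a list of class labels.

    Returns (support, query), each a list of (example_index, task_label)
    pairs, where task_label is in range(n_way). Hypothetical helper for
    illustration only.
    """
    # Group example indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Only classes with enough examples for support + query are usable.
    need = k_shot + n_query
    eligible = sorted(c for c, idxs in by_class.items() if len(idxs) >= need)
    if len(eligible) < n_way:
        raise ValueError("not enough classes with k_shot + n_query examples")

    # Pick N classes, then K support and n_query query examples per class.
    classes = rng.sample(eligible, n_way)
    support, query = [], []
    for task_label, cls in enumerate(classes):
        picked = rng.sample(by_class[cls], need)
        support += [(i, task_label) for i in picked[:k_shot]]
        query += [(i, task_label) for i in picked[k_shot:]]
    return support, query


# Toy pool: 4 classes with 5 examples each (made-up data).
labels = [c for c in ["cat", "dog", "bird", "fish"] for _ in range(5)]
rng = random.Random(0)
support, query = sample_task(labels, n_way=3, k_shot=2, n_query=1, rng=rng)
print(len(support), len(query))
```

Each call yields a fresh task with its own local label space, so how classes and examples are drawn here directly shapes what a model sees during few-shot training.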

Code & Models

Repositories

mikewangwzhl/multitask-finetuning_clip (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research

Methods: Model-Agnostic Meta-Learning · Contrastive Language-Image Pre-training