Rethinking Task Sampling for Few-shot Vision-Language Transfer Learning
Zhenhailong Wang, Hang Yu, Manling Li, Han Zhao, Heng Ji

TL;DR
This paper introduces a simple task sampling strategy called MAMF that improves few-shot vision-language transfer learning, outperforming classical fine-tuning across multiple tasks by focusing on effective task selection.
Contribution
The paper highlights the importance of task sampling in few-shot learning and proposes MAMF, a straightforward algorithm that enhances transfer performance without complex optimization.
Findings
MAMF outperforms classical fine-tuning on five tasks.
Task sampling significantly impacts few-shot transfer success.
Bi-level optimization in MAML is sensitive to zero-shot task performance.
Abstract
Despite achieving state-of-the-art zero-shot performance, existing vision-language models still fall short of few-shot transfer ability on domain-specific problems. Classical fine-tuning often fails to prevent highly expressive models from exploiting spurious correlations. Although model-agnostic meta-learning (MAML) presents as a natural alternative for few-shot transfer learning, the expensive computation due to implicit second-order optimization limits its use on large-scale vision-language models such as CLIP. While much literature has been devoted to exploring alternative optimization strategies, we identify another essential aspect towards effective few-shot transfer learning, task sampling, which is previously only be viewed as part of data pre-processing in MAML. To show the impact of task sampling, we propose a simple algorithm, Model-Agnostic Multitask Fine-tuning (MAMF),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
MethodsModel-Agnostic Meta-Learning · Contrastive Language-Image Pre-training
