Robust fine-tuning of zero-shot models
Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon, Kornblith, Rebecca Roelofs, Raphael Gontijo-Lopes, Hannaneh Hajishirzi, Ali, Farhadi, Hongseok Namkoong, Ludwig Schmidt

TL;DR
The paper introduces WiSE-FT, a simple ensembling method that enhances the robustness of fine-tuned zero-shot models against distribution shifts without sacrificing target accuracy.
Contribution
It proposes WiSE-FT, an ensembling technique that improves robustness during fine-tuning of large models like CLIP, maintaining high accuracy across diverse data distributions.
Findings
WiSE-FT improves accuracy under distribution shift by 4-6 percentage points.
WiSE-FT increases ImageNet accuracy by 1.6 percentage points.
WiSE-FT achieves large robustness gains across multiple datasets.
Abstract
Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and five derived distribution shifts, WiSE-FT improves accuracy under distribution shift by 4 to 6 percentage points (pp) over prior work while increasing ImageNet accuracy by 1.6 pp.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications
MethodsALIGN · Contrastive Language-Image Pre-training
