Robust fine-tuning of zero-shot models

Mitchell Wortsman; Gabriel Ilharco; Jong Wook Kim; Mike Li; Simon; Kornblith; Rebecca Roelofs; Raphael Gontijo-Lopes; Hannaneh Hajishirzi; Ali; Farhadi; Hongseok Namkoong; Ludwig Schmidt

arXiv:2109.01903·cs.CV·June 23, 2022·5 cites

Robust fine-tuning of zero-shot models

Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon, Kornblith, Rebecca Roelofs, Raphael Gontijo-Lopes, Hannaneh Hajishirzi, Ali, Farhadi, Hongseok Namkoong, Ludwig Schmidt

PDF

Open Access 3 Repos

TL;DR

The paper introduces WiSE-FT, a simple ensembling method that enhances the robustness of fine-tuned zero-shot models against distribution shifts without sacrificing target accuracy.

Contribution

It proposes WiSE-FT, an ensembling technique that improves robustness during fine-tuning of large models like CLIP, maintaining high accuracy across diverse data distributions.

Findings

01

WiSE-FT improves accuracy under distribution shift by 4-6 percentage points.

02

WiSE-FT increases ImageNet accuracy by 1.6 percentage points.

03

WiSE-FT achieves large robustness gains across multiple datasets.

Abstract

Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and five derived distribution shifts, WiSE-FT improves accuracy under distribution shift by 4 to 6 percentage points (pp) over prior work while increasing ImageNet accuracy by 1.6 pp.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications

MethodsALIGN · Contrastive Language-Image Pre-training