SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen; Stefan Uhlich; Fabien Cardinaux; Lukas Mauch; Marzieh; Edraki; Aaron Courville

arXiv:2407.03036·cs.CV·July 4, 2024

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux, Lukas Mauch, Marzieh, Edraki, Aaron Courville

PDF

Open Access

TL;DR

SAFT is a simple yet effective fine-tuning method that updates only a small subset of important parameters to improve out-of-distribution generalization in pre-trained vision-language models like CLIP.

Contribution

SAFT introduces a parameter-efficient fine-tuning approach that preserves pre-trained knowledge while enhancing OOD performance, using only 0.1% of parameters.

Findings

01

SAFT significantly improves CLIP's OOD performance.

02

SAFT outperforms baseline fine-tuning methods.

03

SAFT achieves a 5.15% gain on ImageNet OOD benchmarks.

Abstract

Handling distribution shifts from training data, known as out-of-distribution (OOD) generalization, poses a significant challenge in the field of machine learning. While a pre-trained vision-language model like CLIP has demonstrated remarkable zero-shot performance, further adaptation of the model to downstream tasks leads to undesirable degradation for OOD data. In this work, we introduce Sparse Adaptation for Fine-Tuning (SAFT), a method that prevents fine-tuning from forgetting the general knowledge in the pre-trained model. SAFT only updates a small subset of important parameters whose gradient magnitude is large, while keeping the other parameters frozen. SAFT is straightforward to implement and conceptually simple. Extensive experiments show that with only 0.1% of the model parameters, SAFT can significantly improve the performance of CLIP. It consistently outperforms baseline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvancements in PLL and VCO Technologies · Advanced Electrical Measurement Techniques · Image and Signal Denoising Methods

MethodsContrastive Language-Image Pre-training