AutoFT: Learning an Objective for Robust Fine-Tuning

Caroline Choi; Yoonho Lee; Annie Chen; Allan Zhou; Aditi Raghunathan,; Chelsea Finn

arXiv:2401.10220·cs.LG·March 15, 2024·1 cites

AutoFT: Learning an Objective for Robust Fine-Tuning

Caroline Choi, Yoonho Lee, Annie Chen, Allan Zhou, Aditi Raghunathan,, Chelsea Finn

PDF

Open Access

TL;DR

AutoFT is a data-driven method that automatically searches for optimal fine-tuning objectives and hyperparameters to improve out-of-distribution generalization of foundation models, outperforming existing methods on multiple benchmarks.

Contribution

AutoFT introduces a bi-level optimization framework to automatically discover fine-tuning objectives tailored for robustness against distribution shifts.

Findings

01

AutoFT significantly improves OOD generalization across nine benchmarks.

02

Achieves state-of-the-art results on WILDS iWildCam and FMoW datasets.

03

Outperforms existing robust fine-tuning techniques by notable margins.

Abstract

Foundation models encode rich representations that can be adapted to downstream tasks by fine-tuning. However, fine-tuning a model on one data distribution often degrades performance under distribution shifts. Current approaches to robust fine-tuning use hand-crafted regularization techniques to constrain the fine-tuning process towards the pretrained model. Yet, it is hard to specify how to adapt relevant characteristics of the foundation model during fine-tuning, as this depends on how the pre-training, fine-tuning, and test data distributions relate to each other. We propose AutoFT, a data-driven approach for robust fine-tuning. Given a task, AutoFT searches for a fine-tuning procedure that enhances out-of-distribution (OOD) generalization. Specifically, AutoFT uses bi-level optimization to search for an objective function and hyperparameters that maximize post-adaptation performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition

MethodsWeight Decay · Balanced Selection