AutoFT: Learning an Objective for Robust Fine-Tuning
Caroline Choi, Yoonho Lee, Annie Chen, Allan Zhou, Aditi Raghunathan,, Chelsea Finn

TL;DR
AutoFT is a data-driven method that automatically searches for optimal fine-tuning objectives and hyperparameters to improve out-of-distribution generalization of foundation models, outperforming existing methods on multiple benchmarks.
Contribution
AutoFT introduces a bi-level optimization framework to automatically discover fine-tuning objectives tailored for robustness against distribution shifts.
Findings
AutoFT significantly improves OOD generalization across nine benchmarks.
Achieves state-of-the-art results on WILDS iWildCam and FMoW datasets.
Outperforms existing robust fine-tuning techniques by notable margins.
Abstract
Foundation models encode rich representations that can be adapted to downstream tasks by fine-tuning. However, fine-tuning a model on one data distribution often degrades performance under distribution shifts. Current approaches to robust fine-tuning use hand-crafted regularization techniques to constrain the fine-tuning process towards the pretrained model. Yet, it is hard to specify how to adapt relevant characteristics of the foundation model during fine-tuning, as this depends on how the pre-training, fine-tuning, and test data distributions relate to each other. We propose AutoFT, a data-driven approach for robust fine-tuning. Given a task, AutoFT searches for a fine-tuning procedure that enhances out-of-distribution (OOD) generalization. Specifically, AutoFT uses bi-level optimization to search for an objective function and hyperparameters that maximize post-adaptation performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Human Pose and Action Recognition
MethodsWeight Decay · Balanced Selection
