Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization
Alexandre Ram\'e, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, L\'eon, Bottou, David Lopez-Paz

TL;DR
This paper introduces Model Ratatouille, a method that recycles multiple fine-tuned models from diverse auxiliary tasks to improve out-of-distribution generalization, achieving state-of-the-art results on the DomainBed benchmark.
Contribution
It proposes a novel recycling strategy that leverages auxiliary fine-tuned models as initializations and averages them to enhance OOD generalization.
Findings
Improves state-of-the-art on DomainBed benchmark.
Enhances out-of-distribution robustness.
Promotes collaborative model updating.
Abstract
Foundation models are redefining how AI systems are built. Practitioners now follow a standard procedure to build their machine learning solutions: from a pre-trained foundation model, they fine-tune the weights on the target task of interest. So, the Internet is swarmed by a handful of foundation models fine-tuned on many diverse tasks: these individual fine-tunings exist in isolation without benefiting from each other. In our opinion, this is a missed opportunity, as these specialized models contain rich and diverse features. In this paper, we thus propose model ratatouille, a new strategy to recycle the multiple fine-tunings of the same foundation model on diverse auxiliary tasks. Specifically, we repurpose these auxiliary weights as initializations for multiple parallel fine-tunings on the target task; then, we average all fine-tuned weights to obtain the final model. This recycling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
