Model Ratatouille: Recycling Diverse Models for Out-of-Distribution   Generalization

Alexandre Ram\'e; Kartik Ahuja; Jianyu Zhang; Matthieu Cord; L\'eon; Bottou; David Lopez-Paz

arXiv:2212.10445·cs.LG·August 10, 2023·5 cites

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization

Alexandre Ram\'e, Kartik Ahuja, Jianyu Zhang, Matthieu Cord, L\'eon, Bottou, David Lopez-Paz

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Model Ratatouille, a method that recycles multiple fine-tuned models from diverse auxiliary tasks to improve out-of-distribution generalization, achieving state-of-the-art results on the DomainBed benchmark.

Contribution

It proposes a novel recycling strategy that leverages auxiliary fine-tuned models as initializations and averages them to enhance OOD generalization.

Findings

01

Improves state-of-the-art on DomainBed benchmark.

02

Enhances out-of-distribution robustness.

03

Promotes collaborative model updating.

Abstract

Foundation models are redefining how AI systems are built. Practitioners now follow a standard procedure to build their machine learning solutions: from a pre-trained foundation model, they fine-tune the weights on the target task of interest. So, the Internet is swarmed by a handful of foundation models fine-tuned on many diverse tasks: these individual fine-tunings exist in isolation without benefiting from each other. In our opinion, this is a missed opportunity, as these specialized models contain rich and diverse features. In this paper, we thus propose model ratatouille, a new strategy to recycle the multiple fine-tunings of the same foundation model on diverse auxiliary tasks. Specifically, we repurpose these auxiliary weights as initializations for multiple parallel fine-tunings on the target task; then, we average all fine-tuned weights to obtain the final model. This recycling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

facebookresearch/ModelRatatouille
pytorchOfficial

Videos

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications