Predicting the Performance of Foundation Models via   Agreement-on-the-Line

Rahul Saxena; Taeyoun Kim; Aman Mehra; Christina Baek; Zico Kolter,; Aditi Raghunathan

arXiv:2404.01542·cs.LG·October 25, 2024·1 cites

Predicting the Performance of Foundation Models via Agreement-on-the-Line

Rahul Saxena, Taeyoun Kim, Aman Mehra, Christina Baek, Zico Kolter,, Aditi Raghunathan

PDF

Open Access

TL;DR

This paper investigates how agreement-on-the-line phenomena can predict out-of-distribution performance of foundation models, emphasizing the importance of ensemble diversity and initialization randomness during light finetuning.

Contribution

It reveals that random head initialization induces agreement-on-the-line in finetuned foundation models, enabling reliable OOD performance prediction across vision and language tasks.

Findings

01

Random head initialization reliably induces agreement-on-the-line.

02

Ensembles of different pretrained foundation models show agreement-on-the-line.

03

Agreement-on-the-line can predict OOD performance with high precision.

Abstract

Estimating the out-of-distribution performance in regimes where labels are scarce is critical to safely deploy foundation models. Recently, it was shown that ensembles of neural networks observe the phenomena "agreement-on-the-line", which can be leveraged to reliably predict OOD performance without labels. However, in contrast to classical neural networks that are trained on in-distribution data from scratch for numerous epochs, foundation models undergo minimal finetuning from heavily pretrained weights, which may reduce the ensemble diversity needed to observe agreement-on-the-line. In our work, we demonstrate that when lightly finetuning multiple runs from a single foundation model, the choice of randomness during training (linear head initialization, data ordering, and data subsetting) can lead to drastically different levels of agreement-on-the-line in the resulting ensemble.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Business Process Modeling and Analysis