Predicting the Performance of Foundation Models via Agreement-on-the-Line
Rahul Saxena, Taeyoun Kim, Aman Mehra, Christina Baek, Zico Kolter,, Aditi Raghunathan

TL;DR
This paper investigates how agreement-on-the-line phenomena can predict out-of-distribution performance of foundation models, emphasizing the importance of ensemble diversity and initialization randomness during light finetuning.
Contribution
It reveals that random head initialization induces agreement-on-the-line in finetuned foundation models, enabling reliable OOD performance prediction across vision and language tasks.
Findings
Random head initialization reliably induces agreement-on-the-line.
Ensembles of different pretrained foundation models show agreement-on-the-line.
Agreement-on-the-line can predict OOD performance with high precision.
Abstract
Estimating the out-of-distribution performance in regimes where labels are scarce is critical to safely deploy foundation models. Recently, it was shown that ensembles of neural networks observe the phenomena "agreement-on-the-line", which can be leveraged to reliably predict OOD performance without labels. However, in contrast to classical neural networks that are trained on in-distribution data from scratch for numerous epochs, foundation models undergo minimal finetuning from heavily pretrained weights, which may reduce the ensemble diversity needed to observe agreement-on-the-line. In our work, we demonstrate that when lightly finetuning multiple runs from a single foundation model, the choice of randomness during training (linear head initialization, data ordering, and data subsetting) can lead to drastically different levels of agreement-on-the-line in the resulting ensemble.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Business Process Modeling and Analysis
