COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Jiayu Wang; Aws Albarghouthi; Frederic Sala

arXiv:2505.01449·cs.LG·June 6, 2025

COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Jiayu Wang, Aws Albarghouthi, Frederic Sala

PDF

Open Access

TL;DR

COSMOS is a unified framework that accurately predicts LLM adaptation performance and costs, significantly reducing computational overhead and enabling cost-effective model deployment.

Contribution

We introduce COSMOS, a novel prediction framework that estimates adaptation outcomes for LLMs with minimal cost, combining lightweight proxies and scaling laws.

Findings

01

Achieves high prediction accuracy across benchmarks.

02

Reduces computational costs by up to 98.71%.

03

Maintains performance while saving resources.

Abstract

Large language models (LLMs) achieve remarkable performance across numerous tasks by using a diverse array of adaptation strategies. However, optimally selecting a model and adaptation strategy under resource constraints is challenging and often requires extensive experimentation. We investigate whether it is possible to accurately predict both performance and cost without expensive trials. We formalize the strategy selection problem for LLMs and introduce COSMOS, a unified prediction framework that efficiently estimates adaptation outcomes at minimal cost. We instantiate and study the capability of our framework via a pair of powerful predictors: embedding-augmented lightweight proxy models to predict fine-tuning performance, and low-sample scaling laws to forecast retrieval-augmented in-context learning. Extensive evaluation across eight representative benchmarks demonstrates that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Text Readability and Simplification