COSMOS: Predictable and Cost-Effective Adaptation of LLMs
Jiayu Wang, Aws Albarghouthi, Frederic Sala

TL;DR
COSMOS is a unified framework that accurately predicts LLM adaptation performance and costs, significantly reducing computational overhead and enabling cost-effective model deployment.
Contribution
We introduce COSMOS, a novel prediction framework that estimates adaptation outcomes for LLMs with minimal cost, combining lightweight proxies and scaling laws.
Findings
Achieves high prediction accuracy across benchmarks.
Reduces computational costs by up to 98.71%.
Maintains performance while saving resources.
Abstract
Large language models (LLMs) achieve remarkable performance across numerous tasks by using a diverse array of adaptation strategies. However, optimally selecting a model and adaptation strategy under resource constraints is challenging and often requires extensive experimentation. We investigate whether it is possible to accurately predict both performance and cost without expensive trials. We formalize the strategy selection problem for LLMs and introduce COSMOS, a unified prediction framework that efficiently estimates adaptation outcomes at minimal cost. We instantiate and study the capability of our framework via a pair of powerful predictors: embedding-augmented lightweight proxy models to predict fine-tuning performance, and low-sample scaling laws to forecast retrieval-augmented in-context learning. Extensive evaluation across eight representative benchmarks demonstrates that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Text Readability and Simplification
