Optimizing Model Selection for Compound AI Systems
Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Matei, Zaharia, James Zou, Ion Stoica

TL;DR
This paper introduces LLMSelector, an efficient framework for optimizing model choices in compound AI systems, significantly improving performance by intelligently selecting the best LLM for each module.
Contribution
The paper presents LLMSelector, a novel method that leverages empirical insights to efficiently optimize model selection in multi-module AI systems, reducing search complexity.
Findings
LLMSelector improves accuracy by 5%-70% over uniform LLM usage.
Model choices significantly impact compound system performance.
The method scales linearly with the number of modules.
Abstract
Compound AI systems that combine multiple LLM calls, such as self-refine and multi-agent-debate, achieve strong performance on many AI tasks. We address a core question in optimizing compound systems: for each LLM call or module in the system, how should one decide which LLM to use? We show that these LLM choices have a large effect on quality, but the search space is exponential. We propose LLMSelector, an efficient framework for model selection in compound systems, which leverages two key empirical insights: (i) end-to-end performance is often monotonic in how well each module performs, with all other modules held fixed, and (ii) per-module performance can be estimated accurately by an LLM. Building upon these insights, LLMSelector iteratively selects one module and allocates to it the model with the highest module-wise performance, as estimated by an LLM, until no further gain is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Fault Detection and Control Systems · Machine Learning and Algorithms
