Optimizing Model Selection for Compound AI Systems

Lingjiao Chen; Jared Quincy Davis; Boris Hanin; Peter Bailis; Matei; Zaharia; James Zou; Ion Stoica

arXiv:2502.14815·cs.AI·February 21, 2025

Optimizing Model Selection for Compound AI Systems

Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Matei, Zaharia, James Zou, Ion Stoica

PDF

Open Access 1 Repo

TL;DR

This paper introduces LLMSelector, an efficient framework for optimizing model choices in compound AI systems, significantly improving performance by intelligently selecting the best LLM for each module.

Contribution

The paper presents LLMSelector, a novel method that leverages empirical insights to efficiently optimize model selection in multi-module AI systems, reducing search complexity.

Findings

01

LLMSelector improves accuracy by 5%-70% over uniform LLM usage.

02

Model choices significantly impact compound system performance.

03

The method scales linearly with the number of modules.

Abstract

Compound AI systems that combine multiple LLM calls, such as self-refine and multi-agent-debate, achieve strong performance on many AI tasks. We address a core question in optimizing compound systems: for each LLM call or module in the system, how should one decide which LLM to use? We show that these LLM choices have a large effect on quality, but the search space is exponential. We propose LLMSelector, an efficient framework for model selection in compound systems, which leverages two key empirical insights: (i) end-to-end performance is often monotonic in how well each module performs, with all other modules held fixed, and (ii) per-module performance can be estimated accurately by an LLM. Building upon these insights, LLMSelector iteratively selects one module and allocates to it the model with the highest module-wise performance, as estimated by an LLM, until no further gain is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LLMSELECTOR/LLMSELECTOR
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Fault Detection and Control Systems · Machine Learning and Algorithms