Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making
Danial Amin

TL;DR
This paper introduces a Bayesian framework for orchestrating multiple LLMs in cost-sensitive sequential decision-making, improving efficiency and fairness over single-model approaches.
Contribution
It proposes a novel Bayesian, cost-aware multi-LLM orchestration method that treats LLMs as likelihood models, enabling coherent belief updating and cost-effective decisions.
Findings
Reduced total screening costs by 34% in experiments.
Improved demographic parity by 45%.
Multi-LLM aggregation significantly contributes to savings.
Abstract
Large language models (LLMs) are increasingly deployed as autonomous decision agents in settings with asymmetric error costs: hiring (missed talent vs wasted interviews), medical triage (missed emergencies vs unnecessary escalation), and fraud detection (approved fraud vs declined legitimate payments). The dominant design queries a single LLM for a posterior over states, thresholds "confidence," and acts; we prove this is inadequate for sequential decisions with costs. We propose a Bayesian, cost-aware multi-LLM orchestration framework that treats LLMs as approximate likelihood models rather than classifiers. For each candidate state, we elicit likelihoods via contrastive prompting, aggregate across diverse models with robust statistics, and update beliefs with Bayes rule under explicit priors as new evidence arrives. This enables coherent belief updating, expected-cost action…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Text Readability and Simplification
