A Communication-Theoretic Framework for LLM Agents: Cost-Aware Adaptive Reliability
Hamed Omidvar, Vahideh Akhlaghi

TL;DR
This paper introduces a communication-theoretic framework for LLM reliability techniques, providing analytical results and a cost-aware routing method that improves performance and efficiency across diverse tasks.
Contribution
It unifies reliability techniques under a common framework, derives analytical thresholds, and proposes a cost-aware router that optimally balances quality and cost.
Findings
Uniform averaging outperforms quality-weighted averaging above a noise threshold.
Generator-critic refinement is contractive below a certain threshold.
The proposed router achieves up to 56% cost reduction at equal quality.
Abstract
Agents built on large language models (LLMs) rely on a range of reliability techniques, including retry, majority voting, and self-consistency, that have been developed in parallel rather than within a common analytical framework. We observe that an LLM sampled at temperature is a discrete stochastic channel in the sense of Shannon's coding theory, and use this identity as the entry point for such a framework grounded in communication theory. Each of these techniques is a special case of one of six classical reliability operators: diversity combining, hybrid retransmission, iterative generator-critic decoding, rateless sampling, structured redundant verification, and difficulty-adaptive routing. Within the framework we give two closed-form results: a noise-variance threshold above which uniform averaging beats quality-weighted averaging, and a contractivity criterion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
