Atropos: Improving Cost-Benefit Trade-off of LLM-based Agents under Self-Consistency with Early Termination and Model Hotswap
Naryeong Kim, Shin Yoo

TL;DR
Atropos is a technique that predicts and terminates failing LLM inferences early, and hot-swaps to more capable models, significantly improving cost-efficiency for LLM-based agents.
Contribution
It introduces a GCN-based predictive model for early termination and a hot-swap mechanism to enhance cost-benefit trade-offs in LLM agents using self-consistency.
Findings
Predicts inference failure with 85% accuracy at midpoint.
Converts up to 27.57% of failures into successes via hot-swapping.
Achieves 74.35% of closed LLM performance at 23.9% cost.
Abstract
Open-weight Small Language Models(SLMs) can provide faster local inference at lower financial cost, but may not achieve the same performance level as commercial Large Language Models (LLMs) that are orders of magnitudes larger. Consequently, many of the latest applications of LLMs, such as software engineering agents, tend to be evaluated on larger models only, leaving the issue of improving the cost-benefit trade-off of such applications neglected. This paper proposes Atropos, a predictive early-termination analysis and hotswap technique that aims to improve the cost-benefit trade-off for LLM-based agents that use self-consistency. The core component of ATROPOS is a predictive model based on structural properties of LLM inferences: after merging multiple agentic inference paths into a graph representation, ATROPOS uses Graph Convolutional Network (GCN) to predict whether an ongoing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
