TL;DR
AdaptEvolve introduces a confidence-based adaptive LLM selection method that enhances efficiency in evolutionary AI agents, reducing inference costs significantly while maintaining high accuracy.
Contribution
It presents a novel adaptive LLM selection framework that uses model confidence to dynamically balance efficiency and reasoning capability in evolutionary systems.
Findings
Reduced total inference cost by an average of 37.9% across benchmarks.
Retained 97.5% of the accuracy of static large-model baselines.
Demonstrated favorable Pareto frontier in efficiency and accuracy trade-off.
Abstract
Evolutionary agentic systems intensify the trade-off between computational efficiency and reasoning capability by repeatedly invoking large language models (LLMs) during inference. This setting raises a central question: how can an agent dynamically select an LLM that is sufficiently capable for the current generation step while remaining computationally efficient? While model cascades offer a practical mechanism for balancing this trade-off, existing routing strategies typically rely on static heuristics or external controllers and do not explicitly account for model uncertainty. We introduce AdaptEvolve: Adaptive LLM Selection for Multi-LLM Evolutionary Refinement within an evolutionary sequential refinement framework that leverages intrinsic generation confidence to estimate real-time solvability. Empirical results show that confidence-driven selection yields a favourable Pareto…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
