Flexible Routing via Uncertainty Decomposition
Charlotte Peale, Siddartha Devic, Parikshit Gopalan, Udi Wieder, Aravind Gollakota

TL;DR
This paper introduces a new uncertainty-aware routing method for machine learning systems that intelligently decides when to use a low-cost model, an expensive oracle, or abstain, based on decomposing uncertainty components.
Contribution
It presents a novel uncertainty decomposition technique enabling dynamic, cost-sensitive routing and abstention without retraining, with theoretical regret guarantees.
Findings
Effective in reducing unnecessary oracle calls on ambiguous queries.
Adapts to different loss functions and cost parameters easily.
Demonstrates improved routing performance on synthetic and real datasets.
Abstract
A key strategy for balancing performance and cost in modern machine learning systems is to dynamically route queries to either a low-cost model or a more expensive oracle (such as a large pretrained model or human expert), an approach known as model routing. In this work we present a new uncertainty-aware router that (1) avoids unnecessary oracle calls on inherently ambiguous queries, and (2) adapts dynamically to different loss functions and cost parameters through simple hyperparameter changes, without retraining. Our method, applicable to any classification setting where multiple independent annotations per input are available, is based on decomposing total uncertainty into irreducible and reducible components using higher-order predictors [Ahdritz et al., 2025]. This enables a unified approach to both routing and abstention: predict with the weak model when uncertainty is low, route…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
