Early-Exit Neural Networks with Nested Prediction Sets
Metod Jazbec, Patrick Forr\'e, Stephan Mandt, Dan Zhang and, Eric Nalisnick

TL;DR
This paper proposes using anytime-valid confidence sequences for early-exit neural networks to generate nested, reliable uncertainty estimates, improving safety and efficiency in adaptive inference.
Contribution
It introduces AVCSs as a novel method for uncertainty quantification in EENNs, addressing the non-nested set issue of traditional techniques.
Findings
AVCSs produce nested prediction sets across exits
Traditional methods like conformal prediction are unsuitable for EENNs
AVCSs enhance safety and efficiency in early-exit models
Abstract
Early-exit neural networks (EENNs) enable adaptive and efficient inference by providing predictions at multiple stages during the forward pass. In safety-critical applications, these predictions are meaningful only when accompanied by reliable uncertainty estimates. A popular method for quantifying the uncertainty of predictive models is the use of prediction sets. However, we demonstrate that standard techniques such as conformal prediction and Bayesian credible sets are not suitable for EENNs. They tend to generate non-nested sets across exits, meaning that labels deemed improbable at one exit may reappear in the prediction set of a subsequent exit. To address this issue, we investigate anytime-valid confidence sequences (AVCSs), an extension of traditional confidence intervals tailored for data-streaming scenarios. These sequences are inherently nested and thus well-suited for an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Neural Networks and Applications
MethodsSparse Evolutionary Training
