Early-Exit Neural Networks with Nested Prediction Sets

Metod Jazbec; Patrick Forr\'e; Stephan Mandt; Dan Zhang and; Eric Nalisnick

arXiv:2311.05931·cs.LG·June 4, 2024·1 cites

Early-Exit Neural Networks with Nested Prediction Sets

Metod Jazbec, Patrick Forr\'e, Stephan Mandt, Dan Zhang and, Eric Nalisnick

PDF

Open Access

TL;DR

This paper proposes using anytime-valid confidence sequences for early-exit neural networks to generate nested, reliable uncertainty estimates, improving safety and efficiency in adaptive inference.

Contribution

It introduces AVCSs as a novel method for uncertainty quantification in EENNs, addressing the non-nested set issue of traditional techniques.

Findings

01

AVCSs produce nested prediction sets across exits

02

Traditional methods like conformal prediction are unsuitable for EENNs

03

AVCSs enhance safety and efficiency in early-exit models

Abstract

Early-exit neural networks (EENNs) enable adaptive and efficient inference by providing predictions at multiple stages during the forward pass. In safety-critical applications, these predictions are meaningful only when accompanied by reliable uncertainty estimates. A popular method for quantifying the uncertainty of predictive models is the use of prediction sets. However, we demonstrate that standard techniques such as conformal prediction and Bayesian credible sets are not suitable for EENNs. They tend to generate non-nested sets across exits, meaning that labels deemed improbable at one exit may reappear in the prediction set of a subsequent exit. To address this issue, we investigate anytime-valid confidence sequences (AVCSs), an extension of traditional confidence intervals tailored for data-streaming scenarios. These sequences are inherently nested and thus well-suited for an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Fault Detection and Control Systems · Neural Networks and Applications

MethodsSparse Evolutionary Training