DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Parth Patne; Mahdi Taheri; Christian Herglotz; Maksim Jenihhin; Milos Krstic; and Michael H\"ubner

arXiv:2603.12269·cs.AR·March 16, 2026

DART: Input-Difficulty-AwaRe Adaptive Threshold for Early-Exit DNNs

Parth Patne, Mahdi Taheri, Christian Herglotz, Maksim Jenihhin, Milos Krstic, and Michael H\"ubner

PDF

Open Access

TL;DR

DART is a novel framework for early-exit DNNs that uses input difficulty estimation and joint threshold optimization to significantly improve inference speed, energy efficiency, and power consumption while maintaining accuracy.

Contribution

DART introduces a lightweight difficulty estimation module, a joint exit policy optimization algorithm, and an adaptive coefficient system for better early-exit decisions in DNNs.

Findings

01

Achieves up to 3.3× speedup and 5.1× lower energy on CNNs.

02

Extends to Vision Transformers with notable efficiency gains but some accuracy loss.

03

Outperforms baselines on the new DAES metric, indicating better overall trade-offs.

Abstract

Early-exit deep neural networks enable adaptive inference by terminating computation when sufficient confidence is achieved, reducing cost for edge AI accelerators in resource-constrained settings. Existing methods, however, rely on suboptimal exit policies, ignore input difficulty, and optimize thresholds independently. This paper introduces DART (Input-Difficulty-Aware Adaptive Threshold), a framework that overcomes these limitations. DART introduces three key innovations: (1) a lightweight difficulty estimation module that quantifies input complexity with minimal computational overhead, (2) a joint exit policy optimization algorithm based on dynamic programming, and (3) an adaptive coefficient management system. Experiments on diverse DNN benchmarks (AlexNet, ResNet-18, VGG-16) demonstrate that DART achieves up to \textbf{3.3 $\times$ } speedup, \textbf{5.1 $\times$ } lower energy, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning