Zero Time Waste: Recycling Predictions in Early Exit Neural Networks
Maciej Wo{\l}czyk, Bartosz W\'ojcik, Klaudia Ba{\l}azy, Igor Podolak,, Jacek Tabor, Marek \'Smieja, Tomasz Trzci\'nski

TL;DR
This paper introduces Zero Time Waste (ZTW), a novel early exit neural network method that reuses previous predictions to improve inference efficiency and accuracy, reducing wasted computation.
Contribution
ZTW adds direct connections and ensemble-like combination of IC outputs, significantly enhancing early exit performance over existing methods.
Findings
ZTW improves accuracy vs. inference time trade-off.
ZTW reduces computational waste in early exit networks.
Experiments across datasets show superior performance of ZTW.
Abstract
The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. However, if a particular IC does not decide to return an answer early, its predictions are discarded, with its computations effectively being wasted. To solve this issue, we introduce Zero Time Waste (ZTW), a novel approach in which each IC reuses predictions returned by its predecessors by (1) adding direct connections between ICs and (2) combining previous outputs in an ensemble-like manner. We conduct extensive experiments across various datasets and architectures to demonstrate that ZTW achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Machine Learning and Algorithms
