Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

Maciej Wołczyk; Bartosz Wójcik; Klaudia Bałazy; Igor Podolak; Jacek Tabor; Marek Śmieja; Tomasz Trzciński

arXiv:2106.05409 · cs.LG · December 7, 2021 · 30 cites

TL;DR

This paper introduces Zero Time Waste (ZTW), a novel early exit neural network method that reuses previous predictions to improve inference efficiency and accuracy, reducing wasted computation.

Contribution

ZTW adds direct connections between internal classifiers (ICs) and combines their outputs in an ensemble-like manner, improving early exit performance over existing methods.

Findings

01. ZTW improves the trade-off between accuracy and inference time.

02. ZTW reduces computational waste in early exit networks.

03. Experiments across datasets show superior performance of ZTW.

Abstract

The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. However, if a particular IC does not decide to return an answer early, its predictions are discarded, with its computations effectively being wasted. To solve this issue, we introduce Zero Time Waste (ZTW), a novel approach in which each IC reuses predictions returned by its predecessors by (1) adding direct connections between ICs and (2) combining previous outputs in an ensemble-like manner. We conduct extensive experiments across various datasets and architectures to demonstrate that ZTW achieves…
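The two mechanisms named in the abstract can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the authors' implementation: `make_linear_head` and `early_exit_predict` are hypothetical names, the per-IC feature vectors stand in for backbone activations, the direct connection is modeled by adding the predecessor's log-probabilities to the logits, and the ensemble is an unweighted geometric mean. The abstract does not specify any of these details.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_linear_head(weights, prev_weight=1.0):
    """Hypothetical IC head: a linear layer plus a direct connection
    that feeds in the previous IC's prediction (an assumption)."""
    def head(feats, prev_probs):
        logits = [sum(w * f for w, f in zip(row, feats)) for row in weights]
        if prev_probs is not None:
            # Direct connection: reuse the predecessor's output instead of discarding it.
            logits = [l + prev_weight * math.log(p + 1e-12)
                      for l, p in zip(logits, prev_probs)]
        return logits
    return head


def early_exit_predict(heads, feats_per_ic, threshold=0.9):
    """Run ICs in order and stop at the first one whose ensembled
    confidence reaches `threshold`. Returns (label, exit_index, probs)."""
    if not heads:
        raise ValueError("at least one internal classifier is required")
    prev_probs = None
    log_sum = None
    for idx, (head, feats) in enumerate(zip(heads, feats_per_ic)):
        probs = softmax(head(feats, prev_probs))
        log_probs = [math.log(p + 1e-12) for p in probs]
        # Ensemble-like combination: running sum of log-probabilities,
        # averaged and renormalized (an unweighted geometric mean).
        if log_sum is None:
            log_sum = log_probs
        else:
            log_sum = [a + b for a, b in zip(log_sum, log_probs)]
        ensemble = softmax([s / (idx + 1) for s in log_sum])
        if max(ensemble) >= threshold:
            break  # confident enough: exit early
        prev_probs = probs
    return ensemble.index(max(ensemble)), idx, ensemble


# Toy usage: three ICs over two classes, with made-up feature vectors.
identity = [[1.0, 0.0], [0.0, 1.0]]
heads = [make_linear_head(identity) for _ in range(3)]
label, exit_idx, probs = early_exit_predict(
    heads, [[0.2, 0.0], [5.0, 0.0], [0.0, 1.0]]
)
print(label, exit_idx, probs)
```

In this sketch an unconfident IC still contributes to every later decision, both through the direct connection and through the running ensemble, which is the sense in which its computation is not wasted.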

Code & Models

Repositories

gmum/Zero-Time-Waste (PyTorch, official)

Videos

Zero Time Waste: Recycling Predictions in Early Exit Neural Networks (SlidesLive)

Taxonomy

Topics: Machine Learning and Data Classification · Advanced Neural Network Applications · Machine Learning and Algorithms