Early-exit Convolutional Neural Networks
Edanur Demir, Emre Akbas

TL;DR
This paper introduces Early-exit CNNs (EENets) that adaptively reduce inference computation by allowing the network to exit early for easy inputs, maintaining accuracy while significantly lowering computational costs.
Contribution
The paper proposes a novel early-exit mechanism for CNNs, enabling adaptive inference stopping based on confidence, which reduces computation without sacrificing accuracy.
Findings
Achieves similar accuracy to standard CNNs with only 20% of the original computation.
Applicable to architectures like ResNets, demonstrating versatility.
Reduces inference cost significantly on multiple datasets.
Abstract
This paper is aimed at developing a method that reduces the computational cost of convolutional neural networks (CNN) during inference. Conventionally, the input data pass through a fixed neural network architecture. However, easy examples can be classified at early stages of processing and conventional networks do not take this into account. In this paper, we introduce 'Early-exit CNNs', EENets for short, which adapt their computational cost based on the input by stopping the inference process at certain exit locations. In EENets, there are a number of exit blocks each of which consists of a confidence branch and a softmax branch. The confidence branch computes the confidence score of exiting (i.e. stopping the inference process) at that location; while the softmax branch outputs a classification probability vector. Both branches are learnable and their parameters are separate. During…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsSoftmax
