BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

Surat Teerapittayanon; Bradley McDanel; H.T. Kung

arXiv:1709.01686·cs.NE·September 7, 2017·59 cites

BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks

Surat Teerapittayanon, Bradley McDanel, H.T. Kung

PDF

Open Access 3 Repos

TL;DR

BranchyNet introduces early exit points in deep neural networks, enabling faster inference and reduced energy consumption by allowing easy-to-classify samples to exit early, while more complex samples use the full network.

Contribution

The paper proposes a novel architecture with side branch classifiers for early exiting, improving inference speed and efficiency without sacrificing accuracy.

Findings

01

Significant reduction in inference time across tested networks.

02

Maintains or improves classification accuracy.

03

Effective early exit strategy for real-time applications.

Abstract

Deep neural networks are state of the art methods for many learning tasks due to their ability to extract increasingly better features at each network layer. However, the improved performance of additional layers in a deep network comes at the cost of added latency and energy usage in feedforward inference. As networks continue to get deeper and larger, these costs become more prohibitive for real-time and energy-sensitive applications. To address this issue, we present BranchyNet, a novel deep network architecture that is augmented with additional side branch classifiers. The architecture allows prediction results for a large portion of test samples to exit the network early via these branches when samples can already be inferred with high confidence. BranchyNet exploits the observation that features learned at an early layer of a network may often be sufficient for the classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning

MethodsEarly exiting using confidence measures · 1x1 Convolution · Convolution · Local Response Normalization · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Dense Connections · Max Pooling · Softmax