SplitEE: Early Exit in Deep Neural Networks with Split Computing
Divya J. Bajpai, Vivek K. Trivedi, Sohan L. Yadav, and Manjesh K., Hanawal

TL;DR
SplitEE introduces an innovative method combining early exits and split computing in deep neural networks, optimizing resource use on edge devices with minimal accuracy loss through an online, unsupervised learning algorithm.
Contribution
The paper proposes SplitEE, a novel algorithm that adaptively decides computation depth and exit points in DNNs for resource-constrained environments, working in an online, unsupervised manner.
Findings
Achieves over 50% cost reduction compared to full inference.
Maintains less than 2% accuracy drop with cost savings.
Extensively validated on five datasets.
Abstract
Deep Neural Networks (DNNs) have drawn attention because of their outstanding performance on various tasks. However, deploying full-fledged DNNs in resource-constrained devices (edge, mobile, IoT) is difficult due to their large size. To overcome the issue, various approaches are considered, like offloading part of the computation to the cloud for final inference (split computing) or performing the inference at an intermediary layer without passing through all layers (early exits). In this work, we propose combining both approaches by using early exits in split computing. In our approach, we decide up to what depth of DNNs computation to perform on the device (splitting layer) and whether a sample can exit from this layer or need to be offloaded. The decisions are based on a weighted combination of accuracy, computational, and communication costs. We develop an algorithm named SplitEE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
MethodsAttention Is All You Need · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Dropout · Linear Warmup With Linear Decay · BERT · DeeBERT
