Enhancing Split Computing and Early Exit Applications through Predefined Sparsity
Luigi Capogrosso, Enrico Fraccaroli, Giulio Petrozziello, Francesco, Setti, Samarjit Chakraborty, Franco Fummi, Marco Cristani

TL;DR
This paper proposes a novel method combining predefined sparsity with Split Computing and Early Exit techniques to significantly reduce computational and storage demands of deep neural networks on resource-constrained devices, without sacrificing accuracy.
Contribution
It introduces the application of predefined sparsity to Split Computing and Early Exit paradigms, a novel approach that enhances efficiency in DNN deployment on edge devices.
Findings
Over 4x reduction in storage and computational complexity
Significant energy savings during training and inference
Maintains performance despite reduced resource usage
Abstract
In the past decade, Deep Neural Networks (DNNs) achieved state-of-the-art performance in a broad range of problems, spanning from object classification and action recognition to smart building and healthcare. The flexibility that makes DNNs such a pervasive technology comes at a price: the computational requirements preclude their deployment on most of the resource-constrained edge devices available today to solve real-time and real-world tasks. This paper introduces a novel approach to address this challenge by combining the concept of predefined sparsity with Split Computing (SC) and Early Exit (EE). In particular, SC aims at splitting a DNN with a part of it deployed on an edge device and the rest on a remote server. Instead, EE allows the system to stop using the remote server and rely solely on the edge device's computation if the answer is already good enough. Specifically, how to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture · Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management
