Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems
Juliano S. Assine, J. C. S. Santos Filho, Eduardo Valle, Marco, Levorato

TL;DR
This paper introduces slimmable ensemble encoders for split DNNs, enabling real-time adaptation of computational load and data transmission in resource-constrained IoT devices, improving efficiency and flexibility.
Contribution
The paper proposes a novel split computing method using slimmable ensemble encoders that adapt dynamically with minimal overhead, outperforming existing solutions.
Findings
Outperforms existing split computing methods in compression and speed.
Enables real-time adaptation with minimal overhead.
Effective on GPU-less mobile devices.
Abstract
The execution of large deep neural networks (DNN) at mobile edge devices requires considerable consumption of critical resources, such as energy, while imposing demands on hardware capabilities. In approaches based on edge computing the execution of the models is offloaded to a compute-capable device positioned at the edge of 5G infrastructures. The main issue of the latter class of approaches is the need to transport information-rich signals over wireless links with limited and time-varying capacity. The recent split computing paradigm attempts to resolve this impasse by distributing the execution of DNN models across the layers of the systems to reduce the amount of data to be transmitted while imposing minimal computing load on mobile devices. In this context, we propose a novel split computing approach based on slimmable ensemble encoders. The key advantage of our design is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
