ParaDiS: Parallelly Distributable Slimmable Neural Networks
Alexey Ozerov, Anne Lambert, Suresh Kirthi Kumaraswamy

TL;DR
ParaDiS introduces neural networks that can be split among various device configurations without retraining, enabling efficient parallel processing on limited power devices with minimal accuracy loss.
Contribution
The paper proposes ParaDiS, a novel neural network framework that allows parallel distribution across multiple devices without retraining, unlike previous methods.
Findings
ParaDiS switches achieve similar or better accuracy than individually trained models.
Distributable ParaDiS switches have minimal accuracy drop compared to non-distributable slimmable networks.
Distributed ParaDiS models outperform slimmable models significantly in accuracy and efficiency.
Abstract
When several limited power devices are available, one of the most efficient ways to make profit of these resources, while reducing the processing latency and communication load, is to run in parallel several neural sub-networks and to fuse the result at the end of processing. However, such a combination of sub-networks must be trained specifically for each particular configuration of devices (characterized by number of devices and their capacities) which may vary over different model deployments and even within the same deployment. In this work we introduce parallelly distributable slimmable (ParaDiS) neural networks that are splittable in parallel among various device configurations without retraining. While inspired by slimmable networks allowing instant adaptation to resources on just one device, ParaDiS networks consist of several multi-device distributable configurations or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image Processing Techniques
