ParaDiS: Parallelly Distributable Slimmable Neural Networks

Alexey Ozerov; Anne Lambert; Suresh Kirthi Kumaraswamy

arXiv:2110.02724·cs.LG·November 30, 2021·1 cites

ParaDiS: Parallelly Distributable Slimmable Neural Networks

Alexey Ozerov, Anne Lambert, Suresh Kirthi Kumaraswamy

PDF

Open Access

TL;DR

ParaDiS introduces neural networks that can be split among various device configurations without retraining, enabling efficient parallel processing on limited power devices with minimal accuracy loss.

Contribution

The paper proposes ParaDiS, a novel neural network framework that allows parallel distribution across multiple devices without retraining, unlike previous methods.

Findings

01

ParaDiS switches achieve similar or better accuracy than individually trained models.

02

Distributable ParaDiS switches have minimal accuracy drop compared to non-distributable slimmable networks.

03

Distributed ParaDiS models outperform slimmable models significantly in accuracy and efficiency.

Abstract

When several limited power devices are available, one of the most efficient ways to make profit of these resources, while reducing the processing latency and communication load, is to run in parallel several neural sub-networks and to fuse the result at the end of processing. However, such a combination of sub-networks must be trained specifically for each particular configuration of devices (characterized by number of devices and their capacities) which may vary over different model deployments and even within the same deployment. In this work we introduce parallelly distributable slimmable (ParaDiS) neural networks that are splittable in parallel among various device configurations without retraining. While inspired by slimmable networks allowing instant adaptation to resources on just one device, ParaDiS networks consist of several multi-device distributable configurations or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image Processing Techniques