DISCO: Distributed Inference with Sparse Communications
Minghai Qin, Chao Sun, Jaco Hofmann, Dejan Vucinic

TL;DR
DISCO introduces a distributed inference framework that reduces communication, computation, and memory requirements for large DNNs by optimizing sparse data exchanges across multiple nodes, enabling efficient deployment on resource-limited devices.
Contribution
The paper proposes a novel training framework for sparse communication in distributed DNN inference, optimizing data transmission to significantly reduce communication and computation costs.
Findings
Achieves 5x less data communication in ResNet-50 inference.
Reduces overall computation and memory by nearly half.
Provides 4.7x speedup in inference speed.
Abstract
Deep neural networks (DNNs) have great potential to solve many real-world problems, but they usually require an extensive amount of computation and memory. It is of great difficulty to deploy a large DNN model to a single resource-limited device with small memory capacity. Distributed computing is a common approach to reduce single-node memory consumption and to accelerate the inference of DNN models. In this paper, we explore the "within-layer model parallelism", which distributes the inference of each layer into multiple nodes. In this way, the memory requirement can be distributed to many nodes, making it possible to use several edge devices to infer a large DNN model. Due to the dependency within each layer, data communications between nodes during this parallel inference can be a bottleneck when the communication bandwidth is limited. We propose a framework to train DNN models for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI
