Variant Parallelism: Lightweight Deep Convolutional Models for   Distributed Inference on IoT Devices

Navidreza Asadi; Maziar Goudarzi

arXiv:2210.08376·cs.DC·July 6, 2023

Variant Parallelism: Lightweight Deep Convolutional Models for Distributed Inference on IoT Devices

Navidreza Asadi, Maziar Goudarzi

PDF

Open Access

TL;DR

This paper introduces variant parallelism, a lightweight ensemble-based method for distributed deep learning inference on IoT devices, reducing communication overhead and improving fault tolerance while maintaining high accuracy.

Contribution

The authors propose variant parallelism, a novel ensemble-based distribution technique that generates lightweight model variants for efficient, fault-tolerant inference on resource-constrained IoT devices.

Findings

01

Models are 5.8-7.1x smaller in parameters

02

Achieve 4.3-31x fewer MACs and 2.5-13.2x faster response time

03

Maintain or improve accuracy compared to MobileNetV2

Abstract

Two major techniques are commonly used to meet real-time inference limitations when distributing models across resource-constrained IoT devices: (1) model parallelism (MP) and (2) class parallelism (CP). In MP, transmitting bulky intermediate data (orders of magnitude larger than input) between devices imposes huge communication overhead. Although CP solves this problem, it has limitations on the number of sub-models. In addition, both solutions are fault intolerant, an issue when deployed on edge devices. We propose variant parallelism (VP), an ensemble-based deep learning distribution method where different variants of a main model are generated and can be deployed on separate machines. We design a family of lighter models around the original model, and train them simultaneously to improve accuracy over single models. Our experimental results on six common mid-sized object recognition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Advanced Neural Network Applications · Machine Learning and ELM

MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Convolution · 1x1 Convolution · Batch Normalization · Inverted Residual Block · Balanced Selection · Average Pooling