A Discrete-event-based Simulator for Distributed Deep Learning

Xiaoyan Liu; Zhiwei Xu; Yana Qin; Jie Tian

arXiv:2112.00952·cs.LG·November 24, 2022

TL;DR

This paper introduces sim4DistrDL, a discrete-event simulator designed to evaluate distributed deep learning systems, addressing the lack of specialized simulation tools for DNN-based distributed applications.

Contribution

The paper presents a novel discrete-event simulation framework that integrates deep learning and network modules for distributed deep learning environments.

Findings

01 Enables simulation of distributed deep learning configurations

02 Facilitates early-stage scalability assessment of intelligence applications

03 Supports analysis of parameter configuration effects

Abstract

New intelligence applications are driving increasing interest in deploying deep neural networks (DNNs) in a distributed way. Setting up distributed deep learning involves tuning a large number of configuration parameters of network/edge devices and DNN models, which are crucial to achieving the best performance. Simulation can measure the scalability of intelligence applications at an early stage and determine the effects of different configurations, and is therefore highly desirable. However, work on simulating distributed intelligence environments is still in its infancy. Existing simulation frameworks, such as NS-3, cannot be extended in a straightforward way to support simulations of distributed learning. In this paper, we propose a novel discrete-event simulator, sim4DistrDL, which includes a deep learning module and a network simulation module to facilitate simulation of…
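
To make the idea of a discrete-event simulator for distributed training concrete, here is a minimal sketch in Python. It is not sim4DistrDL's code or API: the `Simulator` class, the `simulate_round` function, and all parameters are hypothetical names chosen for illustration. It also assumes a deliberately simple network model in which each worker has its own link to a parameter server, with fixed latency and no contention.

```python
import heapq
from itertools import count


class Simulator:
    """Hypothetical discrete-event core: a clock plus a time-ordered event queue."""

    def __init__(self):
        self.now = 0.0
        self._queue = []
        self._seq = count()  # tie-breaker so equal-time events run in FIFO order

    def schedule(self, delay, action):
        """Register `action` to fire `delay` seconds after the current time."""
        heapq.heappush(self._queue, (self.now + delay, next(self._seq), action))

    def run(self):
        """Pop events in time order, advancing the clock to each event."""
        while self._queue:
            self.now, _, action = heapq.heappop(self._queue)
            action()


def simulate_round(compute_s, model_bytes, bandwidth_bps, latency_s):
    """Simulated duration of one synchronous training round.

    Assumption: each worker computes a gradient (compute_s[w] seconds),
    then uploads it over a dedicated link; the round ends when the last
    gradient reaches the server.
    """
    sim = Simulator()
    arrivals = []
    transfer_s = latency_s + model_bytes * 8 / bandwidth_bps

    for w, t_compute in enumerate(compute_s):
        def on_compute_done(w=w):
            # Compute finished: schedule the gradient's arrival at the server.
            sim.schedule(transfer_s, lambda: arrivals.append((w, sim.now)))

        sim.schedule(t_compute, on_compute_done)

    sim.run()
    return max(t for _, t in arrivals)


if __name__ == "__main__":
    # Two workers, a 1 MB gradient, an 8 Mbit/s link with 0.5 s latency.
    print(simulate_round([1.0, 2.0], 1_000_000, 8_000_000, 0.5))
```

Varying `compute_s`, `bandwidth_bps`, or the number of workers in such a sketch gives a rough feel for how configuration changes affect round time, which is the kind of question the abstract says simulation should answer early on.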

Taxonomy

Topics: Advanced Memory and Neural Computing · Age of Information Optimization · Ferroelectric and Negative Capacitance Devices