Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun, Torsten Hoefler

TL;DR
This paper provides a comprehensive analysis of parallel and distributed deep learning, exploring concurrency types, system architectures, and future directions to enhance DNN training efficiency.
Contribution
It offers an in-depth theoretical and practical overview of parallelization strategies and models for deep neural networks, highlighting trends and future research directions.
Findings
Analysis of concurrency in DNNs from operators to distributed systems
Review of asynchronous stochastic optimization and communication schemes
Identification of future directions for parallelism in deep learning
Abstract
Deep Neural Networks (DNNs) are becoming an important tool in modern computing applications. Accelerating their training is a major challenge and techniques range from distributed algorithms to low-level circuit design. In this survey, we describe the problem from a theoretical perspective, followed by approaches for its parallelization. We present trends in DNN architectures and the resulting implications on parallelization strategies. We then review and model the different types of concurrency in DNNs: from the single operator, through parallelism in network inference and training, to distributed deep learning. We discuss asynchronous stochastic optimization, distributed system architectures, communication schemes, and neural architecture search. Based on those approaches, we extrapolate potential directions for parallelism in deep learning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Parallel Computing and Optimization Techniques · Advanced Neural Network Applications
