Layer-Wise Partitioning and Merging for Efficient and Scalable Deep   Learning

Samson B. Akintoye; Liangxiu Han; Huw Lloyd; Xin Zhang; Darren Dancey,; Haoming Chen; and Daoqiang Zhang

arXiv:2207.11019·cs.DC·July 25, 2022

Layer-Wise Partitioning and Merging for Efficient and Scalable Deep Learning

Samson B. Akintoye, Liangxiu Han, Huw Lloyd, Xin Zhang, Darren Dancey,, Haoming Chen, and Daoqiang Zhang

PDF

Open Access

TL;DR

This paper introduces a novel layer-wise partitioning and merging framework for deep neural network training that enhances speed and scalability by reducing communication overhead and addressing locking issues, achieving near-linear speedup.

Contribution

It proposes a new layer-wise partitioning and merging method combined with parallel forward and backward passes to improve training efficiency and scalability.

Findings

01

Outperforms state-of-the-art methods in training speed

02

Achieves almost linear speedup without accuracy loss

03

Reduces communication overhead during training

Abstract

Deep Neural Network (DNN) models are usually trained sequentially from one layer to another, which causes forward, backward and update locking's problems, leading to poor performance in terms of training time. The existing parallel strategies to mitigate these problems provide suboptimal runtime performance. In this work, we have proposed a novel layer-wise partitioning and merging, forward and backward pass parallel framework to provide better training performance. The novelty of the proposed work consists of 1) a layer-wise partition and merging model which can minimise communication overhead between devices without the memory cost of existing strategies during the training process; 2) a forward pass and backward pass parallelisation and optimisation to address the update locking problem and minimise the total training cost. The experimental evaluation on real use cases shows that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Machine Learning and ELM