dMath: Distributed Linear Algebra for DL

Steven Eliuk; Cameron Upright; Hars Vardhan; Stephen Walsh; Trevor; Gale

arXiv:1611.07819·cs.DC·November 24, 2016·2 cites

dMath: Distributed Linear Algebra for DL

Steven Eliuk, Cameron Upright, Hars Vardhan, Stephen Walsh, Trevor, Gale

PDF

Open Access

TL;DR

dMath is a distributed linear algebra library optimized for deep learning that offers scalable performance, easy-to-use primitives, and efficient memory management for building large-scale neural network applications.

Contribution

The paper introduces dMath, a novel library providing scalable distributed primitives and algorithms tailored for deep learning, with advanced memory management techniques.

Findings

01

Achieves leading scaling performance in DL workloads.

02

Supports a variety of domain-specific algorithms.

03

Enables rapid development of scalable DNN applications.

Abstract

The paper presents a parallel math library, dMath, that demonstrates leading scaling when using intranode, internode, and hybrid-parallelism for deep learning (DL). dMath provides easy-to-use distributed primitives and a variety of domain-specific algorithms including matrix multiplication, convolutions, and others allowing for rapid development of scalable applications like deep neural networks (DNNs). Persistent data stored in GPU memory and advanced memory management techniques avoid costly transfers between host and device. dMath delivers performance, portability, and productivity to its specific domain of support.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Parallel Computing and Optimization Techniques · Algorithms and Data Compression