DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs
Da Zheng, Chao Ma, Minjie Wang, Jinjing Zhou, Qidong Su, Xiang Song,, Quan Gan, Zheng Zhang, George Karypis

TL;DR
DistDGL is a scalable distributed system that enables efficient training of GNNs on billion-scale graphs by optimizing graph partitioning, communication, and computation across multiple machines.
Contribution
The paper introduces DistDGL, a novel distributed GNN training system that achieves high efficiency and scalability on extremely large graphs, with optimized partitioning and communication strategies.
Findings
Achieves linear speedup on large graphs
Trains a billion-scale graph in 13 seconds per epoch on 16 machines
Maintains high model accuracy with distributed training
Abstract
Graph neural networks (GNN) have shown great success in learning from graph-structured data. They are widely used in various applications, such as recommendation, fraud detection, and search. In these domains, the graphs are typically large, containing hundreds of millions of nodes and several billions of edges. To tackle this challenge, we develop DistDGL, a system for training GNNs in a mini-batch fashion on a cluster of machines. DistDGL is based on the Deep Graph Library (DGL), a popular GNN development framework. DistDGL distributes the graph and its associated data (initial features and embeddings) across the machines and uses this distribution to derive a computational decomposition by following an owner-compute rule. DistDGL follows a synchronous training approach and allows ego-networks forming the mini-batches to include non-local nodes. To minimize the overheads associated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Complex Network Analysis Techniques
MethodsDistDGL
