Armada: Memory-Efficient Distributed Training of Large-Scale Graph Neural Networks
Roger Waleffe, Devesh Sarda, Jason Mohoney, Emmanouil-Vasileios, Vlatakis-Gkaragkounis, Theodoros Rekatsinas, Shivaram Venkataraman

TL;DR
Armada introduces GREM, a scalable min-edge-cut partitioning algorithm that significantly reduces memory and runtime for distributed GNN training on large graphs, achieving near state-of-the-art partition quality.
Contribution
The paper presents GREM, a novel streaming greedy partitioning algorithm with continuous refinement, enabling efficient large-scale GNN training with reduced resource requirements.
Findings
GREM achieves partition quality comparable to METIS with 8-65x less memory.
Armada's disaggregated architecture improves training runtime by up to 4.5x.
Cost reductions of up to 3.1x are achieved with Armada.
Abstract
We study distributed training of Graph Neural Networks (GNNs) on billion-scale graphs that are partitioned across machines. Efficient training in this setting relies on min-edge-cut partitioning algorithms, which minimize cross-machine communication due to GNN neighborhood sampling. Yet, min-edge-cut partitioning over large graphs remains a challenge: State-of-the-art (SoTA) offline methods (e.g., METIS) are effective, but they require orders of magnitude more memory and runtime than GNN training itself, while computationally efficient algorithms (e.g., streaming greedy approaches) suffer from increased edge cuts. Thus, in this work we introduce Armada, a new end-to-end system for distributed GNN training whose key contribution is GREM, a novel min-edge-cut partitioning algorithm that can efficiently scale to large graphs. GREM builds on streaming greedy approaches with one key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Advanced Graph Neural Networks · Advanced Neural Network Applications
