EmbedPart: Embedding-Driven Graph Partitioning for Scalable Graph Neural Network Training
Nikolai Merkel, Ruben Mayer, Volker Markl, Hans-Arno Jacobsen

TL;DR
EmbedPart is a novel embedding-driven graph partitioning method that significantly speeds up GNN training on large graphs while maintaining quality, supporting dynamic updates and reordering.
Contribution
It introduces EmbedPart, a fast, embedding-based partitioning approach that outperforms traditional methods like Metis in speed and supports dynamic graph updates.
Findings
EmbedPart achieves over 100x speedup compared to Metis.
EmbedPart maintains competitive partitioning quality.
EmbedPart supports graph updates and fast repartitioning.
Abstract
Graph Neural Networks (GNNs) are widely used for learning on graph-structured data, but scaling GNN training to massive graphs remains challenging. To enable scalable distributed training, graphs are divided into smaller partitions that are distributed across multiple machines such that inter-machine communication is minimized and computational load is balanced. In practice, existing partitioning approaches face a fundamental trade-off between partitioning overhead and partitioning quality. We propose EmbedPart, an embedding-driven partitioning approach that achieves both speed and quality. Instead of operating directly on irregular graph structures, EmbedPart leverages node embeddings produced during the actual GNN training workload and clusters these dense embeddings to derive a partitioning. EmbedPart achieves more than 100x speedup over Metis while maintaining competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
