Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching
Tim Kaler, Alexandros-Stavros Iliopoulos, Philip Murzynowski, Tao B., Schardl, Charles E. Leiserson, Jie Chen

TL;DR
This paper introduces SALIENT++, a caching policy based on probabilistic neighborhood expansion analysis that significantly reduces communication bottlenecks in distributed GNN training, enabling faster and more scalable graph learning.
Contribution
SALIENT++ extends the SALIENT system by incorporating VIP-driven caching for partitioned features, drastically reducing communication volume and storage needs in distributed GNN training.
Findings
SALIENT++ achieves 7.1x faster training than SALIENT on 8 GPUs.
SALIENT++ is 12.7x faster than DistDGL on 8 GPUs.
The VIP analysis effectively guides caching to improve scalability.
Abstract
Training and inference with graph neural networks (GNNs) on massive graphs has been actively studied since the inception of GNNs, owing to the widespread use and success of GNNs in applications such as recommendation systems and financial forensics. This paper is concerned with minibatch training and inference with GNNs that employ node-wise sampling in distributed settings, where the necessary partitioning of vertex features across distributed storage causes feature communication to become a major bottleneck that hampers scalability. To significantly reduce the communication volume without compromising prediction accuracy, we propose a policy for caching data associated with frequently accessed vertices in remote partitions. The proposed policy is based on an analysis of vertex-wise inclusion probabilities (VIP) during multi-hop neighborhood sampling, which may expand the neighborhood…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data
MethodsDistDGL · GraphSAGE
