Communication-Efficient Graph Neural Networks with Probabilistic   Neighborhood Expansion Analysis and Caching

Tim Kaler; Alexandros-Stavros Iliopoulos; Philip Murzynowski; Tao B.; Schardl; Charles E. Leiserson; Jie Chen

arXiv:2305.03152·cs.LG·May 8, 2023·2 cites

Communication-Efficient Graph Neural Networks with Probabilistic Neighborhood Expansion Analysis and Caching

Tim Kaler, Alexandros-Stavros Iliopoulos, Philip Murzynowski, Tao B., Schardl, Charles E. Leiserson, Jie Chen

PDF

Open Access 2 Repos

TL;DR

This paper introduces SALIENT++, a caching policy based on probabilistic neighborhood expansion analysis that significantly reduces communication bottlenecks in distributed GNN training, enabling faster and more scalable graph learning.

Contribution

SALIENT++ extends the SALIENT system by incorporating VIP-driven caching for partitioned features, drastically reducing communication volume and storage needs in distributed GNN training.

Findings

01

SALIENT++ achieves 7.1x faster training than SALIENT on 8 GPUs.

02

SALIENT++ is 12.7x faster than DistDGL on 8 GPUs.

03

The VIP analysis effectively guides caching to improve scalability.

Abstract

Training and inference with graph neural networks (GNNs) on massive graphs has been actively studied since the inception of GNNs, owing to the widespread use and success of GNNs in applications such as recommendation systems and financial forensics. This paper is concerned with minibatch training and inference with GNNs that employ node-wise sampling in distributed settings, where the necessary partitioning of vertex features across distributed storage causes feature communication to become a major bottleneck that hampers scalability. To significantly reduce the communication volume without compromising prediction accuracy, we propose a policy for caching data associated with frequently accessed vertices in remote partitions. The proposed policy is based on an analysis of vertex-wise inclusion probabilities (VIP) during multi-hop neighborhood sampling, which may expand the neighborhood…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data

MethodsDistDGL · GraphSAGE