Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Tim Kaler, Nickolas Stathas, Anne Ouyang, Alexandros-Stavros, Iliopoulos, Tao B. Schardl, Charles E. Leiserson, Jie Chen

TL;DR
This paper introduces system-level improvements for graph neural network training and inference, including sampling, pipelining, and parallelization, achieving significant speedups on large-scale datasets without sacrificing accuracy.
Contribution
It proposes a set of techniques to accelerate GNN training and inference in multi-GPU environments, addressing bottlenecks in mini-batch preparation and data transfer.
Findings
Achieves 3x speedup over standard PyTorch-Geometric on single GPU
Attains 8x parallel speedup with 16 GPUs
Maintains comparable accuracy during inference with sampling
Abstract
Improving the training and inference performance of graph neural networks (GNNs) is faced with a challenge uncommon in general neural networks: creating mini-batches requires a lot of computation and data movement due to the exponential growth of multi-hop graph neighborhoods along network layers. Such a unique challenge gives rise to a diverse set of system design choices. We argue in favor of performing mini-batch training with neighborhood sampling in a distributed multi-GPU environment, under which we identify major performance bottlenecks hitherto under-explored by developers: mini-batch preparation and transfer. We present a sequence of improvements to mitigate these bottlenecks, including a performance-engineered neighborhood sampler, a shared-memory parallelization strategy, and the pipelining of batch transfer with GPU computation. We also conduct an empirical analysis that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Ferroelectric and Negative Capacitance Devices
MethodsTest · GraphSAGE
