FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN   Training at Large Scale

Zeyu Zhu; Peisong Wang; Qinghao Hu; Gang Li; Xiaoyao Liang; Jian Cheng

arXiv:2409.14939·cs.LG·September 24, 2024

FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large Scale

Zeyu Zhu, Peisong Wang, Qinghao Hu, Gang Li, Xiaoyao Liang, Jian Cheng

PDF

1 Repo

TL;DR

FastGL is a GPU-efficient framework that accelerates large-scale sampling-based GNN training by optimizing sampling, memory IO, and computation, achieving significant speedups over existing frameworks.

Contribution

FastGL introduces a comprehensive GPU-optimized approach for sampling-based GNN training, including strategies to reduce data traffic, mitigate irregular data access, and decrease synchronization overhead.

Findings

01

Achieves an average speedup of 11.8x over PyG.

02

Reduces memory IO traffic with Match-Reorder strategy.

03

Diminishes synchronization overhead with Fused-Map approach.

Abstract

Graph Neural Networks (GNNs) have shown great superiority on non-Euclidean graph data, achieving ground-breaking performance on various graph-related tasks. As a practical solution to train GNN on large graphs with billions of nodes and edges, the sampling-based training is widely adopted by existing training frameworks. However, through an in-depth analysis, we observe that the efficiency of existing sampling-based training frameworks is still limited due to the key bottlenecks lying in all three phases of sampling-based training, i.e., subgraph sample, memory IO, and computation. To this end, we propose FastGL, a GPU-efficient Framework for accelerating sampling-based training of GNN at Large scale by simultaneously optimizing all above three phases, taking into account both GPU characteristics and graph structure. Specifically, by exploiting the inherent overlap within graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

a1bc2def6g/fastgl-ae
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.