TL;DR
FastGL is a GPU-efficient framework that accelerates large-scale sampling-based GNN training by optimizing sampling, memory IO, and computation, achieving significant speedups over existing frameworks.
Contribution
FastGL introduces a comprehensive GPU-optimized approach for sampling-based GNN training, including strategies to reduce data traffic, mitigate irregular data access, and decrease synchronization overhead.
Findings
Achieves an average speedup of 11.8x over PyG.
Reduces memory IO traffic with Match-Reorder strategy.
Diminishes synchronization overhead with Fused-Map approach.
Abstract
Graph Neural Networks (GNNs) have shown great superiority on non-Euclidean graph data, achieving ground-breaking performance on various graph-related tasks. As a practical solution to train GNN on large graphs with billions of nodes and edges, the sampling-based training is widely adopted by existing training frameworks. However, through an in-depth analysis, we observe that the efficiency of existing sampling-based training frameworks is still limited due to the key bottlenecks lying in all three phases of sampling-based training, i.e., subgraph sample, memory IO, and computation. To this end, we propose FastGL, a GPU-efficient Framework for accelerating sampling-based training of GNN at Large scale by simultaneously optimizing all above three phases, taking into account both GPU characteristics and graph structure. Specifically, by exploiting the inherent overlap within graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
