TL;DR
This paper introduces a scalable, efficient parallel training framework for Graph Neural Networks that significantly reduces training time on large graphs by innovative sampling, data partitioning, and runtime scheduling techniques.
Contribution
It presents a novel parallel training method for GNNs using subgraph sampling, optimized data structures, and a runtime scheduler, enabling scalable and fast training on large graphs.
Findings
Achieves 60x speedup in sampling and 20x in feature propagation on a 40-core system.
Outperforms state-of-the-art methods in scalability, efficiency, and accuracy.
Enables training of deeper GNNs with orders of magnitude faster than existing implementations.
Abstract
Graph Neural Networks (GNNs) are powerful deep learning models to generate node embeddings on graphs. When applying deep GNNs on large graphs, it is still challenging to perform training in an efficient and scalable way. We propose a novel parallel training framework. Through sampling small subgraphs as minibatches, we reduce training workload by orders of magnitude compared with state-of-the-art minibatch methods. We then parallelize the key computation steps on tightly-coupled shared memory systems. For graph sampling, we exploit parallelism within and across sampler instances, and propose an efficient data structure supporting concurrent accesses from samplers. The parallel sampler theoretically achieves near-linear speedup with respect to number of processing units. For feature propagation within subgraphs, we improve cache utilization and reduce DRAM traffic by data partitioning.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
