Accurate, Efficient and Scalable Training of Graph Neural Networks

Hanqing Zeng; Hongkuan Zhou; Ajitesh Srivastava; Rajgopal; Kannan; Viktor Prasanna

arXiv:2010.03166·cs.LG·October 8, 2020

Accurate, Efficient and Scalable Training of Graph Neural Networks

Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal, Kannan, Viktor Prasanna

PDF

2 Repos

TL;DR

This paper introduces a scalable, efficient parallel training framework for Graph Neural Networks that significantly reduces training time on large graphs by innovative sampling, data partitioning, and runtime scheduling techniques.

Contribution

It presents a novel parallel training method for GNNs using subgraph sampling, optimized data structures, and a runtime scheduler, enabling scalable and fast training on large graphs.

Findings

01

Achieves 60x speedup in sampling and 20x in feature propagation on a 40-core system.

02

Outperforms state-of-the-art methods in scalability, efficiency, and accuracy.

03

Enables training of deeper GNNs with orders of magnitude faster than existing implementations.

Abstract

Graph Neural Networks (GNNs) are powerful deep learning models to generate node embeddings on graphs. When applying deep GNNs on large graphs, it is still challenging to perform training in an efficient and scalable way. We propose a novel parallel training framework. Through sampling small subgraphs as minibatches, we reduce training workload by orders of magnitude compared with state-of-the-art minibatch methods. We then parallelize the key computation steps on tightly-coupled shared memory systems. For graph sampling, we exploit parallelism within and across sampler instances, and propose an efficient data structure supporting concurrent accesses from samplers. The parallel sampler theoretically achieves near-linear speedup with respect to number of processing units. For feature propagation within subgraphs, we improve cache utilization and reduce DRAM traffic by data partitioning.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.