GraphTheta: A Distributed Graph Neural Network Learning System With Flexible Training Strategy
Yongchao Liu, Houyi Li, Guowei Zhang, Xintan Zeng, Yongyong Li, Bin, Huang, Peng Zhang, Zhao Li, Xiaowei Zhu, Changhua He, Wenguang Chen

TL;DR
GraphTheta is a scalable distributed system for training large-scale graph neural networks efficiently across many machines, enabling industry-scale applications with improved performance and comparable accuracy.
Contribution
It introduces a novel distributed graph learning system with a new abstraction NN-TGAR and hybrid-parallel execution, supporting large-scale GNN training on big graphs.
Findings
Scales to 1,024 workers on 1.4 billion node graph
Outperforms DistDGL by up to 2.02x
Outperforms GraphLearn by up to 30.56x
Abstract
Graph neural networks (GNNs) have been demonstrated as a powerful tool for analyzing non-Euclidean graph data. However, the lack of efficient distributed graph learning systems severely hinders applications of GNNs, especially when graphs are big and GNNs are relatively deep. Herein, we present GraphTheta, the first distributed and scalable graph learning system built upon vertex-centric distributed graph processing with neural network operators implemented as user-defined functions. This system supports multiple training strategies and enables efficient and scalable big-graph learning on distributed (virtual) machines with low memory. To facilitate graph convolutions, GraphTheta puts forward a new graph learning abstraction named NN-TGAR to bridge the gap between graph processing and graph deep learning. A distributed graph engine is proposed to conduct the stochastic gradient descent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Complex Network Analysis Techniques
