AGL: a Scalable System for Industrial-purpose Graph Machine Learning
Dalong Zhang, Xin Huang, Ziqi Liu, Zhiyang Hu, Xianzheng Song, Zhibang, Ge, Zhiqiang Zhang, Lin Wang, Jun Zhou, Yang Shuang, Yuan Qi

TL;DR
AGL is a scalable, fault-tolerant system designed for industrial graph machine learning, enabling efficient training and inference of GNNs on large-scale graphs using MapReduce infrastructure.
Contribution
The paper introduces AGL, a novel system that integrates training and inference for GNNs at scale, leveraging message passing and MapReduce for improved efficiency and scalability.
Findings
Trains a 2-layer GAT on billion-node graphs in 14 hours.
Completes inference on large graphs in 1.2 hours.
Supports large-scale industrial graph machine learning tasks.
Abstract
Machine learning over graphs have been emerging as powerful learning tools for graph data. However, it is challenging for industrial communities to leverage the techniques, such as graph neural networks (GNNs), and solve real-world problems at scale because of inherent data dependency in the graphs. As such, we cannot simply train a GNN with classic learning systems, for instance parameter server that assumes data parallel. Existing systems store the graph data in-memory for fast accesses either in a single machine or graph stores from remote. The major drawbacks are in three-fold. First, they cannot scale because of the limitations on the volume of the memory, or the bandwidth between graph stores and workers. Second, they require extra development of graph stores without well exploiting mature infrastructures such as MapReduce that guarantee good system properties. Third, they focus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Stochastic Gradient Optimization Techniques
