GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications
Jie Yan, Guangming Tan, Ninghui Sun

TL;DR
GRE introduces a novel graph-parallel framework with new abstractions that significantly improves performance and memory efficiency for large-scale distributed graph processing on multi-core clusters.
Contribution
GRE proposes two new abstractions, Scatter-Combine and Agent-Graph, to enhance parallelism and graph partitioning, outperforming existing frameworks like PowerGraph.
Findings
GRE achieves 2.5-17x performance improvement over PowerGraph.
GRE can process graphs with 1 billion vertices and 17 billion edges.
GRE uses less memory, enabling larger graph processing.
Abstract
Large-scale distributed graph-parallel computing is challenging. On one hand, due to the irregular computation pattern and lack of locality, it is hard to express parallelism efficiently. On the other hand, due to the scale-free nature, real-world graphs are hard to partition in balance with low cut. To address these challenges, several graph-parallel frameworks including Pregel and GraphLab (PowerGraph) have been developed recently. In this paper, we present an alternative framework, Graph Runtime Engine (GRE). While retaining the vertex-centric programming model, GRE proposes two new abstractions: 1) a Scatter-Combine computation model based on active message to exploit massive fined-grained edge-level parallelism, and 2) a Agent-Graph data model based on vertex factorization to partition and represent directed graphs. GRE is implemented on commercial off-the-shelf multi-core cluster.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems
