GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel   Applications

Jie Yan; Guangming Tan; Ninghui Sun

arXiv:1310.5603·cs.DC·October 22, 2013·5 cites

GRE: A Graph Runtime Engine for Large-Scale Distributed Graph-Parallel Applications

Jie Yan, Guangming Tan, Ninghui Sun

PDF

Open Access

TL;DR

GRE introduces a novel graph-parallel framework with new abstractions that significantly improves performance and memory efficiency for large-scale distributed graph processing on multi-core clusters.

Contribution

GRE proposes two new abstractions, Scatter-Combine and Agent-Graph, to enhance parallelism and graph partitioning, outperforming existing frameworks like PowerGraph.

Findings

01

GRE achieves 2.5-17x performance improvement over PowerGraph.

02

GRE can process graphs with 1 billion vertices and 17 billion edges.

03

GRE uses less memory, enabling larger graph processing.

Abstract

Large-scale distributed graph-parallel computing is challenging. On one hand, due to the irregular computation pattern and lack of locality, it is hard to express parallelism efficiently. On the other hand, due to the scale-free nature, real-world graphs are hard to partition in balance with low cut. To address these challenges, several graph-parallel frameworks including Pregel and GraphLab (PowerGraph) have been developed recently. In this paper, we present an alternative framework, Graph Runtime Engine (GRE). While retaining the vertex-centric programming model, GRE proposes two new abstractions: 1) a Scatter-Combine computation model based on active message to exploit massive fined-grained edge-level parallelism, and 2) a Agent-Graph data model based on vertex factorization to partition and represent directed graphs. GRE is implemented on commercial off-the-shelf multi-core cluster.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph Theory and Algorithms · Cloud Computing and Resource Management · Distributed and Parallel Computing Systems