Out-of-Core Edge Partitioning at Linear Run-Time
Ruben Mayer, Kamil Orujzade, Hans-Arno Jacobsen

TL;DR
This paper introduces 2PS-L, a novel out-of-core edge partitioning algorithm that achieves linear run-time while improving partitioning quality, significantly reducing total processing time for distributed graph computations.
Contribution
The paper presents 2PS-L, a new out-of-core edge partitioning algorithm that combines clustering with a two-phase approach to achieve linear run-time and better quality.
Findings
2PS-L outperforms existing stateful streaming partitioners in quality.
2PS-L achieves linear run-time complexity, O(|E|).
Partitioning and processing time are significantly reduced.
Abstract
Graph edge partitioning is an important preprocessing step to optimize distributed computing jobs on graph-structured data. The edge set of a given graph is split into equally-sized partitions, such that the replication of vertices across partitions is minimized. Out-of-core edge partitioning algorithms are able to tackle the problem with low memory overhead. Exsisting out-of-core algorithms mainly work in a streaming manner and can be grouped into two types. While \emph{stateless} streaming edge partitioning is fast and yields low partitioning quality, stateful streaming edge partitioning yields better quality, but is expensive, as it requires a scoring function to be evaluated for every edge on every partition, leading to a time complexity of . In this paper, we propose 2PS-L, a novel out-of-core edge partitioning algorithm that builds upon the stateful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Graph Theory and Algorithms · Cloud Computing and Resource Management
