(Re)partitioning for stream-enabled computation
Le Merrer Erwan, Liang Yizhong, Tr\'edan Gilles

TL;DR
This paper introduces a stream-enabled graph partitioning system that balances load and minimizes edge cuts in real-time, addressing the NP-complete challenge of static graph partitioning in streaming environments.
Contribution
It formulates online graph partitioning as an optimization problem and proposes a greedy algorithm with on-demand improvements, outperforming recent stream partitioning methods.
Findings
Efficient online partitioning with a greedy algorithm.
Tradeoff analysis between cut size and load balancing.
Simulation results show superior performance to existing methods.
Abstract
Partitioning an input graph over a set of workers is a complex operation. Objectives are twofold: split the work evenly, so that every worker gets an equal share, and minimize edge cut to achieve a good work locality (i.e. workers can work independently). Partitioning a graph accessible from memory is a notorious NP-complete problem. Motivated by the regain of interest for the stream processing paradigm (where nodes and edges arrive as a flow to the datacenter), we propose in this paper a stream-enabled graph partitioning system that constantly seeks an optimum between those two objectives. We first expose the hardness of partitioning using classic and static methods; we then exhibit the cut versus load balancing tradeoff, from an application point of view. With this tradeoff in mind, our approach translates the online partitioning problem into a standard optimization problem. A greedy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Interconnection Networks and Systems · Cloud Computing and Resource Management
