Distributed Balanced Partitioning via Linear Embedding
Kevin Aydin, MohammadHossein Bateni, Vahab Mirrokni

TL;DR
This paper introduces a distributed graph partitioning algorithm that embeds nodes onto a line, enabling scalable, efficient partitioning with improved cut sizes and reduced multi-shard queries in real-world applications.
Contribution
The paper presents a novel distributed partitioning method using linear embedding, combining multiple techniques, and demonstrates superior performance over existing algorithms in empirical tests.
Findings
15-25% improvement in cut value over previous algorithms
Scalable distributed implementation for any number of partitions
~40% reduction in multi-shard queries in live Google Maps experiments
Abstract
Balanced partitioning is often a crucial first step in solving large-scale graph optimization problems, e.g., in some cases, a big graph can be chopped into pieces that fit on one machine to be processed independently before stitching the results together. In other cases, links between different parts may show up in the running time and/or network communications cost. We study a distributed balanced partitioning problem where the goal is to partition the vertices of a given graph into k pieces so as to minimize the total cut size. Our algorithm is composed of a few steps that are easily implementable in distributed computation frameworks. The algorithm first embeds nodes of the graph onto a line, and then processes nodes in a distributed manner guided by the linear embedding order. We examine various ways to find the first embedding, e.g., via a hierarchical clustering or Hilbert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
