Optimal Round and Sample-Size Complexity for Partitioning in Parallel Sorting
Wentao Yang, Vipul Harsh, Edgar Solomonik

TL;DR
This paper establishes tight bounds on the number of rounds and sample sizes needed for balanced partitioning in parallel sorting algorithms, demonstrating optimality of existing methods and providing new theoretical insights.
Contribution
It derives matching upper and lower bounds on rounds and samples for parallel partitioning, proving the optimality of current one-round algorithms and extending understanding of multi-round complexities.
Findings
O(log* p) rounds with O(p/log* p) samples suffice for p processors.
Any algorithm with O(p) samples per round requires at least Ω(log* p) rounds.
One round algorithms require Ω(p log p) samples, confirming their optimal sample size.
Abstract
State-of-the-art parallel sorting algorithms for distributed-memory architectures are based on computing a balanced partitioning via sampling and histogramming. By finding samples that partition the sorted keys into evenly-sized chunks, these algorithms minimize the number of communication rounds required. Histogramming (computing positions of samples) guides sampling, enabling a decrease in the overall number of samples collected. We derive lower and upper bounds on the number of sampling/histogramming rounds required to compute a balanced partitioning. We improve on prior results to demonstrate that when using processors, rounds with samples per round suffice. We match that with a lower bound that shows that any algorithm with samples per round requires at least rounds. Additionally, we prove the samples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Complexity and Algorithms in Graphs · Machine Learning and Algorithms
