Scheduling computations with provably low synchronization overheads
Guilherme Rito, Herv\'e Paulino

TL;DR
This paper introduces a variant of work stealing that minimizes synchronization overheads by keeping deques private and only exposing work on demand, achieving bounds independent of total work.
Contribution
The paper presents a new work stealing algorithm with provably low synchronization overheads, improving upon previous algorithms in both theory and design.
Findings
Expected execution time is O(T1/P + T∞)
Synchronization overheads are at most O((C_CAS + C_MFence) P T∞)
First bounds on synchronization overheads independent of total work
Abstract
Work Stealing has been a very successful algorithm for scheduling parallel computations, and is known to achieve high performances even for computations exhibiting fine-grained parallelism. We present a variant of \ws\ that provably avoids most synchronization overheads by keeping processors' deques entirely private by default, and only exposing work when requested by thieves. This is the first paper that obtains bounds on the synchronization overheads that are (essentially) independent of the total amount of work, thus corresponding to a great improvement, in both algorithm design and theory, over state-of-the-art \ws\ algorithms. Consider any computation with work and critical-path length executed by processors using our scheduler. Our analysis shows that the expected execution time is , and the expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Optimization and Search Problems
