Analysis of Work-Stealing and Parallel Cache Complexity
Yan Gu, Zachary Napier, Yihan Sun

TL;DR
This paper simplifies the analysis of the randomized work-stealing scheduler for educational purposes and introduces new parallel cache complexity bounds that improve upon previous results, especially for high-span algorithms.
Contribution
It provides an easy-to-understand analysis of the RWS scheduler and establishes new parallel cache complexity bounds decoupled from span, advancing theoretical understanding.
Findings
Simplified analysis of RWS scheduler without potential functions
New parallel cache bounds that are near optimal
Improved bounds for classic algorithms in parallel cache complexity
Abstract
Parallelism has become extremely popular over the past decade, and there have been a lot of new parallel algorithms and software. The randomized work-stealing (RWS) scheduler plays a crucial role in this ecosystem. In this paper, we study two important topics related to the randomized work-stealing scheduler. Our first contribution is a simplified, classroom-ready version of analysis for the RWS scheduler. The theoretical efficiency of the RWS scheduler has been analyzed for a variety of settings, but most of them are quite complicated. In this paper, we show a new analysis, which we believe is easy to understand, and can be especially useful in education. We avoid using the potential function in the analysis, and we assume a highly asynchronous setting, which is more realistic for today's parallel machines. Our second and main contribution is some new parallel cache complexity for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Interconnection Networks and Systems · Complexity and Algorithms in Graphs
