Generalizing Hierarchical Parallelism
Michael Kruse

TL;DR
This paper proposes a simplified, nested parallelism model for OpenMP to better handle the increasing complexity of hardware hierarchies, aiming for more flexible and scalable parallel programming.
Contribution
It introduces a generalized hierarchical parallelism approach that revisits nested parallelism, simplifying the model and improving flexibility for deep hardware hierarchies.
Findings
Enhanced flexibility in handling hardware hierarchies
Simplified model for nested parallelism
Potential for improved scalability and programmability
Abstract
Since the days of OpenMP 1.0 computer hardware has become more complex, typically by specializing compute units for coarse- and fine-grained parallelism in incrementally deeper hierarchies of parallelism. Newer versions of OpenMP reacted by introducing new mechanisms for querying or controlling its individual levels, each time adding another concept such as places, teams, and progress groups. In this paper we propose going back to the roots of OpenMP in the form of nested parallelism for a simpler model and more flexible handling of arbitrary deep hardware hierarchies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Graph Theory and Algorithms · Parallel Computing and Optimization Techniques
