TL;DR
LB4OMP is an open-source library that enhances multithreaded application performance by providing dynamic load balancing algorithms, addressing load imbalance issues in hierarchical parallel systems like Exascale computing.
Contribution
The paper introduces LB4OMP, a flexible load balancing library that implements various scheduling algorithms to improve multithreaded application performance beyond standard OpenMP options.
Findings
LB4OMP outperforms OpenMP scheduling in many applications.
Node-level load balancing reduces cross-node imbalance.
Improved MPI+OpenMP performance for Exascale computing.
Abstract
Exascale computing systems will exhibit high degrees of hierarchical parallelism, with thousands of computing nodes and hundreds of cores per node. Efficiently exploiting hierarchical parallelism is challenging due to load imbalance that arises at multiple levels. OpenMP is the most widely-used standard for expressing and exploiting the ever-increasing node-level parallelism. The scheduling options in OpenMP are insufficient to address the load imbalance that arises during the execution of multithreaded applications. The limited scheduling options in OpenMP hinder research on novel scheduling techniques which require comparison with others from the literature. This work introduces LB4OMP, an open-source dynamic load balancing library that implements successful scheduling algorithms from the literature. LB4OMP is a research infrastructure designed to spur and support present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
