Three Practical Workflow Schedulers for Easy Maximum Parallelism
David M. Rogers

TL;DR
This paper introduces three simple, robust workflow schedulers tailored for high-performance computing, demonstrating their scalability and suitability for maximizing system utilization in complex scientific simulations.
Contribution
It presents three new workflow schedulers with simple designs optimized for HPC environments, addressing specific use cases and integrating well with MPI for improved flexibility.
Findings
All three schedulers scale to full Summit utilization.
They have short startup times and low per-task overhead.
The work characterizes minimal task granularity for efficiency.
Abstract
Runtime scheduling and workflow systems are an increasingly popular algorithmic component in HPC because they allow full system utilization with relaxed synchronization requirements. There are so many special-purpose tools for task scheduling, one might wonder why more are needed. Use cases seen on the Summit supercomputer needed better integration with MPI and greater flexibility in job launch configurations. Preparation, execution, and analysis of computational chemistry simulations at the scale of tens of thousands of processors revealed three distinct workflow patterns. A separate job scheduler was implemented for each one using extremely simple and robust designs: file-based, task-list based, and bulk-synchronous. Comparing to existing methods shows unique benefits of this work, including simplicity of design, suitability for HPC centers, short startup time, and well-understood…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
