ExaWorks Software Development Kit: A Robust and Scalable Collection of Interoperable Workflow Technologies
Matteo Turilli, Mihael Hategan-Marandiuc, Mikhail Titov, Ketan Maheshwari, Aymen Alsaadi, Andre Merzky, Ramon Arambula, Mikhail Zakharchanka, Matt Cowan, Justin M. Wozniak, Andreas Wilke, Ozgur Ozan Kilic, Kyle Chard, Rafael Ferreira da Silva, Shantenu Jha, Daniel Laney

TL;DR
The paper introduces the ExaWorks SDK, a curated, interoperable collection of workflow technologies designed to enhance scientific workflows on exascale HPC platforms, emphasizing robustness, scalability, and user support.
Contribution
It presents the development, integration, and testing of the ExaWorks SDK, providing a blueprint for supporting heterogeneous scientific workflows on exascale HPC systems.
Findings
Successful integration of diverse workflow technologies
Implementation of continuous testing on DOE HPC platforms
Development of user-friendly documentation and dashboards
Abstract
Scientific discovery increasingly requires executing heterogeneous scientific workflows on high-performance computing (HPC) platforms. Heterogeneous workflows contain different types of tasks (e.g., simulation, analysis, and learning) that need to be mapped, scheduled, and launched on different computing resources. That requires a software stack that enables users to code their workflows and automate resource management and workflow execution. Currently, there are many workflow technologies with diverse levels of robustness and capabilities, and users face difficult choices of software that can effectively and efficiently support their use cases on HPC machines, especially when considering the latest exascale platforms. We contributed to addressing this issue by developing the ExaWorks Software Development Kit (SDK). The SDK is a curated collection of workflow technologies engineered following…
Taxonomy
Topics: Scientific Computing and Data Management
