WorkflowHub: Community Framework for Enabling Scientific Workflow Research and Development -- Technical Report
Rafael Ferreira da Silva (1), Loïc Pottier (1), Tainã Coleman (1), Ewa Deelman (1), Henri Casanova (2) ((1) University of Southern California, (2) University of Hawaii at Manoa)

TL;DR
WorkflowHub is a community framework that enables analysis, realistic synthetic trace generation, and simulation of scientific workflows, improving upon previous tools for research and development in workflow systems.
Contribution
The paper introduces WorkflowHub, a framework that addresses the limitations of existing tools by providing realistic synthetic trace generation and scalable simulation capabilities.
Findings
Generated synthetic traces closely resemble real-world workflows.
WorkflowHub can produce larger-scale workflow traces than previous tools.
Simulated executions match actual workflow performance metrics.
Abstract
Scientific workflows are a cornerstone of modern scientific computing. They are used to describe complex computational applications that require efficient and robust management of large volumes of data, which are typically stored/processed at heterogeneous, distributed resources. The workflow research and development community has employed a number of methods for the quantitative evaluation of existing and novel workflow algorithms and systems. In particular, a common approach is to simulate workflow executions. In previous work, we have presented a collection of tools that have been used for aiding research and development activities in the Pegasus project, and that have been adopted by others for conducting workflow research. Despite their popularity, there are several shortcomings that prevent easy adoption, maintenance, and consistency with the evolving structures and computational…
