Workflows Community Summit: Tightening the Integration between Computing Facilities and Scientific Workflows
Rafael Ferreira da Silva, Kyle Chard, Henri Casanova, Dan Laney, Dong Ahn, Shantenu Jha, William E. Allcock, Gregory Bauer, Dmitry Duplyakin, Bjoern Enders, Todd M. Heer, Eric Lancon, Sergiu Sanielevici, Kevin Sayers

TL;DR
The paper discusses the importance of scientific workflows, the challenges posed by a fragmented ecosystem of workflow systems, and the efforts by facilities and projects to improve integration and usability.
Contribution
The paper reports on the third Workflows Community Summit, which focused on strengthening collaboration between computing facilities and workflow systems in order to reduce fragmentation and improve facility support for workflows.
Findings
Workflows are vital for scientific discoveries and require large-scale computing resources.
Current workflow systems are numerous and often incompatible, creating barriers for users.
Summit discussions aim to foster better integration between facilities and workflows.
Abstract
The importance of workflows is highlighted by the fact that they have underpinned some of the most significant discoveries of the past decades. Many of these workflows have significant computational, storage, and communication demands, and thus must execute on a range of large-scale computer systems, from local clusters to public clouds and upcoming exascale HPC platforms. Historically, infrastructures for workflow execution consisted of complex, integrated systems, developed in-house by workflow practitioners with strong dependencies on a range of legacy technologies. Due to the increasing need to support workflows, dedicated workflow systems were developed to provide abstractions for creating, executing, and adapting workflows conveniently and efficiently while ensuring portability. While these efforts are all worthwhile individually, there are now hundreds of independent workflow…
Taxonomy
Topics: Scientific Computing and Data Management · Distributed and Parallel Computing Systems · Research Data Management Practices
