A Systematic Evaluation of the Potential of Carbon-Aware Execution for Scientific Workflows
Kathleen West, Youssef Moawad, Fabian Lehmann, Vasilis Bountris, Ulf Leser, Yehia Elkhatib, Lauritz Thamsen

TL;DR
This paper systematically evaluates how carbon-aware execution strategies, like temporal shifting and resource scaling, can significantly reduce the carbon footprint of scientific workflows on compute clusters and cloud resources.
Contribution
It demonstrates the potential of leveraging delay tolerance, interruptibility, and scalability in scientific workflows for more sustainable execution, filling a key research gap.
Findings
Temporal shifting can reduce carbon emissions by over 80%.
Resource scaling decreases emissions by approximately 67%.
Quantified carbon footprint of real-world workflows on diverse resources.
Abstract
Scientific workflows are critical to scientific data analysis and often involve computationally intensive processing of large datasets on compute clusters. As such, their execution tends to be long-running and resource-intensive, resulting in significant energy consumption and carbon emissions. While carbon-aware computing methods have received considerable attention in general cloud contexts, their application to scientific data analysis workflows remains a critical research gap. Our study addresses this oversight by showing how the delay tolerance, interruptibility, and scalability of scientific workflows can be leveraged for a significantly more sustainable execution model. In this study, we first quantify the problem of carbon emissions associated with running scientific workflows, and then demonstrate the transformative potential for carbon-aware workflow execution. We estimate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices
