# Custom Execution Environments with Containers in Pegasus-enabled   Scientific Workflows

**Authors:** Karan Vahi, Mats Rynge, George Papadimitriou, Duncan A. Brown, Rajiv, Mayani, Rafael Ferreira da Silva, Ewa Deelman, Anirban Mandal, Eric Lyons,, Michael Zink

arXiv: 1905.08204 · 2019-05-21

## TL;DR

This paper discusses the importance of containers for ensuring long-term reproducibility in scientific workflows and examines how the Pegasus system supports container use to overcome associated challenges.

## Contribution

It highlights challenges of container integration in distributed workflows and details Pegasus's approach to implementing container support for reproducibility.

## Key findings

- Containers improve long-term reproducibility of workflows.
- Pegasus's container support addresses distributed environment challenges.
- The paper identifies unique issues in containerized scientific workflows.

## Abstract

Science reproducibility is a cornerstone feature in scientific workflows. In most cases, this has been implemented as a way to exactly reproduce the computational steps taken to reach the final results. While these steps are often completely described, including the input parameters, datasets, and codes, the environment in which these steps are executed is only described at a higher level with endpoints and operating system name and versions. Though this may be sufficient for reproducibility in the short term, systems evolve and are replaced over time, breaking the underlying workflow reproducibility. A natural solution to this problem is containers, as they are well defined, have a lifetime independent of the underlying system, and can be user-controlled so that they can provide custom environments if needed. This paper highlights some unique challenges that may arise when using containers in distributed scientific workflows. Further, this paper explores how the Pegasus Workflow Management System implements container support to address such challenges.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.08204/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1905.08204/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1905.08204/full.md

---
Source: https://tomesphere.com/paper/1905.08204