# Evaluation of Docker Containers for Scientific Workloads in the Cloud

**Authors:** Pankaj Saha, Angel Beltre, Piotr Uminski, Madhusudhan Govindaraju

arXiv: 1905.08415 · 2019-05-23

## TL;DR

This paper evaluates Docker and Singularity containers for scientific workloads in cloud environments, analyzing performance, hardware mapping, and resource management to guide application developers.

## Contribution

It provides a comprehensive performance comparison of Docker and Singularity on cloud hardware, including RDMA support and workload mapping strategies.

## Key findings

- Both Docker and Singularity achieve near-native performance.
- Docker offers advantages in overlay networking and MPI application deployment.
- Singularity is tailored for HPC workloads but Docker is more flexible in cloud environments.

## Abstract

The HPC community is actively researching and evaluating tools to support execution of scientific applications in cloud-based environments. Among the various technologies, containers have recently gained importance as they have significantly better performance compared to full-scale virtualization, support for microservices and DevOps, and work seamlessly with workflow and orchestration tools. Docker is currently the leader in containerization technology because it offers low overhead, flexibility, portability of applications, and reproducibility. Singularity is another container solution that is of interest as it is designed specifically for scientific applications. It is important to conduct performance and feature analysis of the container technologies to understand their applicability for each application and target execution environment. This paper presents a (1) performance evaluation of Docker and Singularity on bare metal nodes in the Chameleon cloud (2) mechanism by which Docker containers can be mapped with InfiniBand hardware with RDMA communication and (3) analysis of mapping elements of parallel workloads to the containers for optimal resource management with container-ready orchestration tools. Our experiments are targeted toward application developers so that they can make informed decisions on choosing the container technologies and approaches that are suitable for their HPC workloads on cloud infrastructure. Our performance analysis shows that scientific workloads for both Docker and Singularity based containers can achieve near-native performance. Singularity is designed specifically for HPC workloads. However, Docker still has advantages over Singularity for use in clouds as it provides overlay networking and an intuitive way to run MPI applications with one container per rank for fine-grained resources allocation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.08415/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1905.08415/full.md

## References

13 references — full list in the complete paper: https://tomesphere.com/paper/1905.08415/full.md

---
Source: https://tomesphere.com/paper/1905.08415