Sarus Suite: Cloud-native Containers for HPC
Alberto Madonna, Matteo Chesi, Gwangmu Lee, Michele Brambilla, Fawzi Roberto Mohamed, Felipe A. Cruz

TL;DR
Sarus Suite is a cloud-native HPC container architecture built on Podman that maintains upstream compatibility while adding HPC-specific features, enabling scalable, high-performance, and flexible container workflows.
Contribution
It introduces Sarus Suite, an HPC container solution aligned with upstream container ecosystems, integrating HPC functionalities through system layers without requiring a specialized runtime.
Findings
Sarus Suite matches the performance and scaling of existing HPC container baselines.
It enables faster per-node container startup compared to traditional HPC container solutions.
Supports direct use of upstream OCI images and Kubernetes-based multi-container workflows.
Abstract
High-performance computing (HPC) systems must support fast-moving software stacks, especially in AI/ML, while preserving scheduler control, scalable startup, and production performance. Yet many HPC container solutions rely on specialized runtime stacks that weaken continuity with mainstream cloud-native workflows and require ongoing effort to sustain compatibility with the evolving upstream ecosystem. We argue that HPC should specialize the integration layer while keeping the container engine aligned with upstream container evolution. We present Sarus Suite, an upstream-aligned HPC container architecture built around an unchanged Podman engine. Sarus Suite adds the HPC-specific functionality needed for production use through complementary system layers for declarative runtime specification, scheduler-native execution, scalable shared-image access, and standards-based host capability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
