Alps, a versatile research infrastructure
Maxime Martinasso, Mark Klein, Thomas C. Schulthess

TL;DR
Alps is a flexible, modular HPC infrastructure that integrates heterogeneous hardware and software-defined clusters to support diverse scientific workloads and improve flexibility over traditional HPC systems.
Contribution
The paper introduces Alps, a novel HPC architecture with resource independence, software-defined clusters, and support for multiple scientific domains, enhancing flexibility and customization.
Findings
Supports diverse scientific workloads including weather prediction and AI.
Enables resource independence and flexible service creation.
Integrates heterogeneous hardware with high-performance networking.
Abstract
The Swiss National Supercomputing Centre (CSCS) has a long-standing tradition of delivering top-tier high-performance computing systems, exemplified by the Piz Daint supercomputer. However, the increasing diversity of scientific needs has exposed limitations in traditional vertically integrated HPC architectures, which often lack flexibility and composability. To address these challenges, CSCS developed Alps, a next-generation HPC infrastructure designed with a transformative principle: resources operate as independent endpoints within a high-speed network. This architecture enables the creation of independent tenant-specific and platform-specific services, tailored to diverse scientific requirements. Alps incorporates heterogeneous hardware, including CPUs and GPUs, interconnected by a high-performance Slingshot network, and offers a modular storage system. A key innovation is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
