Energy-aware operation of HPC systems in Germany
Estela Suarez, Hendryk Bockelmann, Norbert Eicker, Jan Eitzinger,, Salem El Sayed, Thomas Fieseler, Martin Frank, Peter Frech, Pay Giesselmann,, Daniel Hackenberg, Georg Hager, Andreas Herten, Thomas Ilsche, Bastian, Koller, Erwin Laure, Cristina Manzano, Sebastian Oeste

TL;DR
This paper reviews strategies and innovations for improving energy efficiency in German HPC systems, addressing rising energy demands driven by increasing computational needs and AI workloads.
Contribution
It presents a comprehensive overview of state-of-the-art energy-saving techniques and case studies specific to German HPC centers, highlighting practical implementations and best practices.
Findings
Implementation of heterogeneous hardware architectures
Use of advanced monitoring and cooling solutions
Energy-aware scheduling and dynamic power management
Abstract
High-Performance Computing (HPC) systems are among the most energy-intensive scientific facilities, with electric power consumption reaching and often exceeding 20 megawatts per installation. Unlike other major scientific infrastructures such as particle accelerators or high-intensity light sources, which are few around the world, the number and size of supercomputers are continuously increasing. Even if every new system generation is more energy efficient than the previous one, the overall growth in size of the HPC infrastructure, driven by a rising demand for computational capacity across all scientific disciplines, and especially by artificial intelligence workloads (AI), rapidly drives up the energy demand. This challenge is particularly significant for HPC centers in Germany, where high electricity costs, stringent national energy policies, and a strong commitment to environmental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems
