Performance report and optimized implementations of Weather & Climate dwarfs on multi-node systems
Louis Douriez, Alan Gray, David Guibert, Peter Messmer, Erwan Raffin

TL;DR
This paper reports on optimized implementations of Weather & Climate dwarfs for multi-node CPU and GPU systems, achieving significant performance improvements and demonstrating the importance of hardware-specific optimizations and high-bandwidth interconnects.
Contribution
It introduces optimized multi-node CPU and GPU implementations of Weather & Climate dwarfs, highlighting performance gains and hardware considerations for exascale weather prediction models.
Findings
Up to 30% performance improvement on CPU multi-node systems.
Up to 10X speedup on multi-GPU systems with data residency and fast communication.
High-bandwidth interconnects like NVLink/NVSwitch enhance multi-GPU performance.
Abstract
This document is one of the deliverable reports created for the ESCAPE project. ESCAPE stands for Energy-efficient Scalable Algorithms for Weather Prediction at Exascale. The project develops world-class, extreme-scale computing capabilities for European operational numerical weather prediction and future climate models. This is done by identifying Weather & Climate dwarfs which are key patterns in terms of computation and communication (in the spirit of the Berkeley dwarfs). These dwarfs are then optimised for different hardware architectures (single and multi-node) and alternative algorithms are explored. Performance portability is addressed through the use of domain specific languages. Here we summarize the work performed on optimizations of the dwarfs focusing on CPU multi-nodes and multi-GPUs. We limit ourselves to a subset of the dwarf configurations chosen by the consortium.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSatellite Communication Systems
