An efficient multi-GPU implementation for the Discontinuous Galerkin ocean model SLIM
Miguel De Le Court, Vincent Legat, Ange P. Ishimwe, Colin Scherpereel, Emmanuel Hanert, Jonathan Lambrechts

TL;DR
This paper introduces a GPU-optimized 3D Discontinuous Galerkin ocean model implementation that significantly accelerates unstructured-mesh simulations, enabling ultra-high-resolution coastal modeling.
Contribution
It presents a novel multi-GPU implementation of a DG-FE ocean model with optimized computational strategies, achieving high performance and scalability.
Findings
A single GPU matches 1500 CPU cores in performance.
Replacing a 128-core CPU node with 4 GPUs yields 50x speedup.
Model achieves five times finer resolution in a real-world application.
Abstract
Unstructured-mesh ocean models are increasingly used for coastal applications due to their ability to represent complex geometries and apply local grid refinement where needed. However, their broader use has been hindered by their high computational cost, particularly for models based on the Discontinuous Galerkin finite element (DG-FE) method, which involves significantly more degrees of freedom than traditional finite volume or continuous finite element approaches. The rapid emergence of GPU-based high-performance computing architectures now offers a pathway to address this limitation, as DG-FE formulations are inherently well suited to massively parallel, element-wise computations. Here, we present a full 3D DG-FE ocean model implementation optimized for both single- and multi-GPU systems, with support for both NVIDIA and AMD architectures. We detail the computational strategies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
