Accelerating advection for atmospheric modelling on Xilinx and Intel FPGAs
Nick Brown

TL;DR
This paper demonstrates how modern Xilinx and Intel FPGAs can accelerate atmospheric advection computations, achieving significant performance gains and power efficiency compared to CPUs and GPUs, with a focus on portability and scalability.
Contribution
It presents a performance portable dataflow design for atmospheric advection kernels on latest FPGA hardware, with detailed implementation and comparison across multiple architectures.
Findings
FPGA solutions outperform CPU in data transfer and compute overlap
FPGAs are more power-efficient than GPUs for this kernel
Scaling FPGA kernels shows competitive performance with traditional HPC hardware
Abstract
Reconfigurable architectures, such as FPGAs, enable the execution of code at the electronics level, avoiding the assumptions imposed by the general purpose black-box micro-architectures of CPUs and GPUs. Such tailored execution can result in increased performance and power efficiency, and as the HPC community moves towards exascale an important question is the role such hardware technologies can play in future supercomputers. In this paper we explore the porting of the PW advection kernel, an important code component used in a variety of atmospheric simulations and accounting for around 40\% of the runtime of the popular Met Office NERC Cloud model (MONC). Building upon previous work which ported this kernel to an older generation of Xilinx FPGA, we target latest generation Xilinx Alveo U280 and Intel Stratix 10 FPGAs. Exploring the development of a dataflow design which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
