Development and performance of a HemeLB GPU code for human-scale blood flow simulation
I. Zacharoudiou, J.W.S. McCullough, P.V. Coveney

TL;DR
This paper presents a GPU-accelerated version of the HemeLB blood flow simulation code, demonstrating significant performance gains on NVIDIA hardware while maintaining strong scaling, and discusses challenges for exascale computing deployment.
Contribution
A new CUDA C++ implementation of HemeLB that leverages GPU hardware for improved performance on HPC systems.
Findings
GPU version achieves higher performance than CPU version
Retains strong scaling characteristics of the original code
Discusses deployment challenges for exascale HPC systems
Abstract
In recent years, it has become increasingly common for high performance computers (HPC) to possess some level of heterogeneous architecture - typically in the form of GPU accelerators. In some machines these are isolated within a dedicated partition, whilst in others they are integral to all compute nodes - often with multiple GPUs per node - and provide the majority of a machine's compute performance. In light of this trend, it is becoming essential that codes deployed on HPC are updated to execute on accelerator hardware. In this paper we introduce a GPU implementation of the 3D blood flow simulation code HemeLB that has been developed using CUDA C++. We demonstrate how taking advantage of NVIDIA GPU hardware can achieve significant performance improvements compared to the equivalent CPU only code on which it has been built whilst retaining the excellent strong scaling characteristics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Cloud Computing and Resource Management
