Multi-GPU Approach for Training of Graph ML Models on large CFD Meshes
Sebastian Str\"onisch, Maximilian Sander, Andreas Kn\"upfer, Marcus, Meyer

TL;DR
This paper presents a multi-GPU distributed training approach for a graph neural network surrogate model used in CFD simulations, aiming to handle large meshes efficiently while maintaining accuracy.
Contribution
It extends a graph neural network surrogate model to large, industry-relevant CFD meshes by implementing multi-GPU partitioning and halo exchange during training.
Findings
Distributed training enables handling larger meshes.
Traditional models outperform the surrogate in accuracy.
Future improvements are discussed.
Abstract
Mesh-based numerical solvers are an important part in many design tool chains. However, accurate simulations like computational fluid dynamics are time and resource consuming which is why surrogate models are employed to speed-up the solution process. Machine Learning based surrogate models on the other hand are fast in predicting approximate solutions but often lack accuracy. Thus, the development of the predictor in a predictor-corrector approach is the focus here, where the surrogate model predicts a flow field and the numerical solver corrects it. This paper scales a state-of-the-art surrogate model from the domain of graph-based machine learning to industry-relevant mesh sizes of a numerical flow simulation. The approach partitions and distributes the flow domain to multiple GPUs and provides halo exchange between these partitions during training. The utilized graph neural network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Neural Network · Focus
