A Study on the Performance of Distributed Training of Data-driven CFD Simulations
Sergio Iserte, Alejandro Gonz\'alez-Barber\'a, Paloma Barreda, and Krzysztof Rojek

TL;DR
This paper evaluates distributed GPU training for data-driven CFD simulations, demonstrating significant speedups in training time while maintaining high prediction accuracy.
Contribution
It compares CPU, multiGPU, and distributed GPU training approaches for deep learning models in CFD, highlighting the efficiency gains of distributed GPU training.
Findings
Distributed GPU training reduces training time significantly.
High-accuracy fluid state predictions are achievable with distributed training.
Code adaptations enable effective implementation of distributed training methods.
Abstract
Data-driven methods for computer simulations are blooming in many scientific areas. The traditional approach to simulating physical behaviors relies on solving partial differential equations (PDE). Since calculating these iterative equations is highly both computationally demanding and time-consuming, data-driven methods leverage artificial intelligence (AI) techniques to alleviate that workload. Data-driven methods have to be trained in advance to provide their subsequent fast predictions, however, the cost of the training stage is non-negligible. This paper presents a predictive model for inferencing future states of a specific fluid simulation that serves as a use case for evaluating different training alternatives. Particularly, this study compares the performance of only CPU, multiGPU, and distributed approaches for training a time series forecasting deep learning (DL) model. With…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
