Using HPC infrastructures for deep learning applications in fusion research
Diogo R. Ferreira

TL;DR
This paper explores how high performance computing infrastructures can support deep learning applications in fusion research, focusing on resource management, model types, and scalability from single GPU to large-scale HPC systems.
Contribution
It provides practical examples of deep learning models in fusion research and discusses scaling these models across HPC infrastructures, highlighting new integration approaches.
Findings
Deep learning models like CNNs and RNNs are effective for diagnostics in fusion.
Scaling deep learning from single GPU to multi-node HPC is feasible.
Deep learning competes with traditional simulation codes for HPC resources.
Abstract
In the fusion community, the use of high performance computing (HPC) has been mostly dominated by heavy-duty plasma simulations, such as those based on particle-in-cell and gyrokinetic codes. However, there has been a growing interest in applying machine learning for knowledge discovery on top of large amounts of experimental data collected from fusion devices. In particular, deep learning models are especially hungry for accelerated hardware, such as graphics processing units (GPUs), and it is becoming more common to find those models competing for the same resources that are used by simulation codes, which can be either CPU- or GPU-bound. In this paper, we give examples of deep learning models -- such as convolutional neural networks, recurrent neural networks, and variational autoencoders -- that can be used for a variety of tasks, including image processing, disruption prediction,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
