Distributed Equivariant Graph Neural Networks for Large-Scale Electronic Structure Prediction
Manasa Kaniselvan, Alexander Maeder, Chen Hao Xia, Alexandros Nikolaos Ziogas, and Mathieu Luisier

TL;DR
This paper introduces a distributed equivariant graph neural network framework that efficiently predicts electronic structures of large-scale materials, scaling effectively across hundreds of GPUs with high parallel efficiency.
Contribution
The authors develop a distributed eGNN implementation with a novel graph partitioning strategy, enabling large-scale electronic structure predictions on supercomputers.
Findings
Strong scaling up to 128 GPUs
Weak scaling up to 512 GPUs with 87% efficiency
Applicable to structures with up to 190,000 atoms
Abstract
Equivariant Graph Neural Networks (eGNNs) trained on density-functional theory (DFT) data can potentially perform electronic structure prediction at unprecedented scales, enabling investigation of the electronic properties of materials with extended defects, interfaces, or exhibiting disordered phases. However, as interactions between atomic orbitals typically extend over 10+ angstroms, the graph representations required for this task tend to be densely connected, and the memory requirements to perform training and inference on these large structures can exceed the limits of modern GPUs. Here we present a distributed eGNN implementation which leverages direct GPU communication and introduce a partitioning strategy of the input graph to reduce the number of embedding exchanges between GPUs. Our implementation shows strong scaling up to 128 GPUs, and weak scaling up to 512 GPUs with 87%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
