FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs
Yuanchang Zhou, Siyu Hu, Chen Wang, Lin-Wang Wang, Guangming Tan, Weile Jia

TL;DR
FastCHGNet is an optimized, GPU-efficient version of CHGNet that drastically reduces training time from over 8 days to under 2 hours using 32 GPUs, enabling rapid development of universal interatomic potentials.
Contribution
The paper introduces innovative modules and GPU optimizations that significantly accelerate CHGNet training without losing accuracy, supporting multi-GPU scaling.
Findings
Training time reduced to 1.53 hours on 32 GPUs
Memory footprint decreased by a factor of 3.59
Maintains accuracy while enabling faster training
Abstract
Graph neural network universal interatomic potentials (GNN-UIPs) have demonstrated remarkable generalization and transfer capabilities in material discovery and property prediction. These models can accelerate molecular dynamics (MD) simulation by several orders of magnitude while maintaining \textit{ab initio} accuracy, making them a promising new paradigm in material simulations. One notable example is Crystal Hamiltonian Graph Neural Network (CHGNet), pretrained on the energies, forces, stresses, and magnetic moments from the MPtrj dataset, representing a state-of-the-art GNN-UIP model for charge-informed MD simulations. However, training the CHGNet model is time-consuming(8.3 days on one A100 GPU) for three reasons: (i) requiring multi-layer propagation to reach more distant atom information, (ii) requiring second-order derivatives calculation to finish weights updating and (iii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDelphi Technique in Research · Artificial Intelligence in Healthcare and Education
MethodsADaptive gradient method with the OPTimal convergence rate · Graph Neural Network
