FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs

Yuanchang Zhou; Siyu Hu; Chen Wang; Lin-Wang Wang; Guangming Tan; Weile Jia

arXiv:2412.20796·cs.DC·July 25, 2025

FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs

Yuanchang Zhou, Siyu Hu, Chen Wang, Lin-Wang Wang, Guangming Tan, Weile Jia

PDF

Open Access

TL;DR

FastCHGNet is an optimized, GPU-efficient version of CHGNet that drastically reduces training time from over 8 days to under 2 hours using 32 GPUs, enabling rapid development of universal interatomic potentials.

Contribution

The paper introduces innovative modules and GPU optimizations that significantly accelerate CHGNet training without losing accuracy, supporting multi-GPU scaling.

Findings

01

Training time reduced to 1.53 hours on 32 GPUs

02

Memory footprint decreased by a factor of 3.59

03

Maintains accuracy while enabling faster training

Abstract

Graph neural network universal interatomic potentials (GNN-UIPs) have demonstrated remarkable generalization and transfer capabilities in material discovery and property prediction. These models can accelerate molecular dynamics (MD) simulation by several orders of magnitude while maintaining \textit{ab initio} accuracy, making them a promising new paradigm in material simulations. One notable example is Crystal Hamiltonian Graph Neural Network (CHGNet), pretrained on the energies, forces, stresses, and magnetic moments from the MPtrj dataset, representing a state-of-the-art GNN-UIP model for charge-informed MD simulations. However, training the CHGNet model is time-consuming(8.3 days on one A100 GPU) for three reasons: (i) requiring multi-layer propagation to reach more distant atom information, (ii) requiring second-order derivatives calculation to finish weights updating and (iii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDelphi Technique in Research · Artificial Intelligence in Healthcare and Education

MethodsADaptive gradient method with the OPTimal convergence rate · Graph Neural Network