EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations
Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen,, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M, Anoop Krishnan

TL;DR
This paper systematically benchmarks six equivariant graph neural network force fields for atomistic simulations, revealing their strengths, limitations, and the challenges of out-of-distribution generalization in real-world applications.
Contribution
It provides a comprehensive evaluation framework, introduces new datasets and metrics, and highlights the need for foundation models in ML-based atomic force fields.
Findings
No model outperforms others across all datasets.
Performance on out-of-distribution data is unreliable.
Lower energy or force error does not ensure simulation stability.
Abstract
Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model atomic interactions. However, thorough evaluations of these deploying EGraFFs for the downstream task of real-world atomistic simulations, is lacking. To this end, here we perform a systematic benchmarking of 6 EGraFF algorithms (NequIP, Allegro, BOTNet, MACE, Equiformer, TorchMDNet), with the aim of understanding their capabilities and limitations for realistic atomistic simulations. In addition to our thorough evaluation and analysis on eight existing datasets based on the benchmarking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Protein Structure and Dynamics · Topic Modeling
