EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields   for Atomistic Simulations

Vaibhav Bihani; Utkarsh Pratiush; Sajid Mannan; Tao Du; Zhimin Chen,; Santiago Miret; Matthieu Micoulaut; Morten M Smedskjaer; Sayan Ranu; N M; Anoop Krishnan

arXiv:2310.02428·cs.LG·November 27, 2023·2 cites

EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations

Vaibhav Bihani, Utkarsh Pratiush, Sajid Mannan, Tao Du, Zhimin Chen,, Santiago Miret, Matthieu Micoulaut, Morten M Smedskjaer, Sayan Ranu, N M, Anoop Krishnan

PDF

Open Access

TL;DR

This paper systematically benchmarks six equivariant graph neural network force fields for atomistic simulations, revealing their strengths, limitations, and the challenges of out-of-distribution generalization in real-world applications.

Contribution

It provides a comprehensive evaluation framework, introduces new datasets and metrics, and highlights the need for foundation models in ML-based atomic force fields.

Findings

01

No model outperforms others across all datasets.

02

Performance on out-of-distribution data is unreliable.

03

Lower energy or force error does not ensure simulation stability.

Abstract

Equivariant graph neural networks force fields (EGraFFs) have shown great promise in modelling complex interactions in atomic systems by exploiting the graphs' inherent symmetries. Recent works have led to a surge in the development of novel architectures that incorporate equivariance-based inductive biases alongside architectural innovations like graph transformers and message passing to model atomic interactions. However, thorough evaluations of these deploying EGraFFs for the downstream task of real-world atomistic simulations, is lacking. To this end, here we perform a systematic benchmarking of 6 EGraFF algorithms (NequIP, Allegro, BOTNet, MACE, Equiformer, TorchMDNet), with the aim of understanding their capabilities and limitations for realistic atomistic simulations. In addition to our thorough evaluation and analysis on eight existing datasets based on the benchmarking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Protein Structure and Dynamics · Topic Modeling