Geometry-enhanced Pre-training on Interatomic Potentials
Taoyong Cui, Chenyu Tang, Mao Su, Shufei Zhang, Yuqiang Li, Lei Bai,, Yuhan Dong, Xingao Gong, Wanli Ouyang

TL;DR
This paper introduces a geometry-enhanced pre-training approach for machine learning interatomic potentials that improves accuracy and transferability in molecular dynamics simulations by leveraging unlabeled 3D configurations and geometric learning.
Contribution
The authors propose a novel two-stage geometric structure learning paradigm with self-supervised techniques to enhance MLIPs, especially for complex molecular systems, with minimal additional computational costs.
Findings
Significant accuracy improvements on various molecular benchmarks.
Enhanced generalization of MLIPs across different architectures.
Effective use of unlabeled 3D configurations for pre-training.
Abstract
Machine learning interatomic potentials (MLIPs) enables molecular dynamics (MD) simulations with ab initio accuracy and has been applied to various fields of physical science. However, the performance and transferability of MLIPs are limited by insufficient labeled training data, which require expensive ab initio calculations to obtain the labels, especially for complex molecular systems. To address this challenge, we design a novel geometric structure learning paradigm that consists of two stages. We first generate a large quantity of 3D configurations of target molecular system with classical molecular dynamics simulations. Then, we propose geometry-enhanced self-supervised learning consisting of masking, denoising, and contrastive learning to better capture the topology and 3D geometric information from the unlabeled 3D configurations. We evaluate our method on various benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Protein Structure and Dynamics · Topic Modeling
