TL;DR
This paper evaluates sampling methods for training neural network model chemistries, demonstrating that Metadynamics (MetaMD) effectively explores chemical space and improves model generality at low computational cost.
Contribution
The study introduces and systematically assesses MetaMD as a novel sampling technique for neural network chemistries, showing its advantages over traditional methods like MD and NMS.
Findings
MetaMD ensures diverse sampling of chemical space.
MD sampling does not improve model generality with additional data.
MetaMD scales linearly with molecule size and is easy to implement.
Abstract
Neural network (NN) model chemistries (MCs) promise to facilitate the accurate exploration of chemical space and simulation of large reactive systems. One important path to improving these models is to add layers of physical detail, especially long-range forces. At short range, however, these models are data driven and data limited. Little is systematically known about how data should be sampled, and `test data' chosen randomly from some sampling techniques can provide poor information about generality. If the sampling method is narrow `test error' can appear encouragingly tiny while the model fails catastrophically elsewhere. In this manuscript we competitively evaluate two common sampling methods: molecular dynamics (MD), normal-mode sampling (NMS) and one uncommon alternative, Metadynamics (MetaMD), for preparing training geometries. We show that MD is an inefficient sampling method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
