ArcaNN: automated enhanced sampling generation of training sets for chemically reactive machine learning interatomic potentials
Rolf David (1), Miguel de la Puente (1), Axel Gomez (1), Olaia Anton, (1), Guillaume Stirnemann (1), Damien Laage (1) ((1) PASTEUR, D\'epartement, de Chimie, \'Ecole Normale Sup\'erieure, PSL University, Sorbonne University,, CNRS, Paris)

TL;DR
ArcaNN is a new automated framework that generates high-quality training datasets for reactive machine learning interatomic potentials, improving their accuracy in simulating chemical reactions.
Contribution
It introduces a comprehensive, automated approach combining advanced sampling and iterative training for reactive potential development, addressing a key gap in dataset construction.
Findings
Demonstrated effective sampling of high-energy geometries in a nucleophilic substitution reaction
Achieved uniformly low error of the MLP along the reaction coordinate
Showcased broad applicability for reactive molecular dynamics simulations
Abstract
The emergence of artificial intelligence has profoundly impacted computational chemistry, particularly through machine-learned potentials (MLPs), which offer a balance of accuracy and efficiency in calculating atomic energies and forces to be used in molecular dynamics simulations. These MLPs have significantly advanced molecular dynamics simulations across various applications, including large-scale simulations of materials, interfaces, and chemical reactions. Despite these advances, the construction of training datasets - a critical component for the accuracy of MLPs - has not received proportional attention. This is particularly critical for chemical reactivity which depends on rare barrier-crossing events. Here we address this gap by introducing ArcaNN, a comprehensive framework designed for generating training datasets for reactive MLPs. ArcaNN employs a concurrent learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Various Chemistry Research Topics
