Managing Expectations and Imbalanced Training Data in Reactive Force   Field Development: an Application to Water Adsorption on Alumina

Lo\"ic Dumortier; C\'eline Chizallet; Benoit Creton and; Theodorus de Bruin; Toon Verstraelen

arXiv:2404.14338·physics.chem-ph·April 23, 2024

Managing Expectations and Imbalanced Training Data in Reactive Force Field Development: an Application to Water Adsorption on Alumina

Lo\"ic Dumortier, C\'eline Chizallet, Benoit Creton and, Theodorus de Bruin, Toon Verstraelen

PDF

TL;DR

This paper introduces a Balanced Loss function and workflow for ReaxFF parameter optimization, effectively managing imbalanced training data and expectations, demonstrated on water adsorption on alumina.

Contribution

A novel Balanced Loss function and workflow that replace weight assignment in ReaxFF training, improving parameter optimization for imbalanced data sets.

Findings

01

Successfully optimized ReaxFF for water on alumina

02

Reproduces both common and rare properties accurately

03

Demonstrates robustness in molecular dynamics simulations

Abstract

ReaxFF is a computationally efficient model for reactive molecular dynamics simulations, which has been applied to a wide variety of chemical systems. When ReaxFF parameters are not yet available for a chemistry of interest, they must be (re)optimized, for which one defines a set of training data that the new ReaxFF parameters should reproduce. ReaxFF training sets typically contain diverse properties with different units, some of which are more abundant (by orders of magnitude) than others. To find the best parameters, one conventionally minimizes a weighted sum of squared errors over all data in the training set. One of the challenges in such numerical optimizations is to assign weights so that the optimized parameters represent a good compromise between all the requirements defined in the training set. This work introduces a new loss function, called Balanced Loss, and a workflow…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.