Managing Expectations and Imbalanced Training Data in Reactive Force Field Development: an Application to Water Adsorption on Alumina
Lo\"ic Dumortier, C\'eline Chizallet, Benoit Creton and, Theodorus de Bruin, Toon Verstraelen

TL;DR
This paper introduces a Balanced Loss function and workflow for ReaxFF parameter optimization, effectively managing imbalanced training data and expectations, demonstrated on water adsorption on alumina.
Contribution
A novel Balanced Loss function and workflow that replace weight assignment in ReaxFF training, improving parameter optimization for imbalanced data sets.
Findings
Successfully optimized ReaxFF for water on alumina
Reproduces both common and rare properties accurately
Demonstrates robustness in molecular dynamics simulations
Abstract
ReaxFF is a computationally efficient model for reactive molecular dynamics simulations, which has been applied to a wide variety of chemical systems. When ReaxFF parameters are not yet available for a chemistry of interest, they must be (re)optimized, for which one defines a set of training data that the new ReaxFF parameters should reproduce. ReaxFF training sets typically contain diverse properties with different units, some of which are more abundant (by orders of magnitude) than others. To find the best parameters, one conventionally minimizes a weighted sum of squared errors over all data in the training set. One of the challenges in such numerical optimizations is to assign weights so that the optimized parameters represent a good compromise between all the requirements defined in the training set. This work introduces a new loss function, called Balanced Loss, and a workflow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
