Active Learning the Coarse-Grained Energy Landscape For Water Clusters From Sparse Training Data
Troy D. Loeffler, Tarak K. Patra, Henry Chan, Mathew Cherukara,, Subramanian K.R.S. Sankaranarayanan

TL;DR
This paper introduces an active learning method to train neural networks for water cluster energy landscapes using only 426 structures, achieving high accuracy with minimal data and enabling efficient molecular simulations.
Contribution
The study presents a novel active learning workflow that significantly reduces training data requirements for accurate energy landscape modeling of water clusters.
Findings
Achieved ~2 meV/molecule MAE in energy predictions
Accurately captured cluster structures and free energies
Demonstrated effective training with only 426 structures
Abstract
ANNs are currently trained by generating large quantities (On the order of or greater) of structural data in hopes that the ANN has adequately sampled the energy landscape both near and far-from-equilibrium. This can, however, be a bit prohibitive when it comes to more accurate levels of quantum theory. As such it is desirable to train a model using the absolute minimal data set possible, especially when costs of high-fidelity calculations such as CCSD and QMC are high. Here, we present an Active Learning approach that iteratively trains an ANN model to faithfully replicate the coarse-grained energy surface of water clusters using only 426 total structures in its training data. Our active learning workflow starts with a sparse training dataset which is continually updated via a Monte Carlo scheme that sparsely queries the energy landscape and tests the network performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Machine Learning and Algorithms · Machine Learning and ELM
