Information-entropy-driven generation of material-agnostic datasets for machine-learning interatomic potentials
Aparna P. A. Subramanyam, Danny Perez

TL;DR
This paper introduces an entropy-based automated dataset generation method for machine-learning interatomic potentials that ensures broad configuration space coverage and material-agnostic robustness, improving accuracy and reliability.
Contribution
The authors propose a novel entropy-driven approach to generate diverse, material-agnostic datasets for training interatomic potentials, enhancing their accuracy and robustness across various materials and conditions.
Findings
MLIAPs trained on the dataset are accurate across multiple metrics.
The approach yields highly robust potentials without fine-tuning.
Applicable to a wide range of unary materials and configurations.
Abstract
In contrast to their empirical counterparts, machine-learning interatomic potentials (MLIAPs) promise to deliver near-quantum accuracy over broad regions of configuration space. However, due to their generic functional forms and extreme flexibility, they can catastrophically fail to capture the properties of novel, out-of-sample configurations, making the quality of the training set a determining factor, especially when investigating materials under extreme conditions. We propose a novel automated dataset generation method based on the maximization of the information entropy of the feature distribution, aiming at an extremely broad coverage of the configuration space in a way that is agnostic to the properties of specific target materials. The ability of the dataset to capture unique material properties is demonstrated on a range of unary materials, including elements with the fcc (Al),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
