A framework for a generalisation analysis of machine-learned interatomic potentials
Christoph Ortner, Yangshuai Wang

TL;DR
This paper introduces a rigorous framework to analyze the generalization capabilities of machine-learned interatomic potentials, providing insights into training data requirements and model accuracy for complex atomic simulations.
Contribution
It presents a novel theoretical framework for understanding MLIP generalization, applied to defect simulations, and offers guidance for training data collection and loss function design.
Findings
Accuracy depends on training structure size
Observation types influence model performance
The framework justifies and improves current practices
Abstract
Machine-learned interatomic potentials (MLIPs) and force fields (i.e. interaction laws for atoms and molecules) are typically trained on limited data-sets that cover only a very small section of the full space of possible input structures. MLIPs are nevertheless capable of making accurate predictions of forces and energies in simulations involving (seemingly) much more complex structures. In this article we propose a framework within which this kind of generalisation can be rigorously understood. As a prototypical example, we apply the framework to the case of simulating point defects in a crystalline solid. Here, we demonstrate how the accuracy of the simulation depends explicitly on the size of the training structures, on the kind of observations (e.g., energies, forces, force constants, virials) to which the model has been fitted, and on the fit accuracy. The new theoretical insights…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Electron and X-Ray Spectroscopy Techniques · Advanced Electron Microscopy Techniques and Applications
