Global properties of the energy landscape: a testing and training arena for machine learned potentials
Vlad C\u{a}rare, Fabian L. Thiemann, Joe Morrow, David J. Wales, Edward O. Pyzer-Knapp, Luke Dicks

TL;DR
This paper introduces Landscape17, a comprehensive dataset and benchmark for evaluating machine learning interatomic potentials' ability to accurately reproduce molecular energy landscapes and kinetics, revealing current limitations and guiding future improvements.
Contribution
The paper presents Landscape17, a new dataset and test suite for assessing MLIPs on molecular kinetics, highlighting their current shortcomings and proposing a pathway for enhanced models.
Findings
Current MLIPs miss over half of DFT transition states.
Models generate unphysical stable structures.
Data augmentation improves energy surface reproduction.
Abstract
Machine learning interatomic potentials (MLIPs) have achieved remarkable accuracy on standard benchmarks, yet their ability to reproduce molecular kinetics -- critical for reaction rate calculations -- remains largely unexplored. We introduce Landscape17, a dataset of complete kinetic transition networks (KTNs) for the molecules of the MD17 dataset, computed using hybrid-level density functional theory. Each KTN contains minima, transition states, and approximate steepest-descent paths, along with energies, forces, and Hessian eigenspectra at stationary points. We develop a comprehensive test suite to evaluate the MLIP ability to reproduce these reference landscapes and apply it to a number of state-of-the-art architectures. Our results reveal limitations in current MLIPs: all the models considered miss over half of the DFT transition states and generate stable unphysical structures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
