How accurate are foundational machine learning interatomic potentials for heterogeneous catalysis?
Luuk H. E. Kempen, Raffaele Cheula, Mie Andersen

TL;DR
This paper systematically evaluates the zero-shot performance of 80 foundational MLIPs in heterogeneous catalysis, revealing their strengths, limitations, and the need for application-specific assessment.
Contribution
It provides the first comprehensive benchmark of MLIPs across diverse catalytic tasks and highlights their variable performance and current limitations.
Findings
MLIPs perform well for vacancy and zero-point energy predictions.
Many MLIPs fail catastrophically on magnetic materials.
No single MLIP universally outperforms others.
Abstract
Foundational machine learning interatomic potentials (MLIPs) are being developed at a rapid pace, promising closer and closer approximation to ab initio accuracy. This unlocks the possibility to simulate much larger length and time scales. However, benchmarks for these MLIPs are usually limited to ordered, crystalline and bulk materials. Hence, reported performance does not necessarily accurately reflect MLIP performance in real applications such as heterogeneous catalysis. Here, we systematically analyze zero-shot performance of 80 different MLIPs, evaluating tasks typical for heterogeneous catalysis across a range of different data sets, including adsorption and reaction on surfaces of alloyed metals, oxides, and metal-oxide interfacial systems. We demonstrate that current-generation foundational MLIPs can already perform at high accuracy for applications such as predicting vacancy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Electrocatalysts for Energy Conversion · CO2 Reduction Techniques and Catalysts
