Understanding and improving transferability in machine-learned   activation energy predictors

Joe Gilkes; Mark Storr; Reinhard J. Maurer; Scott Habershon

arXiv:2505.00604·physics.chem-ph·May 2, 2025

Understanding and improving transferability in machine-learned activation energy predictors

Joe Gilkes, Mark Storr, Reinhard J. Maurer, Scott Habershon

PDF

1 Repo

TL;DR

This paper investigates why machine learning models for predicting activation energies often fail to generalize across different reactions and introduces a new convolutional model with quantum-chemical descriptors that improves transferability and interpretability.

Contribution

It identifies limitations of existing ML models based on difference fingerprints and proposes a convolutional model incorporating quantum-chemical descriptors and transition state info to enhance transferability.

Findings

01

Difference fingerprint models lack transferability outside training data.

02

The proposed convolutional model improves out-of-distribution prediction accuracy.

03

Atom-level contributions help interpret reactivity patterns.

Abstract

The calculation of reactive properties is a challenging task in chemical reaction discovery. Machine learning (ML) methods play an important role in accelerating electronic structure predictions of activation energies and reaction enthalpies, and are a crucial ingredient to enable large-scale automated reaction network discovery with $> 1 0^{3}$ reactions. Unfortunately, the predictive accuracy of existing ML models does not yet reach the required accuracy across the space of possible chemical reactions to enable subsequent kinetic simulations that even qualitatively agree with experimental kinetics. Here, we comprehensively assess the underlying reasons for prediction failures within a selection of machine-learned models of reactivity. Models based on difference fingerprints between reactant and product structures lack transferability despite providing good in-distribution predictions.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Kinetica-jl/Kinetica.jl
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.