TL;DR
This paper investigates why machine learning models for predicting activation energies often fail to generalize across different reactions and introduces a new convolutional model with quantum-chemical descriptors that improves transferability and interpretability.
Contribution
It identifies limitations of existing ML models based on difference fingerprints and proposes a convolutional model incorporating quantum-chemical descriptors and transition state info to enhance transferability.
Findings
Difference fingerprint models lack transferability outside training data.
The proposed convolutional model improves out-of-distribution prediction accuracy.
Atom-level contributions help interpret reactivity patterns.
Abstract
The calculation of reactive properties is a challenging task in chemical reaction discovery. Machine learning (ML) methods play an important role in accelerating electronic structure predictions of activation energies and reaction enthalpies, and are a crucial ingredient to enable large-scale automated reaction network discovery with reactions. Unfortunately, the predictive accuracy of existing ML models does not yet reach the required accuracy across the space of possible chemical reactions to enable subsequent kinetic simulations that even qualitatively agree with experimental kinetics. Here, we comprehensively assess the underlying reasons for prediction failures within a selection of machine-learned models of reactivity. Models based on difference fingerprints between reactant and product structures lack transferability despite providing good in-distribution predictions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
