Uncertainty Estimation for Molecules: Desiderata and Methods
Tom Wollschl\"ager, Nicholas Gao, Bertrand Charpentier and, Mohamed Amine Ketata, Stephan G\"unnemann

TL;DR
This paper evaluates uncertainty estimation methods for molecular graph neural networks, identifies unmet desiderata, and proposes a Gaussian Process-based approach that improves out-of-distribution detection while maintaining accuracy.
Contribution
It introduces Localized Neural Kernel, a novel GP-based method satisfying key desiderata for uncertainty estimation in molecular GNNs.
Findings
LNK outperforms dropout and evidential regression in out-of-distribution detection.
LNK maintains high predictive accuracy.
Survey shows existing methods do not meet all desiderata.
Abstract
Graph Neural Networks (GNNs) are promising surrogates for quantum mechanical calculations as they establish unprecedented low errors on collections of molecular dynamics (MD) trajectories. Thanks to their fast inference times they promise to accelerate computational chemistry applications. Unfortunately, despite low in-distribution (ID) errors, such GNNs might be horribly wrong for out-of-distribution (OOD) samples. Uncertainty estimation (UE) may aid in such situations by communicating the model's certainty about its prediction. Here, we take a closer look at the problem and identify six key desiderata for UE in molecular force fields, three 'physics-informed' and three 'application-focused' ones. To overview the field, we survey existing methods from the field of UE and analyze how they fit to the set desiderata. By our analysis, we conclude that none of the previous works satisfies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Metabolomics and Mass Spectrometry Studies
MethodsNone · Dropout · Gaussian Process
