Evaluating Scalable Uncertainty Estimation Methods for DNN-Based Molecular Property Prediction
Gabriele Scalia, Colin A. Grambow, Barbara Pernici, Yi-Pei Li, William, H. Green

TL;DR
This paper compares scalable uncertainty estimation methods for deep neural networks in molecular property prediction, highlighting their performance differences and the importance of uncertainty quantification for reliable predictions.
Contribution
It introduces a unified framework and quantitative criteria to compare MC-Dropout, deep ensembles, and bootstrapping in molecular property prediction tasks.
Findings
Ensembling and bootstrapping outperform MC-Dropout in uncertainty estimation.
Different methods have specific advantages depending on the context.
Out-of-domain uncertainty remains a significant challenge.
Abstract
Advances in deep neural network (DNN) based molecular property prediction have recently led to the development of models of remarkable accuracy and generalization ability, with graph convolution neural networks (GCNNs) reporting state-of-the-art performance for this task. However, some challenges remain and one of the most important that needs to be fully addressed concerns uncertainty quantification. DNN performance is affected by the volume and the quality of the training samples. Therefore, establishing when and to what extent a prediction can be considered reliable is just as important as outputting accurate predictions, especially when out-of-domain molecules are targeted. Recently, several methods to account for uncertainty in DNNs have been proposed, most of which are based on approximate Bayesian inference. Among these, only a few scale to the large datasets required in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
MethodsConvolution
