# Bayesian semi-supervised learning for uncertainty-calibrated prediction   of molecular properties and active learning

**Authors:** Yao Zhang, Alpha A. Lee

arXiv: 1902.00925 · 2019-07-26

## TL;DR

This paper introduces Bayesian semi-supervised graph convolutional neural networks for predicting molecular properties, emphasizing uncertainty quantification and active learning to improve reliability and data efficiency in drug discovery.

## Contribution

It presents a novel Bayesian semi-supervised approach that enhances uncertainty estimation and active learning capabilities in molecular property prediction.

## Key findings

- Uncertainty estimates are statistically principled and reliable.
- Semi-supervised learning improves performance with limited data.
- Active learning accelerates discovery by selecting informative samples.

## Abstract

Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: Outliers can derail a discovery campaign, thus models need reliably predict when it will fail, even when the training data is biased; experiments are expensive, thus models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, keeping uncertainty estimates accurate in the low data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.00925/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1902.00925/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/1902.00925/full.md

---
Source: https://tomesphere.com/paper/1902.00925