TL;DR
This paper introduces an entropy-based active learning method for graph neural network surrogate models in materials science, significantly reducing data requirements by intelligently selecting the most uncertain samples for training.
Contribution
It develops a novel coupling of graph neural networks with Gaussian processes to quantify uncertainty and guide active learning in materials property prediction.
Findings
Active learning doubles the training efficiency compared to random sampling.
The method effectively identifies uncertain regions in chemical space.
Uncertainty quantification accelerates model performance improvement.
Abstract
Graph neural networks, trained on experimental or calculated data are becoming an increasingly important tool in computational materials science. Networks, once trained, are able to make highly accurate predictions at a fraction of the cost of experiments or first-principles calculations of comparable accuracy. However these networks typically rely on large databases of labelled experiments to train the model. In scenarios where data is scarce or expensive to obtain this can be prohibitive. By building a neural network that provides a confidence on the predicted properties, we are able to develop an active learning scheme that can reduce the amount of labelled data required, by identifying the areas of chemical space where the model is most uncertain. We present a scheme for coupling a graph neural network with a Gaussian process to featurise solid-state materials and predict properties…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
