Transfer learning on large datasets for the accurate prediction of material properties
Noah Hoffmann (1), Jonathan Schmidt (2,1), Silvana Botti (2), Miguel, A. L. Marques (1) ((1) Institut f\"ur Physik, Martin-Luther-Universit\"at, Halle-Wittenberg, D-06099 Halle, Germany, (2) Institut f\"ur, Festk\"orpertheorie und -optik, Friedrich-Schiller-Universit\"at Jena,

TL;DR
This paper demonstrates that transfer learning with graph neural networks trained on large crystal datasets can effectively extend predictions to different density functionals, reducing data needs and improving accuracy in material property prediction.
Contribution
It introduces a transfer learning approach that leverages large datasets for different density functionals, enhancing material property predictions with fewer data and analyzing dataset size effects.
Findings
Pre-training reduces dataset size needed for accuracy.
Error scales linearly with dataset size on a log-log scale.
Additional low-cost functional calculations improve predictions.
Abstract
Graph neural networks trained on large crystal structure databases are extremely effective in replacing ab initio calculations in the discovery and characterization of materials. However, crystal structure datasets comprising millions of materials exist only for the Perdew-Burke-Ernzerhof (PBE) functional. In this work, we investigate the effectiveness of transfer learning to extend these models to other density functionals. We show that pre-training significantly reduces the size of the dataset required to achieve chemical accuracy and beyond. We also analyze in detail the relationship between the transfer-learning performance and the size of the datasets used for the initial training of the model and transfer learning. We confirm a linear dependence of the error on the size of the datasets on a log-log scale, with a similar slope for both training and the pre-training datasets. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · X-ray Diffraction in Crystallography
