Transfer learning for solvation free energies: from quantum chemistry to experiments
Florence H. Vermeire, William H. Green

TL;DR
This paper introduces a transfer learning approach combining quantum calculations and experimental data to accurately predict solvation free energies, improving out-of-sample predictions despite experimental noise.
Contribution
It demonstrates the effectiveness of pre-training neural networks on quantum data to enhance solvation energy predictions with limited experimental data.
Findings
Pre-trained models outperform quantum calculations in accuracy.
Significant improvements in out-of-sample predictions for new solvents and solutes.
Model errors are limited by experimental data noise, with potential for further reduction.
Abstract
Data scarcity, bias, and experimental noise are all frequently encountered problems in the application of deep learning to chemical and material science disciplines. Transfer learning has proven effective in compensating for the lack in data. The use of quantum calculations in machine learning enables the generation of a diverse dataset and ensures that learning is less affected by noise inherent to experimental databases. In this work, we propose a transfer learning approach for the prediction of solvation free energies that combines fundamentals from quantum calculations with the higher accuracy of experimental measurements. The employed model architecture is based on the directed-message passing neural network for the molecular embedding of solvent and solute molecules. A significant advantage of models pre-trained on quantum calculations is demonstrated for small experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
