Meta-Learning GNN Initializations for Low-Resource Molecular Property Prediction
Cuong Q. Nguyen, Constantine Kreatsoulas, and Kim M. Branson

TL;DR
This paper evaluates meta-learning methods, specifically MAML and its variants, for initializing graph neural networks to improve low-resource molecular property prediction, showing significant performance gains over traditional pre-training.
Contribution
It demonstrates that meta-initializations learned via MAML significantly enhance GNN performance in low-resource drug discovery tasks, outperforming multi-task pre-training on most benchmarks.
Findings
Meta-initializations outperform pre-training on 16/20 in-distribution tasks.
Meta-initializations outperform on all out-of-distribution tasks.
Consistent best performance across various fine-tuning data sizes.
Abstract
Building in silico models to predict chemical properties and activities is a crucial step in drug discovery. However, limited labeled data often hinders the application of deep learning in this setting. Meanwhile advances in meta-learning have enabled state-of-the-art performances in few-shot learning benchmarks, naturally prompting the question: Can meta-learning improve deep learning performance in low-resource drug discovery projects? In this work, we assess the transferability of graph neural networks initializations learned by the Model-Agnostic Meta-Learning (MAML) algorithm - and its variants FO-MAML and ANIL - for chemical properties and activities tasks. Using the ChEMBL20 dataset to emulate low-resource settings, our benchmark shows that meta-initializations perform comparably to or outperform multi-task pre-training baselines on 16 out of 20 in-distribution tasks and on all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Metabolomics and Mass Spectrometry Studies
