3D Infomax improves GNNs for Molecular Property Prediction
Hannes St\"ark, Dominique Beaini, Gabriele Corso, Prudencio Tossou,, Christian Dallago, Stephan G\"unnemann, Pietro Li\`o

TL;DR
This paper introduces a self-supervised pre-training method for GNNs that captures 3D molecular information from 2D graphs, significantly enhancing molecular property prediction without needing explicit 3D structures.
Contribution
It proposes a novel 3D Infomax pre-training approach that enables GNNs to implicitly learn 3D geometry from 2D molecular graphs, improving downstream prediction tasks.
Findings
22% average MAE reduction on quantum properties
Effective transfer of learned representations across datasets
Significant performance gains in molecular property prediction
Abstract
Molecular property prediction is one of the fastest-growing applications of deep learning with critical real-world impacts. Including 3D molecular structure as input to learned models improves their performance for many molecular tasks. However, this information is infeasible to compute at the scale required by several real-world applications. We propose pre-training a model to reason about the geometry of molecules given only their 2D molecular graphs. Using methods from self-supervised learning, we maximize the mutual information between 3D summary vectors and the representations of a Graph Neural Network (GNN) such that they contain latent 3D information. During fine-tuning on molecules with unknown geometry, the GNN still generates implicit 3D information and can use it to improve downstream tasks. We show that 3D pre-training provides significant improvements for a wide range of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks
MethodsGraph Neural Network
