Pretraining Graph Transformers with Atom-in-a-Molecule Quantum Properties for Improved ADMET Modeling
Alessio Fallani, Ramil Nugmanov, Jose Arjona-Medina, J\"org Kurt, Wegner, Alexandre Tkatchenko, Kostiantyn Chernichenko

TL;DR
This study demonstrates that pretraining Graph Transformers with atom-level quantum properties enhances ADMET property prediction accuracy, with analysis revealing improved atomic environment representations and model interpretability.
Contribution
It introduces a novel pretraining approach using atomic quantum properties for Graph Transformers, outperforming other strategies in ADMET modeling.
Findings
Pretraining with atomic quantum properties yields better ADMET prediction results.
Models retain pretraining information after fine-tuning, affecting latent representations.
Pretraining influences the capture of atomic environment features in the model.
Abstract
We evaluate the impact of pretraining Graph Transformer architectures on atom-level quantum-mechanical features for the modeling of absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties of drug-like compounds. We compare this pretraining strategy with two others: one based on molecular quantum properties (specifically the HOMO-LUMO gap) and one using a self-supervised atom masking technique. After fine-tuning on Therapeutic Data Commons ADMET datasets, we evaluate the performance improvement in the different models observing that models pretrained with atomic quantum mechanical properties produce in general better results. We then analyse the latent representations and observe that the supervised strategies preserve the pretraining information after finetuning and that different pretrainings produce different trends in latent expressivity across layers.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular Junctions and Nanostructures · Machine Learning in Materials Science · CO2 Reduction Techniques and Catalysts
MethodsAttention Is All You Need · Laplacian EigenMap · Dense Connections · Residual Connection · Position-Wise Feed-Forward Layer · Adam · Linear Layer · Laplacian Positional Encodings · Label Smoothing · Dropout
