Benchmarking Accuracy and Generalizability of Four Graph Neural Networks   Using Large In Vitro ADME Datasets from Different Chemical Spaces

Fabio Broccatelli; Richard Trager; Michael Reutlinger; George Karypis,; Mufei Li

arXiv:2111.13964·q-bio.QM·February 28, 2022

Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces

Fabio Broccatelli, Richard Trager, Michael Reutlinger, George Karypis,, Mufei Li

PDF

1 Repo

TL;DR

This study benchmarks four graph neural network models against traditional machine learning methods on large ADME datasets, revealing GAT as a promising approach with comparable accuracy to experimental assays, highlighting the importance of model and data quality.

Contribution

It provides a comprehensive comparison of GNN variants on industrial ADME datasets, demonstrating GAT's superior performance and the impact of experimental error on model accuracy.

Findings

01

GAT outperforms other GNN variants and traditional models.

02

All GNNs significantly outperform fingerprint-based models.

03

Model accuracy is comparable to inter-laboratory experimental variability.

Abstract

In this work, we benchmark a variety of single- and multi-task graph neural network (GNN) models against lower-bar and higher-bar traditional machine learning approaches employing human engineered molecular features. We consider four GNN variants -- Graph Convolutional Network (GCN), Graph Attention Network (GAT), Message Passing Neural Network (MPNN), and Attentive Fingerprint (AttentiveFP). So far deep learning models have been primarily benchmarked using lower-bar traditional models solely based on fingerprints, while more realistic benchmarks employing fingerprints, whole-molecule descriptors and predictions from other related endpoints (e.g., LogD7.4) appear to be scarce for industrial ADME datasets. In addition to time-split test sets based on Genentech data, this study benefits from the availability of measurements from an external chemical space (Roche data). We identify GAT as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

awslabs/dgl-lifesci
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Neural Network · Graph Attention Network