Comparison of Optimised Geometric Deep Learning Architectures, over Varying Toxicological Assay Data Environments
Alexander D. Kalian, Lennart Otte, Jaewook Lee, Emilio Benfenati, Jean-Lou C.M. Dorne, Claire Potter, Olivia J. Osborne, Miao Guo, Christer Hogstrand

TL;DR
This study systematically compares different geometric deep learning architectures, specifically GCNs, GATs, and GINs, across various toxicological datasets to identify optimal models based on data abundance and assay characteristics.
Contribution
It provides a comprehensive evaluation of GNN architectures in cheminformatics, highlighting the conditions under which each architecture performs best, which was previously underexplored.
Findings
GINs outperform GCNs and GATs in data-rich environments
GATs excel in data-scarce environments
Optimized hyperparameters reveal distinct convergence behaviors
Abstract
Geometric deep learning is an emerging technique in Artificial Intelligence (AI) driven cheminformatics, however the unique implications of different Graph Neural Network (GNN) architectures are poorly explored, for this space. This study compared performances of Graph Convolutional Networks (GCNs), Graph Attention Networks (GATs) and Graph Isomorphism Networks (GINs), applied to 7 different toxicological assay datasets of varying data abundance and endpoint, to perform binary classification of assay activation. Following pre-processing of molecular graphs, enforcement of class-balance and stratification of all datasets across 5 folds, Bayesian optimisations were carried out, for each GNN applied to each assay dataset (resulting in 21 unique Bayesian optimisations). Optimised GNNs performed at Area Under the Curve (AUC) scores ranging from 0.728-0.849 (averaged across all folds),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods
