PharML.Bind: Pharmacologic Machine Learning for Protein-Ligand Interactions
Aaron D. Vose, Jacob Balma, Damon Farnsworth, Kaylie Anderson, and, Yuri K. Peterson

TL;DR
PharML.Bind is a machine learning toolkit using graph neural networks that rapidly and accurately predicts protein-ligand interactions, significantly speeding up drug discovery processes with high accuracy on large datasets.
Contribution
It introduces a novel GNN architecture and active-site-agnostic approach enabling fast, accurate predictions on extensive protein-ligand data, surpassing previous methods in speed and generalization.
Findings
Achieves 98.3% accuracy on large test set
Predicts interactions for hundreds of thousands of compounds in minutes
Demonstrates generalization to unseen proteins and drugs
Abstract
Is it feasible to create an analysis paradigm that can analyze and then accurately and quickly predict known drugs from experimental data? PharML.Bind is a machine learning toolkit which is able to accomplish this feat. Utilizing deep neural networks and big data, PharML.Bind correlates experimentally-derived drug affinities and protein-ligand X-ray structures to create novel predictions. The utility of PharML.Bind is in its application as a rapid, accurate, and robust prediction platform for discovery and personalized medicine. This paper demonstrates that graph neural networks (GNNs) can be trained to screen hundreds of thousands of compounds against thousands of targets in minutes, a vastly shorter time than previous approaches. This manuscript presents results from training and testing using the entirety of BindingDB after cleaning; this includes a test set with 19,708 X-ray…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Bioinformatics and Genomic Networks
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
