AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands
Ayan Chatterjee, Robin Walters, Zohair Shafi, Omair Shafi Ahmed,, Michael Sebek, Deisy Gysi, Rose Yu, Tina Eliassi-Rad, Albert-L\'aszl\'o, Barab\'asi, Giulia Menichetti

TL;DR
AI-Bind enhances drug-target interaction predictions for novel proteins and ligands by addressing model generalization issues through network-based sampling and unsupervised pre-training, validated on SARS-CoV-2 proteins.
Contribution
Introduces AI-Bind, a novel pipeline combining network sampling and pre-training to improve binding predictions for unseen proteins and ligands.
Findings
AI-Bind outperforms existing models on novel structures.
Predicted drug interactions with SARS-CoV-2 proteins validated by docking.
Identified potential binding sites on amino acid sequences.
Abstract
Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Then, we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training, allowing us to limit the annotation imbalance and improve binding predictions for novel proteins and ligands. We illustrate the value of AI-Bind by predicting drugs and natural compounds with binding affinity to SARS-CoV-2 viral proteins and the associated human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Bioinformatics and Genomic Networks · Genetics, Bioinformatics, and Biomedical Research
