DebiasedDTA: A Framework for Improving the Generalizability of Drug-Target Affinity Prediction Models
R{\i}za \"Oz\c{c}elik, Alperen Ba\u{g}, Berk At{\i}l, Melih Barsbey,, Arzucan \"Ozg\"ur, Elif \"Ozk{\i}r{\i}ml{\i}

TL;DR
DebiasedDTA is a new training framework that reweights samples to reduce dataset bias, thereby improving the ability of drug-target affinity models to generalize to unseen biomolecules.
Contribution
It introduces a reweighting method applicable to various models, enhancing generalizability and performance on unseen data, and provides an open-source library for the community.
Findings
Improved prediction accuracy on unseen biomolecules.
Enhanced model robustness across different architectures.
Applicable to multiple datasets and biomolecule representations.
Abstract
Computational models that accurately predict the binding affinity of an input protein-chemical pair can accelerate drug discovery studies. These models are trained on available protein-chemical interaction datasets, which may contain dataset biases that may lead the model to learn dataset-specific patterns, instead of generalizable relationships. As a result, the prediction performance of models drops for previously unseen biomolecules, the prediction models cannot generalize to biomolecules outside of the dataset. The latest approaches that aim to improve model generalizability either have limited applicability or introduce the risk of degrading prediction performance. Here, we present DebiasedDTA, a novel drug-target affinity (DTA) prediction model training framework that addresses dataset biases to improve the generalizability of affinity prediction models.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · vaccines and immunoinformatics approaches
