LaPro-DTA: Latent Dual-View Drug Representations and Salient Protein Feature Extraction for Generalizable Drug--Target Affinity Prediction
Zihan Dun, Liuyi Xu, An-Yang Lu, Shuang Li, Yining Qian

TL;DR
LaPro-DTA introduces a robust framework for drug-target affinity prediction that enhances generalization to unseen data by dual-view drug representations and salient protein feature extraction, outperforming existing methods.
Contribution
It proposes a novel latent dual-view drug representation and a pattern-aware protein feature extraction strategy to improve generalization and interpretability in DTA prediction.
Findings
8% MSE reduction on Davis dataset in unseen-drug setting
Outperforms state-of-the-art methods in benchmark tests
Provides interpretable insights into binding mechanisms
Abstract
Drug--target affinity prediction is pivotal for accelerating drug discovery, yet existing methods suffer from significant performance degradation in realistic cold-start scenarios (unseen drugs/targets/pairs), primarily driven by overfitting to training instances and information loss from irrelevant target sequences. In this paper, we propose LaPro-DTA, a framework designed to achieve robust and generalizable DTA prediction. To tackle overfitting, we devise a latent dual-view drug representation mechanism. It synergizes an instance-level view to capture fine-grained substructures with stochastic perturbation and a distribution-level view to distill generalized chemical scaffolds via semantic remapping, thereby enforcing the model to learn transferable structural rules rather than memorizing specific samples. To mitigate information loss, we introduce a salient protein feature extraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Bioinformatics
