BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening
Anjie Qiao, Zhen Wang, Yaliang Li, Jiahua Rao, Yuedong Yang

TL;DR
BindCLIP introduces a unified contrastive-generative framework for virtual screening that enhances interaction-aware ligand-pocket representations, improving out-of-distribution performance and ligand ranking accuracy.
Contribution
It combines contrastive learning with a pose-generation objective and novel regularizers to produce more accurate and generalizable virtual screening embeddings.
Findings
Improves virtual screening accuracy on public benchmarks.
Enhances out-of-distribution ligand ranking performance.
Achieves better interaction-relevant embedding representations.
Abstract
Virtual screening aims to efficiently identify active ligands from massive chemical libraries for a given target pocket. Recent CLIP-style models such as DrugCLIP enable scalable virtual screening by embedding pockets and ligands into a shared space. However, our analyses indicate that such representations can be insensitive to fine-grained binding interactions and may rely on shortcut correlations in training data, limiting their ability to rank ligands by true binding compatibility. To address these issues, we propose BindCLIP, a unified contrastive-generative representation learning framework for virtual screening. BindCLIP jointly trains pocket and ligand encoders using CLIP-style contrastive learning together with a pocket-conditioned diffusion objective for binding pose generation, so that pose-level supervision directly shapes the retrieval embedding space toward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics
