BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

Anjie Qiao; Zhen Wang; Yaliang Li; Jiahua Rao; Yuedong Yang

arXiv:2602.15236·cs.LG·February 18, 2026

BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

Anjie Qiao, Zhen Wang, Yaliang Li, Jiahua Rao, Yuedong Yang

PDF

Open Access

TL;DR

BindCLIP introduces a unified contrastive-generative framework for virtual screening that enhances interaction-aware ligand-pocket representations, improving out-of-distribution performance and ligand ranking accuracy.

Contribution

It combines contrastive learning with a pose-generation objective and novel regularizers to produce more accurate and generalizable virtual screening embeddings.

Findings

01

Improves virtual screening accuracy on public benchmarks.

02

Enhances out-of-distribution ligand ranking performance.

03

Achieves better interaction-relevant embedding representations.

Abstract

Virtual screening aims to efficiently identify active ligands from massive chemical libraries for a given target pocket. Recent CLIP-style models such as DrugCLIP enable scalable virtual screening by embedding pockets and ligands into a shared space. However, our analyses indicate that such representations can be insensitive to fine-grained binding interactions and may rely on shortcut correlations in training data, limiting their ability to rank ligands by true binding compatibility. To address these issues, we propose BindCLIP, a unified contrastive-generative representation learning framework for virtual screening. BindCLIP jointly trains pocket and ligand encoders using CLIP-style contrastive learning together with a pocket-conditioned diffusion objective for binding pose generation, so that pose-level supervision directly shapes the retrieval embedding space toward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics