DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Gabriele Corso, Hannes St\"ark, Bowen Jing, Regina Barzilay, Tommi, Jaakkola

TL;DR
DiffDock introduces a diffusion-based generative model for molecular docking, significantly improving accuracy and robustness over previous methods, with fast inference and reliable confidence estimates.
Contribution
The paper presents DiffDock, a novel diffusion generative model for molecular docking that outperforms existing methods in accuracy and applicability to folded structures.
Findings
38% top-1 success rate on PDBBind
Maintains higher precision on folded structures (21.7%)
Fast inference with high confidence accuracy
Abstract
Predicting the binding structure of a small molecule ligand to a protein -- a task known as molecular docking -- is critical to drug design. Recent deep learning methods that treat docking as a regression problem have decreased runtime compared to traditional search-based methods but have yet to offer substantial improvements in accuracy. We instead frame molecular docking as a generative modeling problem and develop DiffDock, a diffusion generative model over the non-Euclidean manifold of ligand poses. To do so, we map this manifold to the product space of the degrees of freedom (translational, rotational, and torsional) involved in docking and develop an efficient diffusion process on this space. Empirically, DiffDock obtains a 38% top-1 success rate (RMSD<2A) on PDBBind, significantly outperforming the previous state-of-the-art of traditional docking (23%) and deep learning (20%)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Materials Science
MethodsDiffusion
