TacoGFN: Target-conditioned GFlowNet for Structure-based Drug Design

Tony Shen; Seonghwan Seo; Grayson Lee; Mohit Pandey; Jason R Smith,; Artem Cherkasov; Woo Youn Kim; Martin Ester

arXiv:2310.03223·cs.LG·September 9, 2024·1 cites

TacoGFN: Target-conditioned GFlowNet for Structure-based Drug Design

Tony Shen, Seonghwan Seo, Grayson Lee, Mohit Pandey, Jason R Smith,, Artem Cherkasov, Woo Youn Kim, Martin Ester

PDF

Open Access 3 Repos

TL;DR

TacoGFN is a novel GFlowNet-based model that generates protein-binding molecules conditioned on pocket structures, achieving state-of-the-art success rates and binding affinity scores in structure-based drug design.

Contribution

It introduces a reward-conditioned generative approach that overcomes data limitations and outperforms existing methods in structure-based drug design.

Findings

01

Achieves 56.0% success rate on CrossDocked2020 benchmark.

02

Median Vina Dock score of -8.44 kcal/mol, outperforming previous models.

03

Fine-tuning improves success rate to 88.8% and score to -10.93 kcal/mol.

Abstract

Searching the vast chemical space for drug-like molecules that bind with a protein pocket is a challenging task in drug discovery. Recently, structure-based generative models have been introduced which promise to be more efficient by learning to generate molecules for any given protein structure. However, since they learn the distribution of a limited protein-ligand complex dataset, structure-based methods do not yet outperform optimization-based methods that generate binding molecules for just one pocket. To overcome limitations on data while leveraging learning across protein targets, we choose to model the reward distribution conditioned on pocket structure, instead of the training data distribution. We design TacoGFN, a novel GFlowNet-based approach for structure-based drug design, which can generate molecules conditioned on any protein pocket structure with probabilities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings