BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning
Artem Zholus, Maksim Kuznetsov, Roman Schutski, Rim Shayakhmetov,, Daniil Polykovskiy, Sarath Chandar, Alex Zhavoronkov

TL;DR
BindGPT is a scalable, language model-based framework for 3D molecular design that generates molecules within protein binding sites, combining pretraining and reinforcement learning to outperform specialized models efficiently.
Contribution
The paper introduces BindGPT, a unified generative model that creates 3D molecules conditioned on protein pockets without equivariance assumptions, leveraging large-scale pretraining and reinforcement learning.
Findings
Performs on par or better than diffusion and graph models
Samples molecules two orders of magnitude faster
Eliminates the need for graph reconstruction step
Abstract
Generating novel active molecules for a given protein is an extremely challenging task for generative models that requires an understanding of the complex physical interactions between the molecule and its environment. In this paper, we present a novel generative model, BindGPT which uses a conceptually simple but powerful approach to create 3D molecules within the protein's binding site. Our model produces molecular graphs and conformations jointly, eliminating the need for an extra graph reconstruction step. We pretrain BindGPT on a large-scale dataset and fine-tune it with reinforcement learning using scores from external simulation software. We demonstrate how a single pretrained language model can serve at the same time as a 3D molecular generative model, conformer generator conditioned on the molecular graph, and a pocket-conditioned 3D molecule generator. Notably, the model does…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Innovative Microfluidic and Catalytic Techniques Innovation · Modular Robots and Swarm Intelligence
MethodsDiffusion
