GenMol: A Drug Discovery Generalist with Discrete Diffusion
Seul Lee, Karsten Kreis, Srimukh Prasad Veccham, Meng Liu, Danny Reidenbach, Yuxing Peng, Saee Paliwal, Weili Nie, Arash Vahdat

TL;DR
GenMol is a versatile discrete diffusion model that unifies multiple drug discovery tasks, improving efficiency and performance in molecular generation and optimization.
Contribution
It introduces a single discrete diffusion framework with fragment-based generation, fragment remasking, and molecular context guidance for diverse drug discovery applications.
Findings
Outperforms GPT-based models in de novo generation
Achieves state-of-the-art in goal-directed hit generation
Enables effective exploration of chemical space
Abstract
Drug discovery is a complex process that involves multiple stages and tasks. However, existing molecular generative models can only tackle some of these tasks. We present Generalist Molecular generative model (GenMol), a versatile framework that uses only a single discrete diffusion model to handle diverse drug discovery scenarios. GenMol generates Sequential Attachment-based Fragment Embedding (SAFE) sequences through non-autoregressive bidirectional parallel decoding, thereby allowing the utilization of a molecular context that does not rely on the specific token ordering while having better sampling efficiency. GenMol uses fragments as basic building blocks for molecules and introduces fragment remasking, a strategy that optimizes molecules by regenerating masked fragments, enabling effective exploration of chemical space. We further propose molecular context guidance (MCG), a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Monoclonal and Polyclonal Antibodies Research · Biosimilars and Bioanalytical Methods
MethodsDiffusion
