Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Chenyu Wang, Masatoshi Uehara, Yichun He, Amy Wang, Tommaso, Biancalani, Avantika Lal, Tommi Jaakkola, Sergey Levine, Hanchen Wang, Aviv, Regev

TL;DR
This paper introduces DRAKES, a novel method for optimizing discrete diffusion models with reward signals, enabling the generation of biologically relevant DNA and protein sequences while maintaining naturalness.
Contribution
The paper presents DRAKES, a new algorithm that allows reward optimization in discrete diffusion models using backpropagation with Gumbel-Softmax, addressing unique challenges of discrete sequences.
Findings
DRAKES effectively generates DNA sequences with high enhancer activity.
DRAKES produces protein sequences with improved stability.
The approach balances naturalness and task-specific rewards.
Abstract
Recent studies have demonstrated the strong empirical performance of diffusion models on discrete sequences across domains from natural language to biological sequence generation. For example, in the protein inverse folding task, conditional diffusion models have achieved impressive results in generating natural-like sequences that fold back into the original structure. However, practical design tasks often require not only modeling a conditional distribution but also optimizing specific task objectives. For instance, we may prefer protein sequences with high stability. To address this, we consider the scenario where we have pre-trained discrete diffusion models that can generate natural-like sequences, as well as reward models that map sequences to task objectives. We then formulate the reward maximization problem within discrete diffusion models, analogous to reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Nucleic Acid Chemistry
MethodsDiffusion
