Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design
Prin Phunyaphibarn, Minhyuk Sung

TL;DR
This paper introduces the CSMC sampler, a novel method for reward-guided discrete diffusion that improves molecule and biological sequence generation by effectively performing local search without relying on noisy intermediate rewards.
Contribution
The paper proposes the Clean-Sample Markov Chain (CSMC) sampler, enabling effective test-time reward-guided sampling for discrete diffusion models without using intermediate rewards.
Findings
CSMC outperforms prior reward-guided methods in molecule and sequence generation.
The method effectively constructs a Markov chain with the target distribution as stationary.
Experiments demonstrate improved sample quality with various reward functions.
Abstract
Discrete diffusion models have recently emerged as a powerful class of generative models for chemistry and biology data. In these fields, the goal is to generate various samples with high rewards (e.g., drug-likeness in molecules), making reward-based guidance crucial. Most existing methods are based on guiding the diffusion model using intermediate rewards but tend to underperform since intermediate rewards are noisy due to the non-smooth nature of reward functions used in scientific domains. To address this, we propose Clean-Sample Markov Chain (CSMC) Sampler, a method that performs effective test-time reward-guided sampling for discrete diffusion models, enabling local search without relying on intermediate rewards. CSMC constructs a Markov chain of clean samples using the Metropolis-Hastings algorithm such that its stationary distribution is the target distribution. We design a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
