Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

Prin Phunyaphibarn; Minhyuk Sung

arXiv:2602.09424·cs.LG·February 11, 2026

Reward-Guided Discrete Diffusion via Clean-Sample Markov Chain for Molecule and Biological Sequence Design

Prin Phunyaphibarn, Minhyuk Sung

PDF

Open Access

TL;DR

This paper introduces the CSMC sampler, a novel method for reward-guided discrete diffusion that improves molecule and biological sequence generation by effectively performing local search without relying on noisy intermediate rewards.

Contribution

The paper proposes the Clean-Sample Markov Chain (CSMC) sampler, enabling effective test-time reward-guided sampling for discrete diffusion models without using intermediate rewards.

Findings

01

CSMC outperforms prior reward-guided methods in molecule and sequence generation.

02

The method effectively constructs a Markov chain with the target distribution as stationary.

03

Experiments demonstrate improved sample quality with various reward functions.

Abstract

Discrete diffusion models have recently emerged as a powerful class of generative models for chemistry and biology data. In these fields, the goal is to generate various samples with high rewards (e.g., drug-likeness in molecules), making reward-based guidance crucial. Most existing methods are based on guiding the diffusion model using intermediate rewards but tend to underperform since intermediate rewards are noisy due to the non-smooth nature of reward functions used in scientific domains. To address this, we propose Clean-Sample Markov Chain (CSMC) Sampler, a method that performs effective test-time reward-guided sampling for discrete diffusion models, enabling local search without relying on intermediate rewards. CSMC constructs a Markov chain of clean samples using the Metropolis-Hastings algorithm such that its stationary distribution is the target distribution. We design a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference