Steering Masked Discrete Diffusion Models via Discrete Denoising   Posterior Prediction

Jarrid Rector-Brooks; Mohsin Hasan; Zhangzhi Peng; Zachary Quinn,; Chenghao Liu; Sarthak Mittal; Nouha Dziri; Michael Bronstein; Yoshua Bengio,; Pranam Chatterjee; Alexander Tong; Avishek Joey Bose

arXiv:2410.08134·cs.LG·October 11, 2024·2 cites

Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction

Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn,, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio,, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose

PDF

Open Access

TL;DR

This paper introduces Discrete Denoising Posterior Prediction (DDPP), a scalable probabilistic inference framework for steering Masked Diffusion Models (MDMs) in discrete data generation, applicable to images, text, and proteins.

Contribution

The paper proposes DDPP, a novel, simulation-free framework for steering pre-trained MDMs through probabilistic inference, enabling control over diverse non-differentiable reward functions.

Findings

01

Successfully steered MDMs for class-conditional image modeling

02

Achieved RLHF-based alignment of MDMs with text rewards

03

Generated protein sequences with enhanced properties validated in wet-lab experiments

Abstract

Generative modeling of discrete data underlies important applications spanning text-based agents like ChatGPT to the design of the very building blocks of life in protein sequences. However, application domains need to exert control over the generated data by steering the generative process - typically via RLHF - to satisfy a specified property, reward, or affinity metric. In this paper, we study the problem of steering Masked Diffusion Models (MDMs), a recent class of discrete diffusion models that offer a compelling alternative to traditional autoregressive models. We introduce Discrete Denoising Posterior Prediction (DDPP), a novel framework that casts the task of steering pre-trained MDMs as a problem of probabilistic inference by learning to sample from a target Bayesian posterior. Our DDPP framework leads to a family of three novel objectives that are all simulation-free, and thus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis

MethodsDiffusion