Bellman Diffusion Models

Liam Schramm; Abdeslam Boularias

arXiv:2407.12163·cs.LG·November 4, 2025

Open Access

TL;DR

This paper explores diffusion models as a model class for the successor state measure (SSM) of a policy, and finds that enforcing the Bellman flow constraints leads to a simple Bellman update on the diffusion step distribution.

Contribution

The paper applies diffusion models to the successor state measure of a policy and uses the Bellman flow constraints to derive a simple Bellman update on the diffusion step distribution.

Findings

01

Enforcing the Bellman flow constraints leads to a simple Bellman update on the diffusion step distribution.

02

Diffusion models have recently been shown to be effective at modelling policies for offline reinforcement learning and imitation learning, which motivates the approach.

03

Diffusion is explored as a model class for the successor state measure (SSM) of a policy, rather than for the policy itself.

Abstract

Diffusion models have seen tremendous success as generative architectures. Recently, they have been shown to be effective at modelling policies for offline reinforcement learning and imitation learning. We explore using diffusion as a model class for the successor state measure (SSM) of a policy. We find that enforcing the Bellman flow constraints leads to a simple Bellman update on the diffusion step distribution.
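To make the Bellman flow constraint concrete, here is a minimal tabular sketch in Python. It is not the paper's diffusion-based update; it is a finite-state analogue under assumptions that do not come from the abstract: the SSM is taken to be the normalized discounted occupancy M[s][x] = (1 - gamma) * sum_t gamma^t * P(s_t = x | s_0 = s), and the transition matrix `P`, the discount `GAMMA`, and all function names are hypothetical. Under that convention the flow constraint reads M = (1 - gamma) * I + gamma * P M, and iterating it as an update converges to the SSM.

```python
# Tabular sketch of a successor state measure (SSM) for a fixed policy.
# Assumed convention (not taken from the paper):
#   M[s][x] = (1 - gamma) * sum_t gamma^t * P(s_t = x | s_0 = s)
# Assumed Bellman flow constraint: M = (1 - gamma) * I + gamma * P @ M


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def bellman_update(m, p, gamma):
    """Apply one Bellman flow update to the current SSM estimate m."""
    n = len(p)
    pm = matmul(p, m)
    return [
        [(1.0 - gamma) * (1.0 if i == j else 0.0) + gamma * pm[i][j] for j in range(n)]
        for i in range(n)
    ]


def successor_state_measure(p, gamma, iters=500):
    """Iterate the update from a uniform initial guess (a gamma-contraction)."""
    n = len(p)
    m = [[1.0 / n] * n for _ in range(n)]
    for _ in range(iters):
        m = bellman_update(m, p, gamma)
    return m


# Hypothetical 3-state chain induced by some fixed policy.
P = [
    [0.9, 0.1, 0.0],
    [0.0, 0.5, 0.5],
    [0.2, 0.0, 0.8],
]
GAMMA = 0.9

M = successor_state_measure(P, GAMMA)

# At the fixed point, one more update should leave M unchanged.
residual = max(
    abs(a - b)
    for row_m, row_u in zip(M, bellman_update(M, P, GAMMA))
    for a, b in zip(row_m, row_u)
)
# Each row of M is a probability distribution over future states.
row_sums = [sum(row) for row in M]
```

In this tabular setting the update is a contraction with factor gamma, so the iteration converges from any starting distribution. The abstract describes an analogous update acting on the diffusion step distribution, the details of which are not given here.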


Taxonomy

Topics: Opinion Dynamics and Social Influence

Methods: Diffusion