Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Yangming Li, Chieh-Hsin Lai, Carola-Bibiane Sch\"onlieb, Yuki, Mitsufuji, Stefano Ermon

TL;DR
This paper introduces Bellman Diffusion, a new generative modeling framework that preserves linearity in distribution space for MDPs, enabling faster and more accurate distributional RL applications.
Contribution
It proposes Bellman Diffusion, a linear operator-based DGM that maintains the linearity of the Bellman equation, with divergence-based training and a novel SDE for sampling.
Findings
Achieves accurate distribution field estimations.
Converges 1.5x faster than histogram-based methods.
Effectively generates images with high fidelity.
Abstract
Deep Generative Models (DGMs), including Energy-Based Models (EBMs) and Score-based Generative Models (SGMs), have advanced high-fidelity data generation and complex continuous distribution approximation. However, their application in Markov Decision Processes (MDPs), particularly in distributional Reinforcement Learning (RL), remains underexplored, with conventional histogram-based methods dominating the field. This paper rigorously highlights that this application gap is caused by the nonlinearity of modern DGMs, which conflicts with the linearity required by the Bellman equation in MDPs. For instance, EBMs involve nonlinear operations such as exponentiating energy functions and normalizing constants. To address this, we introduce Bellman Diffusion, a novel DGM framework that maintains linearity in MDPs through gradient and scalar field modeling. With divergence-based training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Mathematical and Theoretical Analysis · Statistical and Computational Modeling
MethodsDiffusion
