Bayesian Flow Networks
Alex Graves, Rupesh Kumar Srivastava, Timothy Atkinson, Faustino Gomez

TL;DR
Bayesian Flow Networks (BFNs) are a new generative modeling approach that updates independent distributions via Bayesian inference, enabling efficient, differentiable, and flexible generation for both discrete and continuous data, with competitive results.
Contribution
This paper introduces BFNs, a novel generative model that simplifies diffusion processes by removing the forward step and supports gradient-based guidance in discrete domains.
Findings
Achieved competitive log-likelihoods on image datasets
Outperformed discrete diffusion models on language modeling
Supports gradient-based sample guidance in discrete data
Abstract
This paper introduces Bayesian Flow Networks (BFNs), a new class of generative model in which the parameters of a set of independent distributions are modified with Bayesian inference in the light of noisy data samples, then passed as input to a neural network that outputs a second, interdependent distribution. Starting from a simple prior and iteratively updating the two distributions yields a generative procedure similar to the reverse process of diffusion models; however it is conceptually simpler in that no forward process is required. Discrete and continuous-time loss functions are derived for continuous, discretised and discrete data, along with sample generation procedures. Notably, the network inputs for discrete data lie on the probability simplex, and are therefore natively differentiable, paving the way for gradient-based sample guidance and few-step generation in discrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Healthcare
MethodsDiffusion
