Sampling from Flow Language Models via Marginal-Conditioned Bridges

Iskander Azangulov; Leo Zhang

arXiv:2605.13681·cs.LG·May 14, 2026

Sampling from Flow Language Models via Marginal-Conditioned Bridges

Iskander Azangulov, Leo Zhang

PDF

1 Repo

TL;DR

This paper introduces a novel sampling method for Flow Language Models that improves quality and diversity by using posterior-predictive sampling with a principled, training-free approach, and provides theoretical analysis of its properties.

Contribution

The paper proposes a posterior-predictive sampling method for FLMs that preserves token marginals, improves sampling quality, and offers a theoretical comparison to existing methods.

Findings

01

Posterior-predictive sampling improves quality-diversity tradeoff.

02

The method preserves token-wise posterior marginals.

03

Theoretical analysis shows the method's error bounds and advantages.

Abstract

Flow Language Models (FLMs) are a recently introduced class of language models which adapt continuous flow matching for one-hot encoded token sequences. Their denoisers have a special structure absent from generic continuous diffusion models: each block of the denoising mean is a posterior marginal distribution over the clean token at that position. Standard DDPM-style samplers collapse these marginals to a single conditional-mean endpoint and bridge toward this simplex-valued point, which is generally not a valid one-hot sequence. We argue that the natural sampler for an FLM is instead posterior-predictive. At each reverse step, we sample a clean one-hot endpoint from the factorized posterior defined by the FLM token marginals, and then sample the next continuous state from the analytic Ornstein--Uhlenbeck bridge conditioned on that endpoint. The method is training-free, uses the same…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

imbirik/mcb
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.