TL;DR
This paper introduces a novel sampling method for Flow Language Models that improves quality and diversity by using posterior-predictive sampling with a principled, training-free approach, and provides theoretical analysis of its properties.
Contribution
The paper proposes a posterior-predictive sampling method for FLMs that preserves token marginals, improves sampling quality, and offers a theoretical comparison to existing methods.
Findings
Posterior-predictive sampling improves quality-diversity tradeoff.
The method preserves token-wise posterior marginals.
Theoretical analysis shows the method's error bounds and advantages.
Abstract
Flow Language Models (FLMs) are a recently introduced class of language models which adapt continuous flow matching for one-hot encoded token sequences. Their denoisers have a special structure absent from generic continuous diffusion models: each block of the denoising mean is a posterior marginal distribution over the clean token at that position. Standard DDPM-style samplers collapse these marginals to a single conditional-mean endpoint and bridge toward this simplex-valued point, which is generally not a valid one-hot sequence. We argue that the natural sampler for an FLM is instead posterior-predictive. At each reverse step, we sample a clean one-hot endpoint from the factorized posterior defined by the FLM token marginals, and then sample the next continuous state from the analytic Ornstein--Uhlenbeck bridge conditioned on that endpoint. The method is training-free, uses the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
