FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

Runzhe Zhang; Letian Chen; Wenpeng Zhang; Zhouhan Lin; Peilin Zhao

arXiv:2605.20199·cs.CL·May 21, 2026

FlowLM: Few-Step Language Modeling via Diffusion-to-Flow Adaptation

Runzhe Zhang, Letian Chen, Wenpeng Zhang, Zhouhan Lin, Peilin Zhao

PDF

TL;DR

FlowLM is a novel language model that transforms pre-trained diffusion models into efficient flow-based models, enabling high-quality few-step text generation with minimal training.

Contribution

We introduce FlowLM, a method to convert diffusion language models into flow models via fine-tuning, achieving superior few-step generation performance with fewer training epochs.

Findings

01

FlowLM rivals 2000-step diffusion sampling quality with fewer steps.

02

Fine-tuning FlowLM saturates performance in half the epochs of training from scratch.

03

Using clean data prediction improves the flow matching training objective.

Abstract

We present FlowLM, a flow matching language model transformed from pre-trained diffusion language models via efficient fine-tuning. By re-aligning the curved sampling trajectories of diffusion models into straight-line flows, FlowLM enables high quality few-step generation that rivals or even outperforms the quality of 2,000-step diffusion sampling with very few training epochs. Remarkably, finetuned FlowLM reaches performance saturation with only half as many training epochs as training from scratch, both approaches greatly outperforming the original diffusion model, thereby validating our method. Furthermore, we validate a more effective training objective for flow matching: predicting clean data to consistently guide the sampling process towards the true data distribution. Empirical results demonstrate that our approach is highly effective for high-quality, few-step text generation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.