PairFlow: Closed-Form Source-Target Coupling for Few-Step Generation in Discrete Flow Models
Mingue Park, Jisung Hwang, Seungwoo Yoo, Kyeongmin Yeo, Minhyuk Sung

TL;DR
PairFlow is a lightweight preprocessing method that enables few-step sampling in discrete flow models without requiring a pretrained teacher, significantly reducing training overhead while maintaining or improving performance.
Contribution
We propose PairFlow, a novel closed-form source-target coupling technique for training discrete flow models efficiently without finetuning or pretrained teachers.
Findings
Achieves up to 98.3% reduction in compute compared to full training.
Matches or surpasses performance of two-stage finetuning methods.
Effective on molecular, binary, and RGB image data.
Abstract
We introduce , a lightweight preprocessing step for training Discrete Flow Models (DFMs) to achieve few-step sampling without requiring a pretrained teacher. DFMs have recently emerged as a new class of generative models for discrete data, offering strong performance. However, they suffer from slow sampling due to their iterative nature. Existing acceleration methods largely depend on finetuning, which introduces substantial additional training overhead. addresses this issue with a lightweight preprocessing step. Inspired by ReFlow and its extension to DFMs, we train DFMs from coupled samples of source and target distributions, without requiring any pretrained teacher. At the core of our approach is a closed-form inversion for DFMs, which allows efficient construction of paired source-target samples. Despite its extremely low cost, taking only up…
Peer Reviews
Decision·ICLR 2026 Poster
- Starting from real data, back to noise. This is just a little bit change of the reflow. But I think it is a cool idea. Intuitively, if your reflow model is not trained or sampled well, then the quality of your sampled-images maybe much worse than that of real data. But starting from clean image, you can avoid this. - this paper has some smart ideas to get the theoretical results, for example, how to get the closed-form formula of forward velocity (A.1). Although the proof about the closed-form
The main weakness of this paper is dued to its proof of the closed-form formular of the backward velocity. - line 770 ~ line 792, has so many typos. Those typos make the proof unreadable, though although I can grasp the approach the authors intended to use. - I do some calculations, and find some part results are right, but the proof process is flawed. There are numerous algebraic mistakes (e.g., signs flipped, missing ±1 terms, and other careless errors) that makes people suspect the correctne
The approach is mathematically interesting, and offers significant practical utility, as it eliminates the need for a pre-trained teacher for distillation while accelerating inference through few-step sampling in discrete settings.
Deriving the closed-form velocity field although possible requires summing over all the training data, which can become significant for large systems. Further, it is possible that using the analytical vector field to determine source-target pairs leads to a distilled model that overfits to the data due to the way the analytical field is obtained. It would be useful to demonstrate that this in fact does not occur. There is some evidence to suggest that this may be happening (novelty of molecules
1. The paper is well-motivated and easy to follow. 2. The derivation of the closed-form velocity field of DFM is a significant theoretical contribution. 3. The proposed method has a huge improvement in computation complexity.
1. The continuous flow experiments in Appendix E.3 suggest that the advantage of closed-form pairing may diminish with increasing data dimensionality. A brief discussion and further experiments on the scalability of the discrete PairFlow method to very large vocabularies and sequence lengths (Image datasets of higher resolution or language modeling) would be beneficial for setting expectations for future applications. 2. Typos in Line 127-128: the codomain of $p_t(\cdot)$ and $v_t(\cdot)$.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Materials Science · Advanced Neural Network Applications
